Software Engineer, Agents
Design and build agentic systems for AI-native video creation, integrating LLMs and evaluation frameworks to power creative workflows. Requires 5+ years building ML/agentic systems in production.
Responsibilities
- Build and ship end-to-end agentic systems that meaningfully improve creative workflows, balancing quality, performance, and reliability
- Design agent architectures — how agents gather context, plan, select tools, and execute creative tasks reliably at scale
- Integrate state-of-the-art models, combining internal research and external capabilities to power new agent experiences
- Measure and improve agent quality in production, using experimentation, evaluation frameworks, and real-world feedback to guide iteration
Requirements
- 5+ years of professional industry experience
- A track record of designing and developing ML systems or agentic pipelines in production
- Deep experience with context engineering: RAG systems, token optimization, context management at scale
- Experience building evaluation systems and agentic infrastructure (context gathering, tool selection, multi-step planning)
- Exceptional problem solving fundamentals and ability to learn quickly in a rapidly evolving space
- Able to operate effectively in an extremely fast-paced environment and scope and deliver projects end-to-end
Nice-to-Haves
- Experience with multi-agent architectures
- Experience fine-tuning LLMs for specific use cases
- A pulse on where generative media and agentic AI technology is going
- Grounded, collaborative, and willing to do whatever it takes to help the team win
- Startup founder or early engineer at one
Benefits
- Comprehensive medical, dental, and vision plans
- 401K with employer match
- Commuter Benefits
- Catered lunch multiple days per week
- Dinner stipend every night if you're working late
- Grubhub subscription
- Health & Wellness Perks
- Multiple team offsites per year with team events every month
- Generous PTO policy
Staff ML Engineer
Founding Staff ML Engineer building production ML systems for governance, security, and agentic platform capabilities at Docker. Owns architecture, data pipelines, evaluation, and model lifecycle while mentoring the growing team.
Member of Technical Staff - Research Fellow
3-month research fellowship for early-career researchers working on frontier Multimodal LLMs, generative modeling, and real-time audiovisual AI. Own a research problem in pretraining, post-training, RL, evaluation, or multimodal modeling. Strong PyTorch and first-author tier-1 paper required.
Senior Software Engineer — LLM Post-Training Platform
Build and scale Snowflake's Cortex Training LLM post-training platform, handling distributed GPU scheduling, orchestration, and productionizing research for enterprise-scale model adaptation.