ML Engineer, Agentic Systems
Build and extend agentic LLM systems for multimodal creative tasks, focusing on video understanding, reasoning, and structured generation. Requires strong production ML experience and deep LLM expertise.
Responsibilities
- Design and build end-to-end agentic systems for creative tasks
- Develop novel approaches for training and adapting the large language models that power these agents
- Design new objectives, datasets, and fine-tuning strategies to improve agent behavior and reliability
- Explore multimodal reasoning and structured generation for creative control
- Run systematic experiments to evaluate and improve agent performance in real-world tasks
- Design evaluation frameworks for agentic workflows in video analysis and editing
- Analyze failure modes across the full agent loop (planning, tool use, execution) and iterate on improvements
Requirements
- BS/MS/PhD in CS, ML, or related field
- Strong track record building production ML systems or agentic pipelines
- Deep understanding of transformers and modern LLM techniques
- Experience with fine-tuning, alignment, or post-training methods, especially for adapting models to generate structured outputs or drive tool use
- Comfort owning the full stack, from model-level experiments to deployed agent systems
- Strong experimental rigor and good taste for what makes agents actually work in practice
Benefits
- Comprehensive medical, dental, and vision plans
- 401K with employer match
- Commuter Benefits
- Catered lunch multiple days per week
- Dinner stipend every night if you're working late
- Grubhub subscription
- Health & Wellness Perks
- Multiple team offsites per year with team events every month
- Generous PTO policy
Staff ML Engineer
Founding Staff ML Engineer building production ML systems for governance, security, and agentic platform capabilities at Docker. Owns architecture, data pipelines, evaluation, and model lifecycle while mentoring the growing team.
Member of Technical Staff - Research Fellow
3-month research fellowship for early-career researchers working on frontier Multimodal LLMs, generative modeling, and real-time audiovisual AI. Own a research problem in pretraining, post-training, RL, evaluation, or multimodal modeling. Strong PyTorch and first-author tier-1 paper required.
Senior Software Engineer — LLM Post-Training Platform
Build and scale Snowflake's Cortex Training LLM post-training platform, handling distributed GPU scheduling, orchestration, and productionizing research for enterprise-scale model adaptation.