Skip to content

ML Engineer, Agentic Systems

175k – 275kNew York, NYOnsite
Summary

Build and extend agentic LLM systems for multimodal creative tasks, focusing on video understanding, reasoning, and structured generation. Requires strong production ML experience and deep LLM expertise.

About the role

Responsibilities

  • Design and build end-to-end agentic systems for creative tasks
  • Develop novel approaches for training and adapting the large language models that power these agents
  • Design new objectives, datasets, and fine-tuning strategies to improve agent behavior and reliability
  • Explore multimodal reasoning and structured generation for creative control
  • Run systematic experiments to evaluate and improve agent performance in real-world tasks
  • Design evaluation frameworks for agentic workflows in video analysis and editing
  • Analyze failure modes across the full agent loop (planning, tool use, execution) and iterate on improvements

Requirements

  • BS/MS/PhD in CS, ML, or related field
  • Strong track record building production ML systems or agentic pipelines
  • Deep understanding of transformers and modern LLM techniques
  • Experience with fine-tuning, alignment, or post-training methods, especially for adapting models to generate structured outputs or drive tool use
  • Comfort owning the full stack, from model-level experiments to deployed agent systems
  • Strong experimental rigor and good taste for what makes agents actually work in practice

Benefits

  • Comprehensive medical, dental, and vision plans
  • 401K with employer match
  • Commuter Benefits
  • Catered lunch multiple days per week
  • Dinner stipend every night if you're working late
  • Grubhub subscription
  • Health & Wellness Perks
  • Multiple team offsites per year with team events every month
  • Generous PTO policy
Skills
TransformersLLMsFine-tuningMultimodal modelsAgentic systemsStructured generationModel evaluationPythonPyTorchTensorFlow
Similar roles at this salary range
All ML Engineering jobs →
Notable

AI Platform Engineer

Design, build, and maintain LLM integrations powering AI features. Own end-to-end delivery from requirements through production monitoring with focus on scalability and reliability.

170k – 205kSan Mateo, CAML EngineeringHybrid5+ YOEGKEHelm
Hinge Health

Staff Machine Learning Scientist

Own ML systems for send-time optimization, propensity modeling, and nudge decisions at consumer scale. Set experimentation standards and mentor a small ML team.

205k – 307kSan Francisco, CAML EngineeringHybrid7+ YOESQLdbt
Docker

Staff ML Engineer

Founding Staff ML Engineer building production ML systems for governance, security, and agentic platform capabilities at Docker. Owns architecture, data pipelines, evaluation, and model lifecycle while mentoring the growing team.

205k – 330kPalo Alto, CA +1ML EngineeringRemote8+ YOELLMsRetrieval
Nuance Labs

Member of Technical Staff - Research Fellow

3-month research fellowship for early-career researchers working on frontier Multimodal LLMs, generative modeling, and real-time audiovisual AI. Own a research problem in pretraining, post-training, RL, evaluation, or multimodal modeling. Strong PyTorch and first-author tier-1 paper required.

200k – 250kSeattle, WAML EngineeringOn-sitePyTorchDeep Learning
Snowflake

Senior Software Engineer — LLM Post-Training Platform

Build and scale Snowflake's Cortex Training LLM post-training platform, handling distributed GPU scheduling, orchestration, and productionizing research for enterprise-scale model adaptation.

200k – 288kBellevue, WAML EngineeringOn-site5+ YOERayFSDP