ML Engineer, Generative Video

175k – 275kNew York, NYOnsite2+ YOEJun 11

Summary

Build and scale video generation models at an AI-native video platform. Focus on training, inference optimization, and productionizing large-scale multimodal models.

About the role

Responsibilities

Train and optimize large-scale video and multimodal models
Improve efficiency across training and inference (memory, latency, cost)
Implement techniques such as distillation, quantization, and pruning to aggressively accelerate diffusion and autoregressive generation
Build and maintain distributed training systems
Optimize GPU utilization, parallelism, and throughput
Develop tooling for experimentation, evaluation, and debugging
Translate research models into robust, production-ready systems
Monitor and improve model performance in real-world usage

Requirements

BS/MS/PhD in CS, ML, or related field
2+ years of professional industry experience
Strong experience in deep learning systems and infrastructure
Expertise in PyTorch, CUDA, Triton, and distributed training (FSDP, etc.)
Experience scaling and optimizing large models under low-latency inference constraints
Strong debugging and performance profiling skills
Ability to move quickly from prototype to production

Benefits

Comprehensive medical, dental, and vision plans
401K with employer match
Commuter Benefits
Catered lunch multiple days per week
Dinner stipend every night if you're working late
Grubhub subscription
Health & Wellness Perks
Multiple team offsites per year with team events every month
Generous PTO policy

Skills

PyTorchCUDATritonFSDPDistributed TrainingDeep LearningModel OptimizationQuantizationDistillationGPU Optimization

Similar roles at this salary range

All ML Engineering jobs →

Notable

Jun 12

AI Platform Engineer

Design, build, and maintain LLM integrations powering AI features. Own end-to-end delivery from requirements through production monitoring with focus on scalability and reliability.

170k – 205kSan Mateo, CAML EngineeringHybrid5+ YOEGKEHelm

Hinge Health

Jun 12

Staff Machine Learning Scientist

Own ML systems for send-time optimization, propensity modeling, and nudge decisions at consumer scale. Set experimentation standards and mentor a small ML team.

205k – 307kSan Francisco, CAML EngineeringHybrid7+ YOESQLdbt

Docker

Jun 12

Staff ML Engineer

Founding Staff ML Engineer building production ML systems for governance, security, and agentic platform capabilities at Docker. Owns architecture, data pipelines, evaluation, and model lifecycle while mentoring the growing team.

205k – 330kPalo Alto, CA +1ML EngineeringRemote8+ YOELLMsRetrieval

Nuance Labs

Jun 11

Member of Technical Staff - Research Fellow

3-month research fellowship for early-career researchers working on frontier Multimodal LLMs, generative modeling, and real-time audiovisual AI. Own a research problem in pretraining, post-training, RL, evaluation, or multimodal modeling. Strong PyTorch and first-author tier-1 paper required.

200k – 250kSeattle, WAML EngineeringOn-sitePyTorchDeep Learning

Snowflake

Jun 11

Senior Software Engineer — LLM Post-Training Platform

Build and scale Snowflake's Cortex Training LLM post-training platform, handling distributed GPU scheduling, orchestration, and productionizing research for enterprise-scale model adaptation.

200k – 288kBellevue, WAML EngineeringOn-site5+ YOERayFSDP

Apply