Software Engineer, ML Performance Optimization

192k – 257k | Foster City, CA | Onsite | 4+ YOE

Summary

Drive ML performance optimization initiatives to make autonomous driving models faster and more efficient using distributed training, quantization, distillation, and profiling tools.

About the role

Responsibilities

  • Design, implement, and operate cutting-edge ML training or inference performance optimization techniques to scale VLM, VLA, and foundation models and deploy them efficiently in robotaxis.
  • Collaborate closely with cross-functional teams, including ML researchers, software engineers, data engineers, and hardware engineers, to define requirements and align on architectural decisions.

Requirements

  • 4+ years of total experience, including 2+ years of working on large-scale model training or inference platforms.
  • Experience with training frameworks like PyTorch, leveraging GPUs efficiently for distributed model training.
  • Experience with GPU-accelerated inference using TensorRT or similar frameworks.
  • Experience using profiling tools like NVIDIA's Nsight or PyTorch's Profiler for identifying model training and serving bottlenecks.
  • Proficient in Python or C++.

Nice-to-Haves

  • Experience with distributed training techniques, quantization, distillation, and pruning.
  • Experience working with state-of-the-art (SOTA) accelerators and inference optimization frameworks.

Skills

PyTorch, TensorRT, Python, C++, NVIDIA Nsight, PyTorch Profiler, Distributed Training, Quantization, Distillation, Pruning