Machine Learning Engineer - Perception Offline Driving Intelligence
Develop and fine-tune multimodal large language models for offline analysis to improve robotaxi environmental understanding and safety. Collaborate across teams to integrate solutions into production autonomous systems.
Responsibilities
- Develop multimodal large language models that enhance our robotaxis' understanding of complex urban environments
- Implement model architectures and sophisticated training techniques
- Build large, high-quality datasets that draw on every input from our sensor stack and the large-scale data available at Zoox
- Drive end-to-end ML solutions from research to production, utilizing Zoox's extensive data pipelines and infrastructure to improve autonomous driving capabilities
- Collaborate with perception, planning, safety, and systems teams to integrate your models into the vehicle's decision-making pipeline
- Validate and optimize your solutions using real-world driving scenarios, directly contributing to the safety and reliability of Zoox's autonomous system
Qualifications
- MS or PhD in Computer Science, Machine Learning, or a related technical field
- Demonstrated experience training and deploying large language models (LLMs)
- Experience building and maintaining ML training pipelines, including data preprocessing, model training, and evaluation
- Proficiency in Python and ML libraries (PyTorch, NumPy) demonstrated through professional or research projects
- Experience training models on large-scale data
Bonus Qualifications
- Publications in top-tier conferences (CVPR, ICCV, RSS, ICRA)
- Experience with autonomous robotics systems
Staff Software Engineer, AI Runtime
Staff Software Engineer building and scaling Databricks' managed large-scale GPU training platform (AIR). Focus on distributed training performance, scheduling, fault tolerance, and developer experience for thousands of accelerators.
Senior Software Engineer, AI Runtime
Senior Software Engineer building and scaling Databricks' managed GPU training platform (AI Runtime) for large-scale distributed AI model training. Requires 5+ years in distributed systems and hands-on experience with GPU training frameworks.
Sr. Machine Learning Engineer, Computer Vision
Build and prototype diffusion-based text-to-image generative models (Pinterest Canvas) using large-scale visual-text datasets. Requires 5+ years of industry computer vision experience and an M.S. or Ph.D.
Senior AI/ML Engineer
Senior AI/ML Engineer building transformer and deep learning models on financial and behavioral data to power personalized growth and marketing experiences at Chime. Requires strong production ML experience with PyTorch, AWS, and large-scale data infrastructure.