Software Engineer - E2E Autonomy
Builds ML tools, infrastructure, and manages large datasets for end-to-end autonomy research and productionizing self-driving software. Works with AI research and engineering teams to scale GPU compute, data, and evaluation systems. Requires strong software generalist skills across ML stack.
Responsibilities
- Build tools and infrastructure to enable large-scale E2E autonomy research led by Dr. Wei Zhan
- Work directly with both AI research and engineering teams to productionize next generation self-driving software
- Help scale end-to-end training by identifying bottlenecks, especially as we continue to scale our GPU compute, onroad data volume, curation efforts, and eval systems
Technologies: Python, Pytorch, CUDA, Bazel, Kubernetes, Spark, Flyte
Requirements
- Exceptional software generalist experience with ambition to contribute up and down the ML stack, onboard the vehicle, and in the cloud
- Strong fundamental skills in software & module design: can tame chaos with libraries & abstractions
- Rapid analytical and problem-solving skills
Nice to Have
- Strong multi-language build system knowledge
- Distributed systems/compute (e.g. k8s, cloud-managed services at scale)
- ML infra fluency - has used and/or contributed to ML and large data systems
Compensation
Base salary range: $153,000 - $222,000 USD annually
Senior Software Engineer, AI Runtime
Senior Software Engineer building and scaling Databricks' managed GPU training platform (AI Runtime) for large-scale distributed AI model training. Requires 5+ years in distributed systems and hands-on experience with GPU training frameworks.
Sr. Machine Learning Engineer, Computer Vision
Build and prototype diffusion-based text-to-image generative models (Pinterest Canvas) using large-scale visual-text datasets. Requires 5+ years industry computer vision experience and an M.S. or Ph.D.
Senior AI/ML Engineer
Senior AI/ML Engineer building transformer and deep learning models on financial and behavioral data to power personalized growth and marketing experiences at Chime. Requires strong production ML experience with PyTorch, AWS, and large-scale data infrastructure.