Skip to content

Research Scientist - World-Action Foundation Model, Robotics

Conducts research on world-action foundation models for robotics and autonomous driving, focusing on 3D vision, multi-modal pretraining, and Gaussian splatting. Requires MSc/PhD in ML/CV, strong publication record, and expertise in Python/PyTorch.

126k – 423kSunnyvale, CAAI ResearchOnsiteEntry level

About the role

Responsibilities

  • Conduct research on pretraining world-action foundation model with various world modalities including vision and physics associated with ego actions, serving the purpose for both robot action and simulation world generation.
  • Dive into relevant topics such as vision-physics modality association, feed-forward Gaussian splatting, world foundation model, human data incorporation, language modality, spatial reasoning, deformable object modeling and simulation.
  • Explore related topics including 3D/world-action foundation model, multi-modal pretraining, feed-forward Gaussian splatting, world foundation model with applications to autonomous driving, and fundamental topics on 3D vision and generation.
  • Work closely with other Research Scientists and interns on research publications for submission to top-tier conferences.
  • Collaborate with Research Engineers and engineering teams to test and deploy algorithms to our autonomy and simulation products.

Requirements

  • Strong research record in the fields of 3D vision, reconstruction and generation for robotics and autonomous systems, with publications in top-tier conferences or journals in the fields of computer vision, machine learning, and robotics.
  • MSc or PhD in machine learning and computer vision with autonomy and robotics applications or closely-related fields.
  • Passion for next-generation, scalable autonomy and robotics for real-world systems.
  • Strong research skills and the ability to work both independently and collaboratively on projects.
  • Technical experience in: Python, PyTorch, computer vision, robotics systems, and distributed machine learning model training.

Nice to Have

  • Hands-on experience in at least one of the following fields:
    • 3D foundation model and pretraining
    • Multi-modal foundation model
    • Feed-forward Gaussian splatting and reconstruction
    • World foundation model and generation
    • 3D/multi-view end-to-end models for autonomous driving or robotics
    • Human data processing and incorporation

Compensation

  • Base salary range: $126,000 - $423,000 USD annually.
  • Equity, comprehensive health/dental/vision/life/disability insurance, 401k with employer match, learning/wellness stipends, paid time off.

Skills

PythonPyTorchComputer Vision3D VisionMachine LearningRoboticsDistributed TrainingGaussian SplattingMulti-Modal ModelsFoundation Models

Similar roles

AI Research jobs

Research Scientist - 3D Vision and Generation, Self-Driving

Conducts cutting-edge research in 3D vision, reconstruction, and generation for autonomous driving and robotics, publishes at top conferences, and deploys algorithms to production systems. Requires MSc/PhD in ML/CV, strong publication record, and expertise in Python, PyTorch, CV, and robotics.

126k – 423kSunnyvale, CAAI ResearchOn-siteEntry levelPythonPyTorch

Junior Research Scientist

Junior Research Scientist develops and refines ML architectures and predictive models for conversational AI in housing and healthcare, applying advanced math and quantitative methods. Requires PhD in math, physics, CS or related, strong ML foundation, and onsite work in San Francisco.

150k – 230kSan Francisco, CAAI ResearchOn-siteEntry levelRLLMs

Post-Doctoral Researcher

Post-doctoral researcher conducting independent and collaborative AI/ML research focused on high-impact domains like medicine, finance, and law. Requires a recent or imminent PhD and publications in top venues.

160k – 220kBellevue, WAAI ResearchHybridEntry levelJAXRAG

Research Scientist – Tabular & Structured Machine Learning

Conduct research and build foundational ML models for structured and tabular data, combining statistical learning theory, probabilistic modeling, and large-scale systems. Requires a PhD and strong experience in tabular/relational ML.

160k – 250kSan Francisco, CAAI ResearchOn-siteEntry levelJAXRust

Machine Learning Research Scientist: Generative Modeling for Planning

Develops state-of-the-art generative models like diffusion and flow-matching for autonomous planning in self-driving tech. Requires PhD or MSc with 2-3 years experience in generative modeling for robotics, strong Python/C++ skills, and top research publications.

160k – 241kMountain View, CAAI ResearchOn-site2+ YOEC++LLMs