AI Researcher

Advances AI products through post-training SOTA LLMs using supervised and reinforcement learning techniques on rich query datasets. Owns data pipelines, training frameworks, and model integration while collaborating across teams. Requires 2-6+ years in large-scale LLMs and Python/PyTorch expertise; PhD preferred.

220k – 485kSan Francisco, CAPalo Alto, CAAI ResearchOnsite2+ YOE

Apply

About the role

Responsibilities

Research & Development

Post-train SOTA LLMs using the latest supervised and reinforcement learning techniques (SFT/DPO/GRPO)
Leverage our rich query/answer dataset to scale model performance across Sonar, Deep Research, Comet, and Search products
Stay current with the latest LLM research, especially in model training, optimization, and personalization techniques
Implement preference optimization and personalization capabilities to enhance user experience
Invent in-house improvements and optimizations to enhance SOTA models
Turn research ideas into algorithms and run experiments to launch new models

Infrastructure & Implementation

Own full-stack data, training, and evaluation pipelines required for model development
Build robust and effective training frameworks (on top of Megatron/PyTorch) for post-training LLMs
Implement necessary infrastructure and components to support cutting-edge model training at scale
Integrate models seamlessly into our product ecosystem

Collaboration

Work closely with engineering teams to integrate models into Perplexity's product suite
Collaborate across teams to ensure cohesive AI experiences throughout our platform
Partner with product teams to understand user needs and translate them into model improvements

Qualifications

Required

Proven experience with large-scale LLMs and Deep Learning systems
Strong programming skills in Python/PyTorch; versatility is a plus
Experience with post-training techniques and reinforcement learning
Self-starter with a willingness to take ownership of tasks
Passion for tackling challenging problems
Minimum 2-6 years of experience on relevant projects (depending on seniority level)

Nice-to-have

PhD in Machine Learning, AI, Systems, or related areas
Experience in post-training LLMs with SFT/DPO/GRPO
C++/CUDA programming skills
Experience building LLM training frameworks
Academic publications and research impact
Experience with agent systems and multi-step reasoning
Background in personalization and preference learning

Skills

PyTorchPythonLLMsReinforcement LearningSftDpoGrpoMegatronCUDAC++

Similar roles

AI Research jobs

Scale AI

Machine Learning Research Engineer, Agents - Enterprise GenAI

Develops and deploys state-of-the-art ML models and agents for enterprise GenAI using RL training and post-training algorithms. Requires 1-3 years LLM production experience, RLHF expertise, recent top publications, and advanced CS degree.

218k – 273kSan Francisco, CA +2AI ResearchOn-site1+ YOEPpoLLMs

Scale AI

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Develops and optimizes post-training algorithms for agent RL platforms, focusing on LLM training, inference frameworks, and multi-agent systems. Requires 1-3 years production LLM experience, expertise in PyTorch/CUDA, RLHF/PPO, and advanced degree.

218k – 273kSan Francisco, CA +2AI ResearchOn-site1+ YOEPpoCUDA

Polymath

AI Research Resident

AI Research Resident collaborates on research projects developing benchmarks and environments for long-horizon AI agents, identifying model failure modes, and training autonomous agents. Requires current MS/PhD enrollment, RL experience, systems engineering, and strong publications.

200k – 200kSan Francisco, CAAI ResearchRemoteEntry levelBenchmarksFrontier Models

Nuro

ML Research Scientist, Prediction & Smart Agents

Build state-of-the-art ML models to predict traffic behavior for autonomous driving, using generative sequence modeling and controllable agents for planning and simulation. Requires PhD preferred, 2+ years deploying ML systems, and expertise in PyTorch and robotics ML.

194k – 291kMountain View, CAAI ResearchOn-site2+ YOEC++Python

Nuro

Machine Learning Research Scientist: Generative Modeling for Planning

Develops state-of-the-art generative models like diffusion and flow-matching for autonomous planning in self-driving tech. Requires PhD or MSc with 2-3 years experience in generative modeling for robotics, strong Python/C++ skills, and top research publications.

160k – 241kMountain View, CAAI ResearchOn-site2+ YOEC++LLMs