Skip to content

AI Researcher

Advances AI products through post-training SOTA LLMs using supervised and reinforcement learning techniques on rich query datasets. Owns data pipelines, training frameworks, and model integration while collaborating across teams. Requires 2-6+ years in large-scale LLMs and Python/PyTorch expertise; PhD preferred.

220k – 485kSan Francisco, CAPalo Alto, CAAI ResearchOnsite2+ YOE

About the role

Responsibilities

Research & Development

  • Post-train SOTA LLMs using the latest supervised and reinforcement learning techniques (SFT/DPO/GRPO)
  • Leverage our rich query/answer dataset to scale model performance across Sonar, Deep Research, Comet, and Search products
  • Stay current with the latest LLM research, especially in model training, optimization, and personalization techniques
  • Implement preference optimization and personalization capabilities to enhance user experience
  • Invent in-house improvements and optimizations to enhance SOTA models
  • Turn research ideas into algorithms and run experiments to launch new models

Infrastructure & Implementation

  • Own full-stack data, training, and evaluation pipelines required for model development
  • Build robust and effective training frameworks (on top of Megatron/PyTorch) for post-training LLMs
  • Implement necessary infrastructure and components to support cutting-edge model training at scale
  • Integrate models seamlessly into our product ecosystem

Collaboration

  • Work closely with engineering teams to integrate models into Perplexity's product suite
  • Collaborate across teams to ensure cohesive AI experiences throughout our platform
  • Partner with product teams to understand user needs and translate them into model improvements

Qualifications

Required

  • Proven experience with large-scale LLMs and Deep Learning systems
  • Strong programming skills in Python/PyTorch; versatility is a plus
  • Experience with post-training techniques and reinforcement learning
  • Self-starter with a willingness to take ownership of tasks
  • Passion for tackling challenging problems
  • Minimum 2-6 years of experience on relevant projects (depending on seniority level)

Nice-to-have

  • PhD in Machine Learning, AI, Systems, or related areas
  • Experience in post-training LLMs with SFT/DPO/GRPO
  • C++/CUDA programming skills
  • Experience building LLM training frameworks
  • Academic publications and research impact
  • Experience with agent systems and multi-step reasoning
  • Background in personalization and preference learning

Skills

PyTorchPythonLLMsReinforcement LearningSftDpoGrpoMegatronCUDAC++

Similar roles

AI Research jobs

Machine Learning Research Engineer, Agents - Enterprise GenAI

Develops and deploys state-of-the-art ML models and agents for enterprise GenAI using RL training and post-training algorithms. Requires 1-3 years LLM production experience, RLHF expertise, recent top publications, and advanced CS degree.

218k – 273kSan Francisco, CA +2AI ResearchOn-site1+ YOEPpoLLMs

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Develops and optimizes post-training algorithms for agent RL platforms, focusing on LLM training, inference frameworks, and multi-agent systems. Requires 1-3 years production LLM experience, expertise in PyTorch/CUDA, RLHF/PPO, and advanced degree.

218k – 273kSan Francisco, CA +2AI ResearchOn-site1+ YOEPpoCUDA

AI Research Resident

AI Research Resident collaborates on research projects developing benchmarks and environments for long-horizon AI agents, identifying model failure modes, and training autonomous agents. Requires current MS/PhD enrollment, RL experience, systems engineering, and strong publications.

200k – 200kSan Francisco, CAAI ResearchRemoteEntry levelBenchmarksFrontier Models

ML Research Scientist, Prediction & Smart Agents

Build state-of-the-art ML models to predict traffic behavior for autonomous driving, using generative sequence modeling and controllable agents for planning and simulation. Requires PhD preferred, 2+ years deploying ML systems, and expertise in PyTorch and robotics ML.

194k – 291kMountain View, CAAI ResearchOn-site2+ YOEC++Python

Machine Learning Research Scientist: Generative Modeling for Planning

Develops state-of-the-art generative models like diffusion and flow-matching for autonomous planning in self-driving tech. Requires PhD or MSc with 2-3 years experience in generative modeling for robotics, strong Python/C++ skills, and top research publications.

160k – 241kMountain View, CAAI ResearchOn-site2+ YOEC++LLMs