Researcher, Synthetic RL

Develops novel reinforcement learning techniques using synthetic environments and feedback to enhance large-scale AI models. Designs experiments, analyzes dynamics, and integrates research into production systems; requires strong RL/ML background and engineering skills.

295k – 445k · San Francisco, CA · AI Research · Hybrid

About the role

In this role, you will:

  • Research and develop reinforcement learning algorithms
  • Design and run experiments to study training dynamics and model behavior at scale
  • Collaborate with engineers and researchers to integrate successful approaches into model training pipelines

You might thrive in this role if you:

  • Have a strong background in reinforcement learning, machine learning research, or related fields
  • Have strong engineering and statistical analysis skills
  • Enjoy exploring new problem spaces where data, objectives, and evaluation are imperfect or evolving
  • Are motivated by seeing research ideas influence real-world AI systems

Skills

Reinforcement Learning, Machine Learning, Python, Statistical Analysis, Experiment Design, Synthetic Data, Self-Play, Simulators, AI Training Pipelines, Research

Similar roles

AI Research jobs

Researcher, Misalignment Research

Designs worst-case demonstrations and adversarial evaluations to uncover AGI misalignment risks like deception and power-seeking. Builds automated stress-testing infrastructure and researches alignment failure modes to inform OpenAI's safety strategy. Requires 4+ years in AI red-teaming or adversarial ML.

295k – 445k · San Francisco, CA · AI Research · On-site · 4+ YOE · LLMs · AI Safety

Researcher, Loss of Control

Designs and implements mitigation stacks to prevent loss of control risks in frontier AI models, including prevention, monitoring, detection, and enforcement. Requires expertise in deep learning, transformers, PyTorch/TensorFlow, and AI safety research.

295k – 445k · San Francisco, CA · AI Research · On-site · LLMs · PyTorch

Research Engineer / Research Scientist, Post-Training

Research and develop improvements to pre-trained models for deployment in ChatGPT and API using reinforcement learning and product-driven approaches. Requires strong ML engineering, research experience with novel models, and ability to debug large codebases.

295k – 555k · San Francisco, CA · AI Research · Hybrid · LLMs · Python

Researcher, Pretraining Safety

Develop techniques to predict and mitigate unsafe behaviors in early-stage base models, design safer pretraining architectures, and integrate safety signals throughout training. Collaborate across safety teams to build robust, scalable safety foundations grounded in real-world risks.

295k – 445k · San Francisco, CA · AI Research · On-site · JAX · LLMs

Research Engineer, Codex

Advances AI coding models through research, experimentation, and system optimization on the Codex team. Collaborates to improve code generation, reasoning, and performance for real-world deployment.

295k – 445k · San Francisco, CA · AI Research · Hybrid · LLMs · Python