Skip to content

Research Engineer / Research Scientist, Post-Training

Research and develop improvements to pre-trained models for deployment in ChatGPT and API using reinforcement learning and product-driven approaches. Requires strong ML engineering, research experience with novel models, and ability to debug large codebases.

295k – 555kSan Francisco, CAAI ResearchHybrid

About the role

In this role, you will:

  • Own and pursue a research agenda to improve model capability and performance.
  • Collaborate closely with the other research and product teams, allowing customers to optimize their own models.
  • Build robust evaluations for tracking modeling improvements.
  • Design, implement, test, and debug code across our research stack.

You might thrive in this role if you:

  • Have a deep understanding of machine learning and machine learning applications.
  • Have a working knowledge of relevant models, and building evaluations for model capability improvement.
  • Are comfortable diving into a large ML codebase to debug.
  • Thrive in a dynamic and technically complex environment.

Skills

Machine LearningReinforcement LearningPyTorchTensorFlowModel EvaluationMl EngineeringResearchPythonDeep LearningLLMs

Similar roles

AI Research jobs

Researcher, Misalignment Research

Designs worst-case demonstrations and adversarial evaluations to uncover AGI misalignment risks like deception and power-seeking. Builds automated stress-testing infrastructure and researches alignment failure modes to inform OpenAI's safety strategy. Requires 4+ years in AI red-teaming or adversarial ML.

295k – 445kSan Francisco, CAAI ResearchOn-site4+ YOELLMsAi Safety

Researcher, Loss of Control

Designs and implements mitigation stacks to prevent loss of control risks in frontier AI models, including prevention, monitoring, detection, and enforcement. Requires expertise in deep learning, transformers, PyTorch/TensorFlow, and AI safety research.

295k – 445kSan Francisco, CAAI ResearchOn-siteLLMsPyTorch

Researcher, Synthetic RL

Develops novel reinforcement learning techniques using synthetic environments and feedback to enhance large-scale AI models. Designs experiments, analyzes dynamics, and integrates research into production systems; requires strong RL/ML background and engineering skills.

295k – 445kSan Francisco, CAAI ResearchHybridPythonResearch

Researcher, Pretraining Safety

Develop techniques to predict and mitigate unsafe behaviors in early-stage base models, design safer pretraining architectures, and integrate safety signals throughout training. Collaborate across safety teams to build robust, scalable safety foundations grounded in real-world risks.

295k – 445kSan Francisco, CAAI ResearchOn-siteJAXLLMs

Research Engineer, Codex

Advances AI coding models through research, experimentation, and system optimization on the Codex team. Collaborates to improve code generation, reasoning, and performance for real-world deployment.

295k – 445kSan Francisco, CAAI ResearchHybridLLMsPython