
Research Scientist, AI Controls and Monitoring

Designs methods, systems, and experiments for AI controls and monitoring to ensure alignment in high-stakes environments, including real-time tracking, fail-safes, and red-team simulations. Requires 3+ years ML experience, published research in generative AI, and strong prototyping skills.

197k – 247k | San Francisco, CA | New York, NY | AI Research | Hybrid | 3+ YOE

About the role

Responsibilities

  • Develop monitoring techniques and observability methods that track AI behavior in real time to identify and flag deviations, emergent capabilities, or anomalous outputs.
  • Research mechanisms for layered control, including fail-safes, oversight protocols, and intervention methods that can halt or redirect AI systems when risks are detected.
  • Design red-team simulations to probe weaknesses in oversight and control mechanisms, and build mitigations to close identified gaps.
  • Collaborate with policymakers, engineers, and other researchers to establish standards and benchmarks for AI monitoring and escalation.

Requirements

  • Commitment to promoting safe, secure, and trustworthy AI deployments.
  • Practical experience conducting collaborative technical research, designing control and monitoring experiments for AI systems, and turning research ideas into working prototypes.
  • Track record of published research in machine learning, particularly in generative AI.
  • At least three years of experience addressing sophisticated ML problems in research or product development.
  • Strong written and verbal communication skills for cross-functional teams.

Nice to Have

  • Experience with runtime monitoring, anomaly detection, or observability for ML systems.
  • Familiarity with AI control or alignment research (e.g., scalable oversight, interpretability, debate).
  • Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches.

Compensation

Base salary range: $197,400 - $246,750 USD (San Francisco, New York, Seattle), plus equity and benefits including health coverage, retirement, learning stipend, PTO, and commuter stipend.

Skills

Machine Learning | Generative AI | RLHF | DPO | Anomaly Detection | Runtime Monitoring | Scalable Oversight | Interpretability | Red-Teaming | AI Alignment

Similar roles

AI Research jobs

Research Scientist, Frontier Risk Evaluations

Designs evaluation measures, harnesses, and datasets to assess risks from frontier AI systems, including dangerous capabilities testing. Collaborates with agencies and publishes methodologies for policymakers. Requires 3+ years ML experience and publications in generative AI.

197k – 247k | San Francisco, CA +2 | AI Research | On-site | 3+ YOE | LLMs | AI Safety

Research Scientist, Agent Robustness

Focuses on agent robustness, developing tests, exploits, and mitigations for safe AI agents. Requires 3+ years ML experience, familiarity with RL techniques such as RLHF and DPO, and published research in generative AI.

197k – 247k | San Francisco, CA +1 | AI Research | Hybrid | 3+ YOE | DPO | RLHF

Lead Quantum Device Theorist

Leads theoretical modeling of superconducting quantum processors, focusing on noise sources, gate operations, and error correction to enhance qubit performance. Requires PhD in Physics or related field with 5+ years experience in circuit QED and quantum simulations.

195k – 225k | Berkeley, CA +1 | AI Research | On-site | 5+ YOE | Stim | QuTiP

Research Scientist

Leads original research in action-conditioned world models, physical AI, and generative modeling for embodied systems. Requires PhD in ML/CS/Robotics with top publications and expertise in generative models and large-scale training.

200k – 325k | San Francisco, CA | AI Research | On-site | DPO | RLHF

AI Researcher, Core ML (Turbo)

Develops efficient inference engines and RL/post-training pipelines for production-scale LLMs, optimizing algorithms, systems, and performance across the stack. Requires 3+ years in ML systems/RL/inference and advanced degree.

200k – 280k | San Francisco, CA | AI Research | On-site | 3+ YOE | DPO | vLLM