# Researcher, Loss of Control
**Company:** [OpenAI](https://hotfix.jobs/companies/openai)
**Location:** San Francisco, CA
**Salary:** $295K-$445K
**Skills:** PyTorch, TensorFlow, Deep Learning, Transformer Models, LLMs, Supervised Fine-Tuning, Policy Optimization, Data Structures, Algorithms, Software Engineering
**Posted:** 2026-03-10
> Designs and implements mitigation stacks to prevent loss of control risks in frontier AI models, including prevention, monitoring, detection, and enforcement. Requires expertise in deep learning, transformers, PyTorch/TensorFlow, and AI safety research.
## Job Description
## In this role, you will:
- Design and implement mitigation components for loss of control risk—spanning prevention, monitoring, detection, containment, and enforcement—under the guidance of senior technical and risk leadership.
- Integrate safeguards across product and research surfaces in partnership with product, engineering, and research teams, helping ensure protections are consistent, low-latency, and resilient as usage and model autonomy increase.
- Evaluate technical trade-offs within the loss of control domain (coverage, robustness, latency, model utility, and operational complexity) and propose pragmatic, testable solutions.
- Collaborate closely with risk modeling, evaluations, and policy partners to align mitigation design with anticipated failure modes and high-severity threat scenarios, including deceptive alignment, hidden subgoals, reward hacking, and attempts to evade oversight.
- Execute rigorous testing and red-teaming workflows, helping stress-test the mitigation stack against increasingly capable and potentially subversive model behaviors—such as sandbagging, monitor evasion, exploit-seeking, unsafe tool use, or strategic deception—and iterate based on findings.

## You might thrive in this role if you:
- Have a passion for AI safety and are motivated to make cutting-edge AI models safer for real-world use.
- Bring demonstrated experience in deep learning and transformer models.
- Are proficient with frameworks such as **PyTorch** or **TensorFlow**.
- Possess a strong foundation in data structures, algorithms, and software engineering principles.
- Are familiar with methods for training and fine-tuning large language models, including distillation, supervised fine-tuning, and policy optimization.
- Excel at working collaboratively with cross-functional teams across research, policy, product, and engineering.
- Have significant experience designing and evaluating technical safeguards, control mechanisms, or monitoring systems for advanced AI behavior.
- (Nice to have) Bring background knowledge in alignment, control, interpretability, robustness, adversarial ML, or related fields.
**Apply:** https://hotfix.jobs/jobs/researcher-loss-of-control-at-openai-c0b2fb0d-2ac2-4d91-b893-9e38f4cfccc7
**Canonical:** https://hotfix.jobs/jobs/researcher-loss-of-control-at-openai-c0b2fb0d-2ac2-4d91-b893-9e38f4cfccc7