# Research Scientist, Agent Robustness
**Company:** [Scale AI](https://hotfix.jobs/companies/scale-ai)
**Location:** San Francisco, CA, New York, NY
**Salary:** $197K-$247K
**Experience:** 3+ years
**Skills:** RLHF, Dpo, Grpo, Swe-Bench, Webarena, Osworld, Inspect, Red-Teaming, Prompt Injection, Adversarial Testing, Generative AI, Machine Learning, Agent Evaluation, Rl Techniques
**Posted:** 2026-03-20
> Research Scientist focuses on agent robustness, developing tests, exploits, and mitigations for safe AI agents. Requires 3+ years ML experience, RL techniques like RLHF/DPO, and published research in generative AI.
## Job Description
## Responsibilities
- Research the science of AI agent capabilities with a focus on safety, risk factors, and benchmarking methodologies.
- Design and build harnesses to test AI agents’ tendency to take harmful actions when pressured or tricked.
- Design and build exploits and mitigations for failure modes arising from agent affordances like coding, web browsing, and computer use.
- Characterize and design mitigations for failure modes or risks in systems with multiple interacting AI agents.

## Requirements
- Commitment to promoting safe, secure, and trustworthy AI deployments.
- Practical experience conducting technical research collaboratively, including building agent scaffolding, designing evaluation harnesses, and prototyping research ideas.
- Experience with post-training and RL techniques such as **RLHF**, **DPO**, **GRPO**.
- Track record of published research in machine learning, particularly generative AI.
- At least **3 years** of experience addressing sophisticated ML problems.
- Strong written and verbal communication skills.

## Nice to Have
- Hands-on experience with agent evaluation frameworks such as **SWE-bench**, **WebArena**, **OSWorld**, **Inspect**.
- Experience with red-teaming, prompt injection, or adversarial testing of AI systems.
**Apply:** https://hotfix.jobs/jobs/research-scientist-agent-robustness-at-scale-ai-6ab91c5e-6084-4fcb-8280-cc605af029d2
**Canonical:** https://hotfix.jobs/jobs/research-scientist-agent-robustness-at-scale-ai-6ab91c5e-6084-4fcb-8280-cc605af029d2