# AI Research Resident
**Company:** [Polymath](https://hotfix.jobs/companies/polymath)
**Location:** Remote
**Salary:** $200K-$200K
**Experience:** 0+ years
**Skills:** Reinforcement Learning, Benchmarks, Frontier Models, Model Post-Training, Systems Engineering, Production-Quality Code
**Posted:** 2026-04-14
> AI Research Resident collaborates on research projects developing benchmarks and environments for long-horizon AI agents, identifying model failure modes, and training autonomous agents. Requires current MS/PhD enrollment, RL experience, systems engineering, and strong publications.
## Job Description
## Responsibilities
- Identify failure modes in frontier models.
- Develop rigorous benchmarks that evaluate how well frontier agents perform on complex, realistic tasks requiring long-horizon reasoning and tool use in dynamic environments.
- Train autonomous agents that can reason, plan, and act over extended time horizons.

## Requirements
- Currently pursuing an MS or PhD program in Computer Science or a related field.
- Experience with reinforcement learning, benchmarking frontier models, or model post-training.
- Experience with systems engineering and ability to write production-quality code.
- Strong track record of publications.
- High agency, move quickly, and enjoy working on open-ended research problems.

## Compensation
- $200k / year prorated to the number of hours committed (full-time or part-time).
**Apply:** https://hotfix.jobs/jobs/ai-research-resident-at-polymath-f50e3e2f-5048-4fcf-b99a-3f98e6bb6efd
**Canonical:** https://hotfix.jobs/jobs/ai-research-resident-at-polymath-f50e3e2f-5048-4fcf-b99a-3f98e6bb6efd