# Member of Technical Staff - Research
**Company:** [Polymath](https://hotfix.jobs/companies/polymath)
**Location:** San Francisco, CA
**Salary:** $200K-$350K
**Skills:** Reinforcement Learning, AI Agents, Simulation Environments, Post-Training, Frontier Models, Long-Horizon Reasoning, Benchmarks, Python, Machine Learning, Research Publication
**Posted:** 2026-04-14
> Conducts applied research on long-horizon autonomous AI agents, focusing on evaluation, post-training, environment design, and benchmarks to improve frontier models. Builds simulations, runs experiments, ships production code, and publishes findings.
## Job Description
## Responsibilities
- Advance the frontier of autonomous agents through core research in long-horizon evaluation, agent post-training, and environment design.
- Understand where current models fail and how to improve them.
- Build benchmarks, create environments, write production code, and run rigorous experiments.
- Develop advanced environment simulation engines for training & evaluating autonomous AI agents.
- Investigate failure modes of frontier models.
- Create rigorous benchmarks for complex, realistic tasks requiring long-horizon reasoning and tool use in dynamic environments.
- Post-training agents in complex simulation environments.
- Publish research.

## Requirements
- Strong engineering & research fundamentals and prolific user of AI tools.
- Experience post-training frontier models.
- Experience shipping reliable, production-quality code.
- Track record of publications.

## Perks
- Comprehensive health, dental, and vision insurance.
- 401(k).
- Unlimited PTO.
- Free meals with the team.
- Wellness stipend & learning stipend.
- Top of the line tech.
- Frequent team activities and outings.
**Apply:** https://hotfix.jobs/jobs/member-of-technical-staff-research-at-polymath-ea4a552a-3a64-4bc8-a541-0f50869befb5
**Canonical:** https://hotfix.jobs/jobs/member-of-technical-staff-research-at-polymath-ea4a552a-3a64-4bc8-a541-0f50869befb5