Agent Post-Training, Frontier Evals and Environments Research
Researcher building frontier RL environments, evaluations, and training signals to steer OpenAI's largest agent training runs and measure model capabilities.
Builds large-scale infrastructure for AI scientist training, evaluation, and deployment, resolving bottlenecks in distributed systems for scientific AGI. Requires 6+ years in infrastructure engineering with expertise in ML stacks, containers, and data pipelines.
Annual Salary: $350,000 — $850,000 USD
Education requirements: At least a Bachelor's degree in a related field or equivalent experience.
Researcher building frontier RL environments, evaluations, and training signals to steer OpenAI's largest agent training runs and measure model capabilities.
Work as a fullstack applied researcher adapting multimodal video foundation models for production. Focus on controllability, personalization, and end-user quality using SFT, RL, and data-driven refinement.
Lead development of models and algorithms for real-time voice agents, advancing speech understanding, naturalness, and production deployment in conversational AI. Requires 5+ years in AI/ML with experience deploying LLMs.
Leads end-to-end research initiatives in machine learning and large language models for conversational AI in housing and healthcare. Requires PhD plus 5+ years post-PhD experience, strong ML expertise, and Python proficiency.