# Machine Learning Research Engineer, Agents - Enterprise GenAI
**Company:** [Scale AI](https://hotfix.jobs/companies/scale-ai)
**Location:** San Francisco, CA, New York, NY, Seattle, WA
**Salary:** $218K-$273K
**Experience:** 1+ years
**Skills:** LLMs, RLHF, Rlvr, Ppo, Grpo, Reinforcement Learning, Multi-Agent Systems, Agent Rl Training, Post-Training Algorithms
**Posted:** 2026-02-12
> Develops and deploys state-of-the-art ML models and agents for enterprise GenAI using RL training and post-training algorithms. Requires 1-3 years LLM production experience, RLHF expertise, recent top publications, and advanced CS degree.
## Job Description
## Responsibilities
- Train state-of-the-art models (internal and community-developed) for enterprise deployment.
- Research and integrate cutting-edge algorithms into the training stack.
- Build agents using proprietary algorithms to optimize datasets, including tools, multi-agent systems, and complex rewards.

## Requirements
- 1-3 years building LLMs in production environments.
- Experience with post-training methods (RLHF/RLVR, PPO/GRPO).
- Publications in top conferences (NeurIPS, ICLR, ICML) within last 2 years.
- PhD or Master's in Computer Science or related field.
**Apply:** https://hotfix.jobs/jobs/machine-learning-research-engineer-agents-enterprise-genai-at-scale-ai-86aa8353-26ac-4335-b8ac-c8772791d98e
**Canonical:** https://hotfix.jobs/jobs/machine-learning-research-engineer-agents-enterprise-genai-at-scale-ai-86aa8353-26ac-4335-b8ac-c8772791d98e