Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI

Develops a next-generation Agent RL training platform for enterprise GenAI, integrating cutting-edge research to train state-of-the-art models for complex use cases. Requires 5+ years of LLM production experience, RLHF expertise, recent publications at top conferences, and an advanced CS degree.

218k – 273k | San Francisco, CA | New York, NY | Seattle, WA | AI Research | Onsite | 5+ YOE

About the role

Responsibilities

  • Train state-of-the-art models (internal and community-developed) for enterprise customers.
  • Research and integrate cutting-edge algorithms into the training stack.
  • Design solutions that let complex multi-agent systems learn from process-based and outcome-based rewards.

Requirements

  • 5+ years of experience training LLMs in production environments.
  • Experience with post-training methods (RLHF/RLVR) and algorithms (PPO/GRPO).
  • Publications at top conferences (NeurIPS, ICLR, ICML) within the last 2 years.
  • PhD or Master's in Computer Science or related field.

Skills

LLM Training, RLHF, RLVR, PPO, GRPO, Multi-Agent Systems, Reinforcement Learning, PyTorch, Kubernetes, Machine Learning

Similar roles

AI Research jobs

Staff AI Research Engineer

Staff-level AI Research Engineer building and deploying bandit and LLM models for monetization, balancing revenue and retention. Requires advanced ML degree or equivalent, applied large-model experience, and leadership skills.

221k – 331k | New York, NY | AI Research | On-site | 7+ YOE | LLMs | Fine-Tuning

Staff AI Research Engineer

Build and deploy AI systems (LLMs, bandits) for monetization, balancing revenue and retention. Requires advanced ML degree or equivalent, applied large-model experience, and leadership skills.

221k – 331k | Pittsburgh, PA | AI Research | On-site | 7+ YOE | LLMs | Python

Staff AI Research Scientist

Lead high-impact research on LLMs and agentic systems, driving post-training, reasoning, and evaluation to power enterprise AI deployments. Requires 7+ years ML research experience, PhD or equivalent, and strong publication record.

234k – 296k | San Francisco, CA +2 | AI Research | Hybrid | 7+ YOE | JAX | SFT

Staff Research Scientist, Exotic AI

Build next-generation training infrastructure for physical AI models that perceive, reason, and act in structured environments. Lead development of representation models, latent world models, and policy optimization systems.

236k – 339k | Bellevue, WA | AI Research | On-site | 8+ YOE | JAX | PyTorch

Staff Research Scientist, AI Agents & LLMs

Leads research in agentic AI and LLMs, developing models for enterprise reasoning, autonomous agents with tool use, and production systems. Requires PhD, expertise in LLM training/fine-tuning, agent systems, and technical leadership.

236k – 339k | Bellevue, WA +1 | AI Research | Hybrid | RL | LLMs