Research Engineer, Post-Training (All Industry Levels)

225k – 400k | United States | ML Engineering | Remote
Summary

Develops alignment algorithms, data pipelines, and sampling methods to improve the performance and efficiency of AI models during post-training. Requires a PhD or equivalent, ML expertise including reinforcement learning and transformers, and production code experience.

About the role

Responsibilities

  • Develop alignment algorithms and loss functions to improve data sample efficiency.
  • Write data pipelines to process diverse web data into a format models can ingest.
  • Identify quality signals to understand our model’s performance in the real world.
  • Design sampling algorithms to improve serving efficiency of large generative models.

Requirements

  • A PhD (or equivalent) at minimum.
  • Ability to write clear, clean production-facing and training code.
  • Experience working with GPUs (training, serving, debugging).
  • Experience with data pipelines and data infrastructure.
  • Strong understanding of modern machine learning techniques (reinforcement learning, transformers, etc.).
  • Track record of exceptional research or creative applied ML projects.

Nice to Have

  • Experience with product experimentation and A/B testing.
  • Experience training large models in a distributed setting.
  • Familiarity with ML deployment and orchestration (Kubernetes, Docker, cloud).
  • Publications in relevant academic journals or conferences in the field of machine learning.

Skills

Reinforcement Learning, Transformers, PyTorch, GPUs, Data Pipelines, Kubernetes, Docker, Google Cloud, Alignment Algorithms, Distributed Training