Staff AI Research Scientist

234k – 296kSan Francisco, CANew York, NYSeattle, WAAI ResearchHybrid7+ YOEJun 17

Summary

Lead high-impact research on LLMs and agentic systems, driving post-training, reasoning, and evaluation to power enterprise AI deployments. Requires 7+ years ML research experience, PhD or equivalent, and strong publication record.

About the role

What you'll do

Lead an independent, high-impact research agenda on large language models and agentic systems, owning projects from early hypothesis through model training, evaluation, and production deployment
Design and execute large-scale post-training experiments using supervised fine-tuning, reinforcement learning from human feedback (RLHF), RLAIF, DPO, and emerging alignment techniques — with a focus on improving multi-step reasoning, planning, and tool use in enterprise agentic workflows
Build novel evaluation benchmarks and methodologies that push beyond existing limitations, establishing rigorous measures for how well models perform on complex, real-world enterprise tasks
Develop scalable data synthesis and curation pipelines that generate the high-quality training signal driving model improvement — including LLM-as-judge frameworks, synthetic data generation, and adversarial dataset construction
Shape WRITER's model architecture and training roadmap by translating your research insights into concrete improvements to our enterprise-grade LLMs, working hand-in-hand with research engineering and product teams
Publish and present original research at top-tier venues — NeurIPS, ICLR, ICML, ACL, and others — representing WRITER at the frontier of the field and contributing to the broader scientific community
Mentor and uplevel fellow researchers and engineers on the team, helping set a high bar for scientific rigor, experimental design, and research culture

What you need

7+ years of hands-on ML research experience, with deep expertise in large language model pre-training and post-training — you've trained models at scale, debugged distributed jobs, and shipped improvements that made a measurable difference
Expert-level knowledge of post-training methods including SFT, RLHF, RLAIF, DPO, GRPO, and related alignment and reasoning techniques, with a track record of applying them to real, production-grade systems
Strong command of Python and PyTorch (or JAX), with the engineering depth to build and scale training pipelines, evaluation infrastructure, and data synthesis workflows yourself — not just direct others to do it
A meaningful publication record at competitive ML/AI venues (NeurIPS, ICLR, ICML, ACL, EMNLP, or equivalent), evidencing your ability to originate ideas and execute on a multi-month research agenda independently
Hands-on experience designing or evaluating agentic systems — models that plan, reason through multi-step tasks, use tools, and recover gracefully from errors — with a nuanced understanding of where they break and how to fix them
A Ph.D. in Computer Science, Machine Learning, NLP, or a related field — or equivalent demonstrated research experience with a strong portfolio of independent, published work
The instincts and orientation that match WRITER's values: you Connect — you collaborate openly across research, engineering, and product and communicate complex ideas with clarity to both technical and non-technical audiences; you Challenge — you ask the hard questions, push back on conventional wisdom, and pursue the research directions others haven't tried yet; you Own — you drive your projects end-to-end with urgency, take accountability for results, and care deeply about the impact your work has on real customers

Benefits & perks (US Full-time employees)

Generous PTO, plus company holidays
Medical, dental, and vision coverage for you and your family
Paid parental leave for all parents (16 weeks)
Fertility and family planning support
Early-detection cancer testing through Galleri
Flexible spending account and dependent FSA options
Health savings account for eligible plans with company contribution
Annual work-life stipends for: Wellness stipend for gym, massage/chiropractor, personal training, etc.; Learning and development stipend
Company-wide off-sites and team off-sites
Competitive compensation, company stock options and 401k

Skills

PythonPyTorchJAXLLM pre-trainingLLM post-trainingSFTRLHFRLAIFDPOGRPOAgentic systemsMulti-step reasoningTool useEvaluation benchmarksData synthesis pipelines

Similar roles at this salary range

All AI Research jobs →

Snowflake

Jun 6

Staff Research Scientist, Exotic AI

Build next-generation training infrastructure for physical AI models that perceive, reason, and act in structured environments. Lead development of representation models, latent world models, and policy optimization systems.

236k – 339kBellevue, WAAI ResearchOn-site8+ YOEJAXPyTorch

Upstart

Jun 3

Principal Applied Scientist

As a Principal Applied Scientist, you will define the technical direction for offer optimization and conversion modeling systems, working across teams to integrate models and optimization systems. This role involves structuring ambiguous problems, designing solutions, and providing technical oversight to ensure a coherent long-term vision.

220k – 330kUnited StatesAI ResearchRemote8+ YOEFintechStatistics

Luma AI

Jun 1

Simulation Researcher/Engineer

As a Simulation Researcher/Engineer, you will design and build simulation environments for training general-purpose robot policies. This role involves working with generative models and classical physics simulation, developing differentiable pipelines, and driving asset generation.

250k – 450kLos Angeles, CA +2AI ResearchHybridC++PhysX

Luma AI

Jun 1

Research Scientist - World Model

As a Research Scientist on the World Models team, you will invent next-generation world model architectures with a focus on controllability and physical consistency, develop controllability mechanisms, and define and own metrics for physical fidelity and action-following.

250k – 450kLos Angeles, CA +2AI ResearchHybridPyTorchRobotics

The Voleon Group

May 29

Member of Research Staff, Reinforcement Learning

Voleon Securities is seeking an experienced and creative reinforcement learning researcher to join their ML research group. This role focuses on applying state-of-the-art AI/ML techniques to construct market-making strategies and optimize trading models.

250k – 275kBerkeley, CA +1AI ResearchHybridGPUPython

Apply