Applied AI Researcher, Post-Training

150k – 250kSan Francisco, CANew York, NYHybridOct 16

Summary

Develops and evaluates post-training techniques like supervised fine-tuning, RLHF/DPO, and continual adaptation to align foundation models with enterprise systems. Requires expertise in adapting LLMs/SLMs, compound AI systems, and strong prototyping skills.

About the role

Key Responsibilities

Adapt foundation models to real-world performance and alignment requirements using supervised fine-tuning, preference optimization (DPO, RLHF, RLAIF), and continual adaptation.
Develop and evaluate techniques to align models with enterprise systems.
Investigate methods for aligning large models with human and system-level objectives.
Explore trade-offs between generalization and specialization, data efficiency and robustness, capability and controllability.

Requirements

Deep understanding of post-training techniques: supervised fine-tuning, preference optimization (RLHF/DPO), LoRA/PEFT, instruction-tuning pipelines.
Experience adapting frontier models (LLMs/SLMs) to specialized domains via data curation, reward modeling, or continual pretraining.
Expertise in compound AI systems, agentic collaboration (ensembling, ReAct, graph-of-thoughts).
Proven research track record (publications, public work).
Daily use of AI tools (ChatGPT, Cursor, Perplexity).
Strong programming and data analysis skills for prototyping and experiments.

What We Offer

Base salary: $150K–$250K (depending on experience, location, level).
Equity, comprehensive benefits: 100% covered medical/dental/vision, 401(k), commuter benefits, in-office lunch.
Access to state-of-the-art models and AI tools.

Skills

RLHFDPORLAIFLoRAPEFTSupervised Fine-TuningInstruction TuningLLMsSLMsReActGraph-of-ThoughtsData CurationReward Modeling

Similar roles at this salary range

All AI Research jobs →

Snowflake

Jun 16

AI Research Scientist, New Grad – Agents & Reinforcement Learning

Conduct research on autonomous AI agents and reinforcement learning to build self-improving systems that reason, code, and learn at scale within the Snowflake Data Cloud. Requires a PhD (or equivalent) and strong expertise in RL and agentic AI.

176k – 230kBellevue, WAAI ResearchOn-siteEntry levelJAXDPO

Together AI

Jun 12

Frontier Agents Intern

Research intern on the Agents team building and aligning frontier AI systems for complex agentic and scientific tasks. Focus on post-training methods, evaluation frameworks, self-learning, and scalable agent infrastructure.

121k – 131kSan Francisco, CAAI ResearchOn-siteEntry levelJAXNLP

Snowflake

Jun 11

Post-Doctoral Researcher

Post-doctoral researcher conducting independent and collaborative AI/ML research focused on high-impact domains like medicine, finance, and law. Requires a recent or imminent PhD and publications in top venues.

160k – 220kBellevue, WAAI ResearchHybridEntry levelJAXRAG

SpotOn

Jun 8

Senior Software Engineer - Python/Typescript

Senior engineer building AI-driven automation systems to replace manual business workflows across operations, sales, and support. Requires 7+ years experience, production Python/TypeScript skills, and 1-2 years building agentic AI systems.

160k – 190kChicago, IL +3AI ResearchHybrid7+ YOEAWSLLMs

Datology AI

Jun 4

Research Engineer

As a Research Engineer, you will conduct and enable cutting-edge research, translating it into the core product pipeline. You will develop and improve state-of-the-art data curation strategies, accelerating research and ensuring product innovation.

180k – 300kRedwood City, CAAI ResearchOn-site4+ YOEML ModelsAI Models

Apply