Research Scientist, Post-Training

180k – 300k | Redwood City, CA | Onsite | 3+ YOE
Summary

Leads research on post-training data curation for foundation models: designing algorithms to generate and improve instruction and preference datasets, and unifying pre-training and post-training data curation. Requires 3+ years of deep learning research, post-training experience with vision, language, and multimodal models, and skills in a deep learning framework (PyTorch, or willingness to learn it).

About the role

What You'll Work On

  • Post-training data curation: conduct research on algorithmically curating post-training data (e.g., generating/refining preference and instruction-following data, curating capability/domain-specific data, making post-training more effective/controllable/generalizable).
  • Unifying pre-training and post-training data curation: pursue research on end-to-end data curation (curate pre-training data to improve post-trainability, jointly optimize pre/post-training data to maximize final model performance).
  • Transform messy literature into practical improvements: source, vet, implement, and improve promising ideas from the literature or of your own creation.
  • Conduct science driven by real-world needs: let customer needs and product improvements guide your research.

About You

Required:

  • 3+ years of deep learning research experience
  • Experience with post-training large vision, language, and multimodal models
  • Post-training algorithm development, data curation, and/or synthetic data methods for:
    • Preference-based tuning (e.g. DPO, RLVR, RRHF)
    • Alternative supervision & self-supervision techniques (e.g. self-training, chain-of-thought distillation)
    • SFT (e.g. instruction tuning, demonstration fine-tuning)
  • Post-training tooling development and engineering experience
  • Strong understanding of deep learning fundamentals
  • Software engineering skills and proficiency in a deep learning framework (PyTorch, or willingness to learn it) for large-scale experiments and production prototypes
  • Track record of success in deep learning research (papers, tools, artifacts)

Nice-to-haves:

  • Experience with data management and distributed data processing (Spark, Snowflake, etc.)
  • Experience building and shipping ML products

Compensation

  • Base salary: $180,000 - $300,000
  • Significant equity
  • 100% covered health benefits (medical, vision, dental)
  • 401(k) with 4% company match
  • Unlimited PTO
  • Annual $2,000 wellness stipend
  • Annual $1,000 learning stipend
  • Daily lunches/snacks
  • Relocation assistance to Bay Area
Skills

PyTorch, DPO, RLVR, RRHF, SFT, instruction tuning, preference tuning, synthetic data, data curation, multimodal models