Skip to content

AI Research Scientist, New Grad – Agents & Reinforcement Learning

176k – 230kBellevue, WAOnsiteEntry level
Summary

Conduct research on autonomous AI agents and reinforcement learning to build self-improving systems that reason, code, and learn at scale within the Snowflake Data Cloud. Requires a PhD (or equivalent) and strong expertise in RL and agentic AI.

About the role

Responsibilities

  • Design and develop agentic frameworks powered by recursive self-improvement loops, enabling AI systems that iteratively refine their own capabilities and strategies
  • Build and evaluate auto research agents — systems capable of autonomously formulating hypotheses, executing experiments, and synthesizing findings
  • Develop coding agents that understand, generate, and debug code across complex, multi-step programming tasks
  • Conduct research in reinforcement learning with a focus on RLHF, DPO, and PPO as mechanisms for aligning and improving agentic behaviors
  • Contribute to multi-agent systems where specialized agents collaborate, negotiate, and self-organize to solve enterprise-scale problems
  • Develop and curate training data pipelines — both synthetic and human-annotated — to support novel agentic and RL research domains
  • Publish research findings at top-tier venues such as NeurIPS, ICML, ICLR, and ACL

Requirements

  • PhD in Computer Science, Machine Learning, Artificial Intelligence, or a closely related field (completing or recently completed; or equivalent research experience)
  • Foundational expertise in reinforcement learning algorithms, including RLHF, DPO, PPO, or multi-agent systems
  • Research experience in LLM post-training, fine-tuning, or reasoning model development
  • Demonstrated ability to implement and experiment with agentic architectures — including tool-use, planning, and self-correction loops
  • Proficiency in Python and at least one deep learning framework (PyTorch or JAX strongly preferred)
  • Strong mathematical and analytical foundation — comfortable working at the intersection of theory and empirical research
  • At least one first-author or co-authored publication or preprint in a relevant AI/ML area

Nice-to-Haves

  • Hands-on experience building or evaluating coding agents or auto research agents
  • Familiarity with recursive self-improvement frameworks or automated AI scientist paradigms
  • Experience with large-scale distributed training or efficient training paradigms
  • Background in mathematical reasoning, structured decision-making, or program synthesis
  • Exposure to domain-specific AI applications in healthcare, finance, or enterprise workflows
Skills
PythonPyTorchJAXReinforcement LearningRLHFDPOPPOMulti-Agent SystemsLLM Post-TrainingAgentic Architectures
Similar roles at this salary range
All AI Research jobs →
Snowflake

Post-Doctoral Researcher

Post-doctoral researcher conducting independent and collaborative AI/ML research focused on high-impact domains like medicine, finance, and law. Requires a recent or imminent PhD and publications in top venues.

160k – 220kBellevue, WAAI ResearchHybridEntry levelJAXRAG
SpotOn

Senior Software Engineer - Python/Typescript

Senior engineer building AI-driven automation systems to replace manual business workflows across operations, sales, and support. Requires 7+ years experience, production Python/TypeScript skills, and 1-2 years building agentic AI systems.

160k – 190kChicago, IL +3AI ResearchHybrid7+ YOEAWSLLMs
Datology AI

Research Engineer

As a Research Engineer, you will conduct and enable cutting-edge research, translating it into the core product pipeline. You will develop and improve state-of-the-art data curation strategies, accelerating research and ensuring product innovation.

180k – 300kRedwood City, CAAI ResearchOn-site4+ YOEML ModelsAI Models
Pindrop

Senior Research Scientist

As a Senior Research Scientist on the Video team, you will drive research initiatives and translate advanced computer vision and deepfake detection models into scalable enterprise solutions. You will focus on audio-visual deepfake detection, synthetic media identification, and real-time video processing.

185k – 215kUnited StatesAI ResearchRemote5+ YOEKerasPython
Pindrop

Research Scientist II

As a Research Scientist II on the Video team, you will drive core research initiatives, deliver reproducible experimental results, and help translate machine learning models into real-world product solutions, focusing on real-time video processing and deepfake detection.

160k – 185kUnited StatesAI ResearchRemote3+ YOEGoC++