AI Research Scientist, New Grad – Agents & Reinforcement Learning

176k – 230kBellevue, WAOnsiteEntry levelJun 16

Summary

Conduct research on autonomous AI agents and reinforcement learning to build self-improving systems that reason, code, and learn at scale within the Snowflake Data Cloud. Requires a PhD (or equivalent) and strong expertise in RL and agentic AI.

About the role

Responsibilities

Design and develop agentic frameworks powered by recursive self-improvement loops, enabling AI systems that iteratively refine their own capabilities and strategies
Build and evaluate auto research agents — systems capable of autonomously formulating hypotheses, executing experiments, and synthesizing findings
Develop coding agents that understand, generate, and debug code across complex, multi-step programming tasks
Conduct research in reinforcement learning with a focus on RLHF, DPO, and PPO as mechanisms for aligning and improving agentic behaviors
Contribute to multi-agent systems where specialized agents collaborate, negotiate, and self-organize to solve enterprise-scale problems
Develop and curate training data pipelines — both synthetic and human-annotated — to support novel agentic and RL research domains
Publish research findings at top-tier venues such as NeurIPS, ICML, ICLR, and ACL

Requirements

PhD in Computer Science, Machine Learning, Artificial Intelligence, or a closely related field (completing or recently completed; or equivalent research experience)
Foundational expertise in reinforcement learning algorithms, including RLHF, DPO, PPO, or multi-agent systems
Research experience in LLM post-training, fine-tuning, or reasoning model development
Demonstrated ability to implement and experiment with agentic architectures — including tool-use, planning, and self-correction loops
Proficiency in Python and at least one deep learning framework (PyTorch or JAX strongly preferred)
Strong mathematical and analytical foundation — comfortable working at the intersection of theory and empirical research
At least one first-author or co-authored publication or preprint in a relevant AI/ML area

Nice-to-Haves

Hands-on experience building or evaluating coding agents or auto research agents
Familiarity with recursive self-improvement frameworks or automated AI scientist paradigms
Experience with large-scale distributed training or efficient training paradigms
Background in mathematical reasoning, structured decision-making, or program synthesis
Exposure to domain-specific AI applications in healthcare, finance, or enterprise workflows

Skills

PythonPyTorchJAXReinforcement LearningRLHFDPOPPOMulti-Agent SystemsLLM Post-TrainingAgentic Architectures

Similar roles at this salary range

All AI Research jobs →

Snowflake

Jun 11

Post-Doctoral Researcher

Post-doctoral researcher conducting independent and collaborative AI/ML research focused on high-impact domains like medicine, finance, and law. Requires a recent or imminent PhD and publications in top venues.

160k – 220kBellevue, WAAI ResearchHybridEntry levelJAXRAG

SpotOn

Jun 8

Senior Software Engineer - Python/Typescript

Senior engineer building AI-driven automation systems to replace manual business workflows across operations, sales, and support. Requires 7+ years experience, production Python/TypeScript skills, and 1-2 years building agentic AI systems.

160k – 190kChicago, IL +3AI ResearchHybrid7+ YOEAWSLLMs

Datology AI

Jun 4

Research Engineer

As a Research Engineer, you will conduct and enable cutting-edge research, translating it into the core product pipeline. You will develop and improve state-of-the-art data curation strategies, accelerating research and ensuring product innovation.

180k – 300kRedwood City, CAAI ResearchOn-site4+ YOEML ModelsAI Models

Pindrop

Jun 3

Senior Research Scientist

As a Senior Research Scientist on the Video team, you will drive research initiatives and translate advanced computer vision and deepfake detection models into scalable enterprise solutions. You will focus on audio-visual deepfake detection, synthetic media identification, and real-time video processing.

185k – 215kUnited StatesAI ResearchRemote5+ YOEKerasPython

Pindrop

Jun 1

Research Scientist II

As a Research Scientist II on the Video team, you will drive core research initiatives, deliver reproducible experimental results, and help translate machine learning models into real-world product solutions, focusing on real-time video processing and deepfake detection.

160k – 185kUnited StatesAI ResearchRemote3+ YOEGoC++

Apply