Applied Research Engineer

250k – 300kSan Francisco, CAHybrid3+ YOESep 17

Summary

Develops advanced systems for human-in-the-loop AI data alignment using RLHF/DPO, improves data quality, and builds AI-assisted labeling tools. Requires Master's/PhD, 3+ years ML experience, Python/PyTorch proficiency, and top-tier publications.

About the role

Responsibilities

Advance AI alignment by developing methods like RLHF and novel approaches to ensure AI systems reflect human preferences.
Improve human-in-the-loop data quality through measurement and enhancement systems.
Create AI-assisted data labeling tools using active learning and adaptive sampling.
Investigate impacts of human feedback types (demonstrations, preferences, critiques) on model performance.
Optimize human feedback collection with novel algorithms.
Integrate research breakthroughs into Labelbox’s product suite.
Engage with customers and AI community, publish in top conferences, and create technical content.

Requirements

Ph.D. or Master’s in Computer Science, Machine Learning, AI, or related field.
3+ years experience solving complex ML challenges with real-world impact.
Expertise in data quality measurement and refinement systems.
Deep understanding of frontier AI models (LLMs, multimodal) and human data strategies.
Proficiency in Python and deep learning frameworks (PyTorch, JAX, TensorFlow).
Track record of publishing in top AI/ML conferences (NeurIPS, ICML, ICLR, etc.).
Ability to bridge research to prototypes, strong analytical/problem-solving skills.
Exceptional communication and collaboration skills.

Compensation

Annual base salary range: $250,000—$300,000 USD (varies by skills, experience, location; excludes equity/benefits).

Skills

PythonPyTorchJAXTensorFlowRLHFDPOMachine LearningDeep LearningActive LearningLarge Language Models

Similar roles at this salary range

All AI Research jobs →

Upstart

Jun 3

Principal Applied Scientist

As a Principal Applied Scientist, you will define the technical direction for offer optimization and conversion modeling systems, working across teams to integrate models and optimization systems. This role involves structuring ambiguous problems, designing solutions, and providing technical oversight to ensure a coherent long-term vision.

220k – 330kUnited StatesAI ResearchRemoteFintechStatistics

Anthropic

Jun 2

Research Scientist, Life Sciences

Anthropic is seeking a Research Scientist to join their Life Sciences team. This role involves building and shipping agentic tools, designing evaluation benchmarks, and partnering with external users to improve model capabilities on scientific tasks.

300k – 320kSan Francisco, CAAI ResearchHybridLLMsRLHF

Luma AI

Jun 1

Simulation Researcher/Engineer

As a Simulation Researcher/Engineer, you will design and build simulation environments for training general-purpose robot policies. This role involves working with generative models and classical physics simulation, developing differentiable pipelines, and driving asset generation.

250k – 450kLos Angeles, CA +2AI ResearchHybridC++PhysX

Luma AI

Jun 1

Research Scientist - World Model

As a Research Scientist on the World Models team, you will invent next-generation world model architectures with a focus on controllability and physical consistency, develop controllability mechanisms, and define and own metrics for physical fidelity and action-following.

250k – 450kLos Angeles, CA +2AI ResearchHybridPyTorchRobotics

Airbnb

Jun 1

Principle Engineer -In Bayesian, Large Foundational Systems, and Distributional Reinforcement Learning

Lead advanced research and development of cutting-edge AI models with deep expertise in Bayesian Learning and Distributional Reinforcement Learning. This role involves architecting and integrating foundational Bayesian frameworks with advanced architectures and large language models to redefine personalization and decision-making.

296k – 370kUnited StatesAI ResearchRemoteC++Java

Apply