Research Engineer, AI Safety & Alignment

225k – 400k | Redwood City, CA | AI Research | Onsite | Oct 3

Summary

Develops evaluation methods, alignment techniques, and adversarial tests for large language models to keep them safe and aligned with human values. Requires a PhD in ML/CS (or equivalent experience), production coding skills, GPU experience, and expertise in transformers and reinforcement learning.

About the role

Responsibilities

Develop and implement novel evaluation methodologies and metrics to assess the safety and alignment of large language models.
Research and develop cutting-edge techniques for model alignment, value learning, and interpretability.
Conduct adversarial testing to proactively uncover potential vulnerabilities and failure modes in our models.
Analyze and mitigate biases, toxicity, and other harmful behaviors in large language models through techniques like reinforcement learning from human feedback (RLHF) and fine-tuning.
Collaborate with engineering and product teams to translate safety research into practical, scalable solutions and best practices.
Stay abreast of the latest advancements in AI safety research and contribute to the academic community through publications and presentations.

Requirements

PhD (or equivalent experience) in Computer Science, Machine Learning, or a related discipline.
Ability to write clear, clean production-facing and training code.
Experience working with GPUs (training, serving, debugging).
Experience with data pipelines and data infrastructure.
Strong understanding of modern machine learning techniques, particularly transformers and reinforcement learning, with a focus on their safety implications.
Passion for the responsible development of AI and dedication to solving complex safety challenges.

Nice to Have

Experience with product experimentation and A/B testing.
Experience training large models in a distributed setting.
Familiarity with ML deployment and orchestration (Kubernetes, Docker, cloud).
Experience with explainable AI (XAI) and interpretability techniques.
Research in AI safety, alignment, ethics, or a related area.
Knowledge of the broader societal and ethical implications of AI, including policy and governance.
Publications in relevant academic journals or conferences in the field of machine learning.

Skills

PyTorch, Transformers, Reinforcement Learning, RLHF, GPUs, Data Pipelines, Interpretability, Kubernetes, Docker, Explainable AI
