Skip to content

Research Engineer, AI Safety & Alignment

225k – 400kRedwood City, CAAI ResearchOnsite
Summary

Develops evaluation methods, alignment techniques, and adversarial testing for large language models to ensure safety and alignment with human values. Requires PhD in ML/CS, production code skills, GPU experience, and transformers/RL expertise.

About the role

Responsibilities

  • Develop and implement novel evaluation methodologies and metrics to assess the safety and alignment of large language models.
  • Research and develop cutting-edge techniques for model alignment, value learning, and interpretability.
  • Conduct adversarial testing to proactively uncover potential vulnerabilities and failure modes in our models.
  • Analyze and mitigate biases, toxicity, and other harmful behaviors in large language models through techniques like reinforcement learning from human feedback (RLHF) and fine-tuning.
  • Collaborate with engineering and product teams to translate safety research into practical, scalable solutions and best practices.
  • Stay abreast of the latest advancements in AI safety research and contribute to the academic community through publications and presentations.

Requirements

  • Hold a PhD (or equivalent experience) in a relevant field such as Computer Science, Machine Learning, or a related discipline.
  • Write clear and clean production-facing and training code.
  • Experience working with GPUs (training, serving, debugging).
  • Experience with data pipelines and data infrastructure.
  • Strong understanding of modern machine learning techniques, particularly transformers and reinforcement learning, with a focus on their safety implications.
  • Passionate about the responsible development of AI and dedicated to solving complex safety challenges.

Nice to Have

  • Experience with product experimentation and A/B testing.
  • Experience training large models in a distributed setting.
  • Familiarity with ML deployment and orchestration (Kubernetes, Docker, cloud).
  • Experience with explainable AI (XAI) and interpretability techniques.
  • Research in AI safety, alignment, ethics, or a related area.
  • Knowledge of the broader societal and ethical implications of AI, including policy and governance.
  • Publications in relevant academic journals or conferences in the field of machine learning.
Skills
PyTorchTransformersReinforcement LearningRLHFGPUsData PipelinesInterpretabilityKubernetesDockerExplainable AI
Similar roles at this salary range
All AI Research jobs →
Luma AI

Applied Research Scientist / Engineer

Work as a fullstack applied researcher adapting multimodal video foundation models for production. Focus on controllability, personalization, and end-user quality using SFT, RL, and data-driven refinement.

200k – 450kNew York, NY +1AI ResearchHybrid7+ YOERLSFT
Writer

Staff AI Research Scientist

Lead high-impact research on LLMs and agentic systems, driving post-training, reasoning, and evaluation to power enterprise AI deployments. Requires 7+ years ML research experience, PhD or equivalent, and strong publication record.

234k – 296kSan Francisco, CA +2AI ResearchHybrid7+ YOEJAXSFT
Snowflake

Staff Research Scientist, Exotic AI

Build next-generation training infrastructure for physical AI models that perceive, reason, and act in structured environments. Lead development of representation models, latent world models, and policy optimization systems.

236k – 339kBellevue, WAAI ResearchOn-site8+ YOEJAXPyTorch
Datology AI

Research Engineer

As a Research Engineer, you will conduct and enable cutting-edge research, translating it into the core product pipeline. You will develop and improve state-of-the-art data curation strategies, accelerating research and ensuring product innovation.

180k – 300kRedwood City, CAAI ResearchOn-site4+ YOEML ModelsAI Models
Pindrop

Senior Research Scientist

As a Senior Research Scientist on the Video team, you will drive research initiatives and translate advanced computer vision and deepfake detection models into scalable enterprise solutions. You will focus on audio-visual deepfake detection, synthetic media identification, and real-time video processing.

185k – 215kUnited StatesAI ResearchRemote5+ YOEKerasPython