Skip to content

Research Engineer/Research Scientist

295k – 555kSan Francisco, CAML EngineeringHybrid7+ YOE
Summary

Research Engineer/Scientist improving model capabilities for personalized AI experiences. Focus on tool-use, instruction following, evaluations, and training improvements. Requires strong ML engineering and research experience.

About the role

Responsibilities

  • Own and pursue a research agenda to improve model capability and performance
  • Collaborate closely with research and product teams to optimize models for customers
  • Build robust evaluations for tracking modeling improvements
  • Design, implement, test, and debug code across the research stack

Requirements

  • Strong ML engineering skills and research experience
  • Deep understanding of machine learning and machine learning applications
  • Working knowledge of relevant models and building evaluations for model capability improvement
  • Comfortable diving into a large ML codebase to debug
  • Ability to thrive in a dynamic and technically complex environment

Nice-to-Haves

  • Passion for creative, product-driven research
Skills
Machine LearningML EngineeringModel EvaluationResearchPythonDeep LearningModel TrainingDebugging ML CodebasesEvaluation FrameworksTool-use
Similar roles at this salary range
All ML Engineering jobs →
xAI

Member of Technical Staff

Hands-on technical contributor focused on stabilizing and advancing large language model training, fine-tuning, and research in AI/deep learning. Requires a bachelor's degree and 2+ years of experience with distributed systems, ML infrastructure, and programming in Rust/C++/Python.

324k – 396kPalo Alto, CAML EngineeringOn-site2+ YOEC++GPU
xAI

Member of Technical Staff

Hands-on technical leader building and scaling large language models and AI systems. Requires 3-5+ years of AI/ML experience with strong Python and deep learning frameworks.

324k – 396kPalo Alto, CAML EngineeringOn-site5+ YOEC++JAX
Anthropic

Research Engineer, Safeguards Labs

Research engineer on the Safeguards Labs team building and evaluating novel safety methods to detect misuse, strengthen model safeguards, and reduce real-world harm from Claude.

350k – 850kSan Francisco, CA +1ML EngineeringHybridPythonClassifiers
Axion

Staff Software Engineer, Agentic Platform

Senior individual contributor architecting and scaling agentic LLM systems that turn messy manufacturing data into reliable root-cause insights. Owns orchestration, retrieval, evaluation, and guardrails for non-deterministic production systems.

250k – 270kSan Francisco, CA +1ML EngineeringHybrid7+ YOEMCPobservability
Traba

Staff Software Engineer

Founding Staff Agent Engineer building Traba's agentic platform: orchestration, evals, model strategy, and integrations with customer operational systems. Requires 7+ years engineering experience with 2+ years shipping production LLM/agent systems.

240k – 300kNew York, NY +1ML EngineeringOn-site7+ YOEKafkaPython