Research Engineer/Research Scientist

295k – 555kSan Francisco, CAML EngineeringHybrid7+ YOEJun 25

Summary

Research Engineer/Scientist improving model capabilities for personalized AI experiences. Focus on tool-use, instruction following, evaluations, and training improvements. Requires strong ML engineering and research experience.

About the role

Responsibilities

Own and pursue a research agenda to improve model capability and performance
Collaborate closely with research and product teams to optimize models for customers
Build robust evaluations for tracking modeling improvements
Design, implement, test, and debug code across the research stack

Requirements

Strong ML engineering skills and research experience
Deep understanding of machine learning and machine learning applications
Working knowledge of relevant models and building evaluations for model capability improvement
Comfortable diving into a large ML codebase to debug
Ability to thrive in a dynamic and technically complex environment

Nice-to-Haves

Passion for creative, product-driven research

Skills

Machine LearningML EngineeringModel EvaluationResearchPythonDeep LearningModel TrainingDebugging ML CodebasesEvaluation FrameworksTool-use

Similar roles at this salary range

All ML Engineering jobs →

xAI

Jun 24

Member of Technical Staff

Hands-on technical contributor focused on stabilizing and advancing large language model training, fine-tuning, and research in AI/deep learning. Requires a bachelor's degree and 2+ years of experience with distributed systems, ML infrastructure, and programming in Rust/C++/Python.

324k – 396kPalo Alto, CAML EngineeringOn-site2+ YOEC++GPU

xAI

Jun 24

Member of Technical Staff

Hands-on technical leader building and scaling large language models and AI systems. Requires 3-5+ years of AI/ML experience with strong Python and deep learning frameworks.

324k – 396kPalo Alto, CAML EngineeringOn-site5+ YOEC++JAX

Anthropic

Jun 23

Research Engineer, Safeguards Labs

Research engineer on the Safeguards Labs team building and evaluating novel safety methods to detect misuse, strengthen model safeguards, and reduce real-world harm from Claude.

350k – 850kSan Francisco, CA +1ML EngineeringHybridPythonClassifiers

Axion

Jun 22

Staff Software Engineer, Agentic Platform

Senior individual contributor architecting and scaling agentic LLM systems that turn messy manufacturing data into reliable root-cause insights. Owns orchestration, retrieval, evaluation, and guardrails for non-deterministic production systems.

250k – 270kSan Francisco, CA +1ML EngineeringHybrid7+ YOEMCPobservability

Traba

Jun 22

Staff Software Engineer

Founding Staff Agent Engineer building Traba's agentic platform: orchestration, evals, model strategy, and integrations with customer operational systems. Requires 7+ years engineering experience with 2+ years shipping production LLM/agent systems.

240k – 300kNew York, NY +1ML EngineeringOn-site7+ YOEKafkaPython

Apply