Staff ML Engineer, AI Platform

250k – 300kSan Francisco, CAML EngineeringHybrid7+ YOEFeb 2

Summary

Builds ML platform infrastructure including evaluation/release gates, debug tooling, chart context retrieval, data pipelines, and model serving to accelerate AI improvements for clinical workflows. Requires 7+ years software engineering with 3+ in ML infra/platform, strong Python/TypeScript backend skills.

About the role

What You’ll Own

Eval & Release Infrastructure

Automated graders and release gates that work across product pods
Unified eval dataset versioning and execution to replace fragmented workflows
Production quality monitoring with end-to-end tracing, shared metrics, and automated alerting

Debug Tooling

Encounter replay that reconstructs exact inference inputs (retrieved chart context, packed prompts, model versions) so teams reproduce issues without digging through logs
Diff views comparing known-good runs to regressions

Chart Context & Data Pipelines

The retrieval layer that pulls relevant patient history and assembles it into consistent model-ready inputs
Feedback loops that capture real-world usage and convert it into training signal
End-to-end latency instrumentation across every workflow step

Preference Infrastructure

The system that enables clinician and site-specific behavior across specialties
Different clinics want different defaults, different phrasing, different workflows. Build the platform that supports customization at scale

Model Serving

Performance and reliability layer for critical in-house models with clear SLOs, capacity planning, and regression alerts

Who You Are

7+ years in software engineering, 3+ focused on ML infrastructure, platform engineering, or data systems
Staff-level scope: owned cross-cutting infrastructure, influenced technical direction across multiple teams
Strong backend fundamentals in Python, TypeScript, or similar
Built eval systems, data pipelines, or ML observability infrastructure
Comfortable on both the ML and Eng sides of MLOps
Track record of platform work that measurably accelerated other teams
In SF, 3x/week in-person

Compensation

Base compensation range of approximately $250,000-300,000 per year, exclusive of equity.

Skills

PythonTypeScriptML InfrastructureMLOpsData PipelinesEval SystemsObservabilityKubernetesRetrieval SystemsModel Serving

Similar roles at this salary range

All ML Engineering jobs →

Airbnb

Jun 23

Staff Machine Learning Engineer

Build and deploy cutting-edge ML and Generative AI systems to transform Airbnb's customer support experience, focusing on LLM fine-tuning, RAG, and intelligent service automation.

212k – 260kSan Francisco, CAML EngineeringRemote9+ YOELLMRAG

Mixpanel

Jun 23

Senior Software Engineer, AI Platform

Senior Software Engineer building scalable AI infrastructure, agent orchestration frameworks, evaluation systems, and high-performance LLM serving at Mixpanel. Requires 5+ years experience and hands-on LLM/agent work.

226k – 306kSan Francisco, CAML EngineeringHybrid5+ YOELLMsMLOps

Twilio

Jun 23

Tech Lead, Applied Research

Tech Lead driving AI R&D and end-to-end delivery of production-ready prototypes using full-stack development, LLMs, and emerging technologies. Requires 10+ years experience and strong autonomy.

228k – 335kUnited StatesML EngineeringRemote10+ YOEGoSQL

Jun 22

Senior Machine Learning Systems Engineer

Build large-scale ML experimentation and training orchestration platforms, including agentic AI execution systems, to accelerate Ads ML development at Reddit. Requires 5+ years infrastructure experience and 2+ years building production ML platforms.

217k – 303kUnited StatesML EngineeringRemote5+ YOERayArgo

Axion

Jun 22

Staff Software Engineer, Agentic Platform

Senior individual contributor architecting and scaling agentic LLM systems that turn messy manufacturing data into reliable root-cause insights. Owns orchestration, retrieval, evaluation, and guardrails for non-deterministic production systems.

250k – 270kSan Francisco, CA +1ML EngineeringHybrid7+ YOEMCPobservability

Apply