Skip to content

AI Engineer

200k – 250kSan Francisco, CAML EngineeringOnsite7+ YOE
Summary

Build and productionize LLM systems for clinical documentation at a healthcare AI startup. Requires 7+ years experience training, fine-tuning, and evaluating models with strong focus on evals and reliability.

About the role

What You'll Do

  • Train and fine-tune open source models, leveraging our vast proprietary dataset to push accuracy beyond what off-the-shelf models can do.
  • Build eval datasets and pipelines that let us measure model accuracy rigorously and improve it continuously.
  • Design how we measure accuracy in the first place: the metrics, harnesses, and feedback loops that turn real clinical outcomes into measurable model improvements.
  • Build scalable, cost-efficient inference infrastructure with great monitoring and observability.
  • Build better agentic infrastructure and partner on the interfaces that turn model capability into a great clinician experience.
  • Stay at the frontier: keep up with the latest research, frontier model capabilities, and open source frameworks, and bring the best of it into production.
  • Prototype quickly, then harden into scalable, secure, and reliable production systems.

You Might Be a Good Fit If You Have

  • 7+ years of professional software engineering experience, with meaningful depth in AI/ML.
  • Experience training, fine-tuning, or evaluating LLMs and open source models, with real opinions about what works and what does not.
  • Shipped real AI software to real users. You can describe something you built, what broke, and how you fixed it.
  • Strong instincts for evals, observability, and the feedback loops that turn user feedback into measurable improvement.
  • Experience building agentic systems: tool use, generator and critic loops, planners and executors, and orchestration where one agent's output drives another's work.
  • Comfort building infrastructure that is fast, reliable, and cost-efficient at scale.
  • Startup experience shipping real features in high-growth environments.
  • A product mindset, comfort across the stack, and the ability to operate in ambiguity without a clean spec.
  • High standards for reliability and accuracy when real clinicians and patients depend on your work.
Skills
LLMsFine-tuningModel EvaluationAgentic SystemsInference InfrastructureObservabilityOpen Source ModelsScalable SystemsML InfrastructureProduction AI Systems
Similar roles at this salary range
All ML Engineering jobs →
Mem0

Senior Research Engineer

Own the end-to-end lifecycle of memory features for AI agents. Fine-tune models, implement research, build evaluations, and ship production systems with Engineering.

175k – 250kSan Francisco, CAML EngineeringOn-site7+ YOERAGvLLM
Ironclad

Senior Software Engineer, AI

Lead design and delivery of high-priority AI initiatives across multiple codebases. Build and ship AI-powered features with strong backend fundamentals and product sense.

180k – 220kSan Francisco, CAML EngineeringHybrid5+ YOEReactEvals
Mercury

Senior Machine Learning Operations Engineer

Build and operate Mercury's real-time ML inference platform for fraud risk decisioning. Own model deployment, observability, and lifecycle tooling with strong backend Python fundamentals.

167k – 208kSan Francisco, CA +2ML EngineeringHybrid5+ YOESQLSHAP
Plaid

Machine Learning Engineer - Embedded Insights

Drive ML initiatives from concept to production on the Embedded Insights team. Identify opportunities, build and deploy models using Plaid's financial datasets, and partner with product teams to deliver scalable customer-facing intelligence products.

212k – 272kSan Francisco, CA +2ML EngineeringHybrid5+ YOESQLMLOps
Plaid

Machine Learning Engineer

Advance Plaid’s foundation models by developing novel architectures, pretraining objectives, and fine-tuning strategies. Work across the full ML stack from data engineering to production serving and monitoring.

212k – 272kSan Francisco, CA +2ML EngineeringHybrid1+ YOELLMsPython