Machine Learning Engineer

Owns end-to-end production ML systems for clinical workflows, including training/fine-tuning LLMs for medical reasoning and question answering. Requires strong ML/software engineering, PyTorch experience, and ability to handle high-stakes ambiguity with real patient impact.

225k – 300kSan Francisco, CAML EngineeringOnsite

Apply

About the role

What You’ll Do

Own end-to-end ML systems, including architecture, data, modeling, evaluation, and production infrastructure
Train and fine-tune large language models (LLMs) for:
- Clinical reasoning
- Medical question answering
- Evidence-grounded generation
Make and own tradeoffs across accuracy, latency, cost, and safety in high-stakes production environments
Develop evaluation frameworks to ensure model safety and clinical validity
Integrate ML systems into product workflows and patient-facing applications
Monitor system performance in production and iterate based on real-world usage and feedback
Define what “correct” means in ambiguous clinical workflows in collaboration with engineers and clinicians

What We’re Looking For

Strong foundation in machine learning and software engineering
Track record of building and owning ML systems in production where performance, reliability, or correctness materially mattered
Experience driving ambiguous ML problems from 0→1, including problem formulation, model design, and productionization
Hands-on experience with PyTorch or similar frameworks
Ability to operate independently in high-ambiguity environments with minimal guidance
Strong product and engineering judgment — you know when to use ML, when not to, and how to scope problems accordingly
Comfort working in a fast-moving, early-stage environment
Experience working on systems where decisions have real-world consequences (e.g., healthcare, finance, infrastructure)

Nice to Have

Experience deploying LLMs in production environments
Experience building distributed systems or large-scale data pipelines
Experience working with clinical, biomedical, or other regulated datasets

Compensation

Base salary: $225,000 – $300,000+ Meaningful equity in an early-stage, Series A company

Skills

PyTorchLLMsReinforcement LearningMachine LearningLlm Fine-TuningDistributed SystemsData PipelinesClinical DataProduction MlEvaluation Frameworks

Similar roles

ML Engineering jobs

Lightning AI

Lead Research Engineer

Leads development of performance optimizations for ML models across graph, kernel, and system levels, advances Thunder compiler with new passes and tools, and ensures seamless PyTorch Lightning integration. Requires strong PyTorch expertise, optimization techniques, and distributed systems knowledge.

225k – 275kNew York, NY +1ML EngineeringRemoteCUDACI/CD

Retell AI

Research Scientist - Audio

Conducts ML research on LLMs and audio models to enhance real-time voice agents' reasoning, latency, and conversational quality. Prototypes models, designs evaluations, and bridges research to production systems requiring strong PyTorch expertise and experimental mindset.

225k – 400kRedwood City, CAML EngineeringOn-siteLLMsPyTorch

Latent

Research Scientist

Owns end-to-end ML research initiatives developing novel architectures, training methods, and evaluation for clinical intelligence using longitudinal patient data. Requires strong ML foundation, PyTorch experience, and ability to drive ambiguous high-stakes problems to validated results.

225k – 300kSan Francisco, CAML EngineeringOn-siteNLPLLMs

character.ai

Research Engineer, Multimodal

Research Engineer advancing video/image generation models for AI characters, leading fine-tuning, novel architectures, data pipelines, and optimizations using PyTorch and multimodal techniques. Requires expertise in generation models and distributed training.

225k – 400kRedwood City, CAML EngineeringOn-siteDitLora

character.ai

Research Engineer, Post-Training (All Industry Levels)

Develops alignment algorithms, data pipelines, and sampling methods to optimize post-training AI models for performance and efficiency. Requires PhD or equivalent, ML expertise including reinforcement learning and transformers, and production code experience.

225k – 400kUnited StatesML EngineeringRemoteGCPGpus