Senior AI Engineer

225k – 325k · San Francisco, CA · ML Engineering · Onsite · 6+ YOE

Summary

Build and optimize AI agent orchestration and reasoning systems for the insurance industry. Requires 6+ years in ML/AI, strong Python skills, and exceptional LLM prompting ability.

About the role

What You’ll Do

  • Set up end-to-end evals to measure & improve agent performance
  • Experiment with new agentic techniques (e.g. multi-agent systems, reasoning-from-feedback, RFT)
  • Build lightweight tools, servers, and orchestration layers (e.g. MCP servers) that enable agents to operate reliably in production
  • Stay on top of emerging research and blogs on LLM/AI agents and bring ideas into production experiments

What We’re Looking For

  • Exceptional ability to prompt LLMs, applying Occam's razor to keep prompts as simple as possible
  • Strong experience with Python
  • 6+ years building in ML/AI
  • Clear communicator - both in person and in writing
  • Bonus: background in B2B SaaS and 0-1 experience
  • Above all: drive, grit, and ownership

Note that this is not a model-training role - you’ll be building orchestration and reasoning systems on top of existing LLMs (think Claude-Code over Claude-Model).

Benefits

  • Fully covered, best-in-class health, dental, and vision benefits
  • Competitive compensation, meaningful stock options, company 401(k)
  • Unlimited PTO
  • Outstanding in-office culture in the heart of San Francisco
  • Lunch and dinner onsite
  • Team events, such as happy hours and off-sites
  • Pre-tax commuter benefits

Skills

Python · LLM prompting · Multi-agent systems · Agent orchestration · Reasoning systems · MCP servers · Evals · RFT