Senior AI Engineer

225k – 325k | San Francisco, CA | ML Engineering | Onsite | 6+ YOE | Jun 10

Summary

Build and optimize AI agent orchestration and reasoning systems for the insurance industry. Requires 6+ years in ML/AI, strong Python skills, and exceptional LLM prompting ability.

About the role

What You’ll Do

Set up end-to-end evals to measure & improve agent performance
Experiment with new agentic techniques (e.g. multi-agent systems, reasoning-from-feedback, RFT)
Build lightweight tools, servers, and orchestration layers (e.g. MCP servers) that enable agents to operate reliably in production
Stay on top of emerging research and blogs on LLM/AI agents and bring ideas into production experiments

What We’re Looking For

Amazing ability to speak with LLMs - Occam's razor in prompting
Strong experience with Python
6+ years building in ML/AI
Clear communicator - both in person and in writing
Bonus: background in B2B SaaS and 0-1 experience
Above all: drive, grit, and ownership

Note that this is not a model-training role - you’ll be building orchestration and reasoning systems on top of existing LLMs (think Claude-Code over Claude-Model).

Benefits

Fully covered, best-in-class health, dental, and vision benefits
Competitive compensation, meaningful stock options, company 401(k)
Unlimited PTO
Outstanding in-office culture in the heart of San Francisco
Lunch and dinner onsite
Team events, such as happy hours and off-sites
Pre-tax commuter benefits

Skills

Python, LLM prompting, Multi-agent systems, Agent orchestration, Reasoning systems, MCP servers, Evals, RFT

Similar roles at this salary range

Ironclad

Jun 18

Senior Software Engineer, AI

Lead design and delivery of high-priority AI initiatives across multiple codebases. Build and ship AI-powered features with strong backend fundamentals and product sense.

180k – 220k | San Francisco, CA | ML Engineering | Hybrid | 5+ YOE | React | Evals

Plaid

Jun 18

Machine Learning Engineer - Embedded Insights

Drive ML initiatives from concept to production on the Embedded Insights team. Identify opportunities, build and deploy models using Plaid's financial datasets, and partner with product teams to deliver scalable customer-facing intelligence products.

212k – 272k | San Francisco, CA +2 | ML Engineering | Hybrid | 5+ YOE | SQL | MLOps

Plaid

Jun 18

Machine Learning Engineer

Advance Plaid’s foundation models by developing novel architectures, pretraining objectives, and fine-tuning strategies. Work across the full ML stack from data engineering to production serving and monitoring.

212k – 272k | San Francisco, CA +2 | ML Engineering | Hybrid | 1+ YOE | LLMs | Python

Airbnb

Jun 18

Senior Machine Learning Engineer

Build and deploy cutting-edge Agentic AI and LLM systems to transform Airbnb's customer service experience, including Chat and Voice AI assistants. Requires 6+ years experience with production ML/AI systems at scale.

196k – 227k | United States | ML Engineering | Remote | 6+ YOE | LLM | SFT

Decagon

Jun 18

Staff Software Engineer, Agents

Build and own end-to-end AI agents for enterprise customers, integrating latest text/voice models and iterating based on real-world usage. Requires 8+ years of software engineering experience with Python and TypeScript.

200k – 400k | San Francisco, CA | ML Engineering | On-site | 8+ YOE | Python | AI Agents
