Skip to content

Staff Software Engineer, AI Platform

202k – 255kSan Francisco, CAML EngineeringOnsite6+ YOE
Summary

Technical leader building agent infrastructure, observability, evals, and guardrails for production AI systems at Watershed. Requires 6+ years backend/platform/AI engineering experience and production TypeScript systems.

About the role

Responsibilities

  • Design and build the agent infrastructure that powers Watershed's products
  • Develop the observability and tracing layer for agent decisions, making it possible to debug, evaluate, and improve agent behavior at scale
  • Build evals, harnesses, and guardrails that turn agent capabilities into production-grade, dependable systems
  • Collaborate with product and other AI engineering teams to set product and technical strategy, and define the boundaries between autonomous agent behavior, deterministic code, and human oversight
  • Keep up with developments and state-of-the-art in AI and agent infrastructure to determine what is relevant to Watershed
  • Work closely with Watershed product teams to contribute your expertise to build agent experiences across the product
  • Write performant, well-crafted, tested, and maintainable code across our technical stack

Requirements

  • 6+ years of experience in backend, platform, or AI/ML engineering
  • Experience building products and infrastructure that leverage LLMs, embeddings, and other ML technologies
  • Full lifecycle experience building, deploying, and monitoring production systems that depend on LLMs or other ML technologies
  • Experience with model evaluation, agent observability, and making non-deterministic systems reliable
  • Experience building and operating production Typescript systems
  • Must be willing to work from an office 4 days per week
Skills
TypeScriptLLMsEmbeddingsMachine LearningAgent InfrastructureObservabilityModel EvaluationBackend EngineeringProduction SystemsAI/ML Engineering
Similar roles at this salary range
All ML Engineering jobs →
Mem0

Senior Research Engineer

Own the end-to-end lifecycle of memory features for AI agents. Fine-tune models, implement research, build evaluations, and ship production systems with Engineering.

175k – 250kSan Francisco, CAML EngineeringOn-site7+ YOERAGvLLM
Ironclad

Senior Software Engineer, AI

Lead design and delivery of high-priority AI initiatives across multiple codebases. Build and ship AI-powered features with strong backend fundamentals and product sense.

180k – 220kSan Francisco, CAML EngineeringHybrid5+ YOEReactEvals
Mercury

Senior Machine Learning Operations Engineer

Build and operate Mercury's real-time ML inference platform for fraud risk decisioning. Own model deployment, observability, and lifecycle tooling with strong backend Python fundamentals.

167k – 208kSan Francisco, CA +2ML EngineeringHybrid5+ YOESQLSHAP
Plaid

Machine Learning Engineer - Embedded Insights

Drive ML initiatives from concept to production on the Embedded Insights team. Identify opportunities, build and deploy models using Plaid's financial datasets, and partner with product teams to deliver scalable customer-facing intelligence products.

212k – 272kSan Francisco, CA +2ML EngineeringHybrid5+ YOESQLMLOps
Plaid

Machine Learning Engineer

Advance Plaid’s foundation models by developing novel architectures, pretraining objectives, and fine-tuning strategies. Work across the full ML stack from data engineering to production serving and monitoring.

212k – 272kSan Francisco, CA +2ML EngineeringHybrid1+ YOELLMsPython