Skip to content

Staff AI Engineer

240k – 310kSan Francisco, CAML EngineeringHybrid8+ YOE
Summary

Staff AI Engineer shapes technical direction of AI agents for audit workflows, owning architecture, LLM infrastructure, and reliability systems. Requires 8+ years experience with deep expertise in TypeScript/Python, RAG, and enterprise-scale LLM products.

About the role

What You’ll Own

Define and Drive Technical Direction

  • Set the architectural direction for Fieldguide's agentic systems and intelligence layer
  • Identify and resolve systemic gaps that limit reliability, scalability, or product velocity
  • Make high-leverage technical bets and drive them to completion across the organization
  • Establish evaluation standards, observability infrastructure, and reliability frameworks at scale

Build and Ship High-Impact Systems

  • Own the most complex, cross-cutting product areas end-to-end
  • Design agent orchestration and retrieval systems that handle enterprise-grade complexity
  • Lead the evolution of Fieldguide's LLM infrastructure: models, prompts, guardrails, and feedback loops
  • Prototype, validate, and harden novel approaches before sharing with the broader team

Drive Organizational Leverage

  • Partner with Engineering leadership, Product, and Design to shape the AI roadmap
  • Create reusable platforms, patterns, and tooling that multiply team velocity across the org
  • Identify and develop the next generation of senior engineers
  • Translate long-term product vision into concrete technical strategies

Represent and Advance the Field

  • Stay at the frontier of LLM and agent research; rapidly evaluate and adopt what matters
  • Build internal knowledge and shared understanding of AI system design
  • Contribute to Fieldguide's reputation as a technical leader in vertical AI

Who You Are

Strong software engineer with:

  • Technical depth with organizational influence
  • Systems thinker
  • Strong product judgment
  • Multiplier effect
  • Principled pragmatism
  • End-to-end ownership

Experience

  • 8+ years shipping production software, with 3+ years leading complex AI/ML systems
  • Deep expertise in TypeScript, Python, and distributed system design
  • Proven track record shipping LLM-powered products at enterprise scale
  • Experience defining evaluation infrastructure and reliability standards for AI systems
  • Mastery of retrieval architectures, RAG, vector databases, and embedding pipelines
  • Hands-on with cutting-edge LLM APIs and agent frameworks (OpenAI, Anthropic, Gemini, LangGraph, etc.)
  • Experience influencing technical direction across multiple teams or product areas
  • Track record mentoring senior engineers and growing engineering culture

What Should Excite You

  • Enterprise-grade reliability
  • Human-in-the-loop design
  • Nuanced evaluation
  • Explainability
  • Complex domains
  • Shipping daily value

Benefits

  • Competitive compensation with equity
  • Comprehensive health and wellness benefits
  • Flexible time off and work schedules
  • Technology reimbursements
  • 401(k) plan
  • Twice-yearly in-person offsites across the U.S.
  • Wellness benefits starting on your first day
Skills
TypeScriptPythonLLMRAGVector DatabasesOpenAIAnthropicGeminiLangGraphDistributed SystemsRetrieval ArchitecturesAgent Frameworks
Similar roles at this salary range
All ML Engineering jobs →
Plaid

Machine Learning Engineer - Embedded Insights

Drive ML initiatives from concept to production on the Embedded Insights team. Identify opportunities, build and deploy models using Plaid's financial datasets, and partner with product teams to deliver scalable customer-facing intelligence products.

212k – 272kSan Francisco, CA +2ML EngineeringHybrid5+ YOESQLMLOps
Plaid

Machine Learning Engineer

Advance Plaid’s foundation models by developing novel architectures, pretraining objectives, and fine-tuning strategies. Work across the full ML stack from data engineering to production serving and monitoring.

212k – 272kSan Francisco, CA +2ML EngineeringHybrid1+ YOELLMsPython
Airbnb

Senior Machine Learning Engineer

Build and deploy cutting-edge Agentic AI and LLM systems to transform Airbnb's customer service experience, including Chat and Voice AI assistants. Requires 6+ years experience with production ML/AI systems at scale.

196k – 227kUnited StatesML EngineeringRemote6+ YOELLMSFT
Decagon

Staff Software Engineer, Agents

Build and own end-to-end AI agents for enterprise customers, integrating latest text/voice models and iterating based on real-world usage. Requires 8+ years of software engineering experience with Python and TypeScript.

200k – 400kSan Francisco, CAML EngineeringOn-site8+ YOEPythonAI Agents
Reddit

Staff Machine Learning Engineer, Notifications Relevance

Technical leader for Reddit's Notifications Relevance ML systems, driving large-scale recommendation systems spanning retrieval, ranking, budget optimization, and LLM-powered experiences.

230k – 322kUnited StatesML EngineeringRemote8+ YOEPythonGolang