Applied AI Engineer

180k – 250k · San Francisco, CA · Onsite · 3+ YOE

Summary

Builds production AI agents and workflows for governance, risk, and compliance (GRC), fine-tuning LLMs on proprietary data for high-accuracy tasks like regulatory analysis and risk assessment. Requires 3+ years of applied AI experience with production ML systems, with an emphasis on explainability and responsible AI.

About the role

What you'll do

  • Build & fine-tune models. Fine-tune foundation models on proprietary data and implement novel techniques to achieve world-class accuracy on complex GRC tasks.
  • Develop AI-native workflows. Build sophisticated, multi-step agentic workflows that automate complex GRC processes — from risk assessment to compliance monitoring to evidence collection.
  • Champion responsible AI. Implement and pioneer methods for AI explainability and safety. Our systems must be transparent, auditable, and fair. This is non-negotiable in our domain.
  • Drive innovation. Rapidly prototype, evaluate, and integrate state-of-the-art research (agents, RAG, new architectures) into reliable, production-grade features.

Representative projects

  • Build an AI agent that analyzes thousands of regulatory documents and internal controls, identifying compliance gaps with higher accuracy than a team of human experts.
  • Develop an explainable AI system for risk assessment, allowing auditors and executives to understand and trust the AI's reasoning on high-stakes decisions.
  • Build an advanced RAG pipeline over a massive corpus of unstructured company data to produce precise, verifiable assessments against complex compliance requirements.
  • Partner with GRC subject matter experts to create ground-truth datasets for tasks like third-party risk evaluation, then fine-tune models that become the industry standard.

What you have

  • 3+ years in an applied AI or machine learning engineering role.
  • Proven product sense. You've shipped reliable, production-scale ML products. You know how to use offline evaluation and online experimentation to achieve high-performance results.
  • Hands-on applied AI expertise. Direct experience building with LLMs — fine-tuning, RAG, and agentic systems. Not theoretical. You've put these into production.
  • High agency. You take full ownership of outcomes, move with a bias for action, and have a relentless drive for world-class accuracy. You see constraints as design problems, not blockers.
  • Strong communication. You can work closely with security and GRC research counterparts and articulate technical tradeoffs clearly.

Compensation & benefits

  • Competitive salary + significant equity
  • Flexible PTO
  • Medical, dental, and vision insurance
  • Meals and snacks in the office
  • Relocation and immigration support

Skills

LLMs, fine-tuning, RAG, agentic systems, machine learning, AI explainability, AI safety, foundation models, prompt engineering, evaluation metrics