Skip to content

Sr. Applied AI Engineer

133k – 287kSan Francisco, CARemote7+ YOE
Summary

Build and evolve shared AI platform infrastructure including LLM Ops and ML Ops tooling to enable scalable, secure AI development across teams. Requires 7+ years software engineering with 3+ years in production ML/AI systems and 2+ years platform experience.

About the role

Responsibilities

  • Build and evolve shared AI Platform capabilities that serve as the foundation for teams building with machine learning and generative AI across Zapier.
  • Improve LLM Ops and ML Ops capabilities, including observability, monitoring, evaluation, deployment workflows, and operational guardrails.
  • Design and implement systems that help teams measure and improve the performance, reliability, safety, and cost efficiency of AI-powered experiences.
  • Proactively identify tooling gaps and work across teams to standardize best practices for building, deploying, and monitoring AI-driven experiences.
  • Collaborate closely with engineers across product, infra, and data teams to ensure AI components are reusable, well-documented, and easy to adopt company-wide.
  • Evaluate emerging tools, models, and patterns in the AI ecosystem, and help determine which ones should be incorporated into Zapier’s shared platform.

Requirements

  • 7+ years of experience in software engineering, with at least 3 years dedicated to building distributed, scalable, cloud-based ML/AI systems in production environments.
  • At least 2 years of experience in LLM Ops, ML Ops, or adjacent platform/infrastructure work.
  • Experience building shared services, internal platforms, or reusable developer tooling that enable other teams to move faster.
  • Experience working through the full lifecycle of building, testing, deploying, and scaling ML/LLM architectures.
  • Experience building with cloud infrastructure technologies.
  • Comfort with typed languages and modern backend practices (mostly TypeScript & Python).

Nice-to-Haves

  • Experience in TypeScript & Python.
Skills
TypeScriptPythonLLM OpsML OpsCloud InfrastructureObservabilityMachine LearningGenerative AIDeployment WorkflowsMonitoringEvaluation Frameworks
Similar roles at this salary range
All DevOps / SRE jobs →
WHOOP

Senior Platform Engineer - Kubernetes

Senior Platform Engineer responsible for designing, operating, and scaling Kubernetes clusters on AWS. Focuses on CI/CD, infrastructure automation, and developer productivity across WHOOP's technology stacks.

150k – 215kBoston, MADevOps / SREHybrid5+ YOEC#AWS
Snowflake

Software Engineer (AI Engineer/Developer Experience)

Senior engineer on the Developer Experience team building AI-powered infrastructure and tooling (Bazel, IDEs, Cloud Workspaces) to boost Snowflake engineers' productivity. Requires 7+ years experience and fluency in Java/C++/Python/Golang.

128k – 184kBellevue, WA +1DevOps / SREHybrid7+ YOEC++Java
Komodo Health

Senior Data Engineer, Sentinel (Pacific Time Zone)

Senior Infrastructure Engineer building and operating AWS cloud infrastructure for healthcare data platform. Requires Python, Terraform, CI/CD expertise, and big data tools experience.

153k – 210kUnited StatesDevOps / SRERemote5+ YOEAWSVPC
Pinterest

Sr. Production Engineer, Solutions Engineering

Senior Production Engineer building AI agents, platforms, and automation to ensure reliability of Pinterest's large-scale distributed systems serving hundreds of millions of users.

140k – 288kChicago, IL +1DevOps / SRERemote5+ YOEGoAWS
Nuro

Software Reliability Engineer

Build and operate resilient systems for Nuro's autonomous vehicle fleet. Design pipelines, automation, and tools to improve reliability and reduce operational toil. Join on-call rotation and lead investigations.

109k – 163kMountain View, CADevOps / SREOn-siteGoC++