Skip to content

AI Platform Engineer

170k – 205kSan Mateo, CAHybrid5+ YOE
Summary

Design, build, and maintain LLM integrations powering AI features. Own end-to-end delivery from requirements through production monitoring with focus on scalability and reliability.

About the role

What You’ll Do

  • Develop and maintain LLM integrations to power AI features across solutions.
  • Ensure scalability, reliability, and performance of AI features in production.
  • Translate abstract requirements into structured, sound technical plans and milestones.
  • Own implementations end-to-end: discovery/requirements → design → build → launch → post-delivery monitoring/iterating.
  • Evaluate and articulate implications and trade-offs of technical choices.
  • Leverage AI agents to improve development velocity and operational efficiency.
  • Collaborate across engineering and adjacent teams to share learnings, improve processes, and continuously raise quality.

Requirements

  • Strong proficiency in Python for production software.
  • Proficiency with Jupyter Notebook or an equivalent environment (e.g., JupyterLab, Databricks, Colab, etc.).
  • Demonstrated experience building, integrating, and operating LLM-powered features/services.
  • Ability to decompose ambiguous problems, write clear technical plans, and execute with high ownership.
  • Experience designing for reliability, scalability, and observability in production systems.
  • You leverage AI Agents for day-to-day efficiency.

Nice to Have

  • Terraform and Helm Charts for infrastructure and deployment.
  • Google Cloud Platform (e.g., GKE, Cloud Run, Cloud Storage).
  • Typescript for service or UI integrations.
  • Postgres for application data modeling and performance.
  • Experience with ML/AI platforms, agents, or orchestration frameworks.
Skills
PythonLLM integrationJupyter NotebookDatabricksTerraformHelmGoogle Cloud PlatformGKECloud RunTypeScriptPostgres
Similar roles at this salary range
All ML Engineering jobs →
Nuance Labs

Member of Technical Staff - Research Fellow

3-month research fellowship for early-career researchers working on frontier Multimodal LLMs, generative modeling, and real-time audiovisual AI. Own a research problem in pretraining, post-training, RL, evaluation, or multimodal modeling. Strong PyTorch and first-author tier-1 paper required.

200k – 250kSeattle, WAML EngineeringOn-sitePyTorchDeep Learning
Pinterest

Machine Learning Engineer II, Computer Vision Applied Science

Build and fine-tune vision-centric VLMs and generative models using Pinterest's visual-text datasets. Requires 2+ years industry computer vision experience and an M.S. or Ph.D.

139k – 286kSan Francisco, CAML EngineeringRemote2+ YOELLMsRLHF
Snowflake

Senior Software Engineer — LLM Post-Training Platform

Build and scale Snowflake's Cortex Training LLM post-training platform, handling distributed GPU scheduling, orchestration, and productionizing research for enterprise-scale model adaptation.

200k – 288kBellevue, WAML EngineeringOn-site5+ YOERayFSDP
Nuance Labs

Member of Technical Staff — Model Optimization and Inference

Early-career engineer optimizing inference for real-time multimodal AI avatars. Focus on KV cache strategies, serving frameworks, quantization, and latency reduction for LLMs and diffusion models.

200k – 300kSeattle, WAML EngineeringOn-siteEntry levelvLLMCUDA
Otter

Machine Learning Engineer

Lead projects building and deploying large-scale ASR/NLP/LLM systems for meeting intelligence. Architect training, fine-tuning, and inference pipelines using PyTorch/JAX and own ML systems from research to production.

196k – 221kMountain View, CAML EngineeringHybrid3+ YOEJAXASR