AI Platform Engineer

170k – 205kSan Mateo, CAHybrid5+ YOEJun 12

Summary

Design, build, and maintain LLM integrations powering AI features. Own end-to-end delivery from requirements through production monitoring with focus on scalability and reliability.

About the role

What You’ll Do

Develop and maintain LLM integrations to power AI features across solutions.
Ensure scalability, reliability, and performance of AI features in production.
Translate abstract requirements into structured, sound technical plans and milestones.
Own implementations end-to-end: discovery/requirements → design → build → launch → post-delivery monitoring/iterating.
Evaluate and articulate implications and trade-offs of technical choices.
Leverage AI agents to improve development velocity and operational efficiency.
Collaborate across engineering and adjacent teams to share learnings, improve processes, and continuously raise quality.

Requirements

Strong proficiency in Python for production software.
Proficiency with Jupyter Notebook or an equivalent environment (e.g., JupyterLab, Databricks, Colab, etc.).
Demonstrated experience building, integrating, and operating LLM-powered features/services.
Ability to decompose ambiguous problems, write clear technical plans, and execute with high ownership.
Experience designing for reliability, scalability, and observability in production systems.
You leverage AI Agents for day-to-day efficiency.

Nice to Have

Terraform and Helm Charts for infrastructure and deployment.
Google Cloud Platform (e.g., GKE, Cloud Run, Cloud Storage).
Typescript for service or UI integrations.
Postgres for application data modeling and performance.
Experience with ML/AI platforms, agents, or orchestration frameworks.

Skills

PythonLLM integrationJupyter NotebookDatabricksTerraformHelmGoogle Cloud PlatformGKECloud RunTypeScriptPostgres

Similar roles at this salary range

All ML Engineering jobs →

Nuance Labs

Jun 11

Member of Technical Staff - Research Fellow

3-month research fellowship for early-career researchers working on frontier Multimodal LLMs, generative modeling, and real-time audiovisual AI. Own a research problem in pretraining, post-training, RL, evaluation, or multimodal modeling. Strong PyTorch and first-author tier-1 paper required.

200k – 250kSeattle, WAML EngineeringOn-sitePyTorchDeep Learning

Jun 11

Machine Learning Engineer II, Computer Vision Applied Science

Build and fine-tune vision-centric VLMs and generative models using Pinterest's visual-text datasets. Requires 2+ years industry computer vision experience and an M.S. or Ph.D.

139k – 286kSan Francisco, CAML EngineeringRemote2+ YOELLMsRLHF

Snowflake

Jun 11

Senior Software Engineer — LLM Post-Training Platform

Build and scale Snowflake's Cortex Training LLM post-training platform, handling distributed GPU scheduling, orchestration, and productionizing research for enterprise-scale model adaptation.

200k – 288kBellevue, WAML EngineeringOn-site5+ YOERayFSDP

Nuance Labs

Jun 11

Member of Technical Staff — Model Optimization and Inference

Early-career engineer optimizing inference for real-time multimodal AI avatars. Focus on KV cache strategies, serving frameworks, quantization, and latency reduction for LLMs and diffusion models.

200k – 300kSeattle, WAML EngineeringOn-siteEntry levelvLLMCUDA

Otter

Jun 11

Machine Learning Engineer

Lead projects building and deploying large-scale ASR/NLP/LLM systems for meeting intelligence. Architect training, fine-tuning, and inference pipelines using PyTorch/JAX and own ML systems from research to production.

196k – 221kMountain View, CAML EngineeringHybrid3+ YOEJAXASR

Apply