Applied ML Engineer

Designs, deploys, and scales ML infrastructure for production robotics data platform, including inference pipelines, vector databases for semantic search on multimodal data, and training/evaluation workflows. Requires hands-on experience with model serving, cloud infra, and retrieval systems.

183k – 275kSan Francisco, CAML EngineeringOnsite

Apply

About the role

Key Responsibilities

Deploy and operate inference infrastructure for production ML workloads, including model serving, scaling, and cost optimization
Build and maintain vector database integrations and embedding applications to support semantic search over multimodal (image, video, point cloud, and timeseries) robotics data
Design and implement evaluation and training infrastructure, to help us iterate quickly on model performance
Own cloud architecture decisions and tooling that affect inference latency, throughput, cost, and reliability at scale
Collaborate with product engineers to ship application-driven ML features tailored to developers building the cutting edge of robotics and physical AI, not prototype experiments
Identify the right off-the-shelf solutions and adapt them for production, and know when to build vs. buy

What We're Looking For

Strong hands-on experience in production ML infrastructure: cloud inference, model serving optimization frameworks (e.g., TorchServe, vLLM, Triton), and cost management
Experience with the technologies used in building retrieval systems, including vector databases (e.g., Pinecone, Lance, turbopuffer, pgvector) and text-image embedding models
Solid engineering fundamentals: distributed systems, cloud infrastructure (AWS/GCP), and production reliability
A bias toward application and product impact over research; you’re excited by shipping things that work, not writing papers
Proven ability to operate independently, make good tradeoffs, and move fast in a high-ownership environment
Excellent communication skills; you can explain ML tradeoffs to non-ML engineers

Bonus Points

Familiarity with fine-tuning and domain adaptation techniques for LLMs or embedding models (i.e. SFT, PEFT)
Experience with data mining or hybrid search workflows, especially as applied in robotics autonomous vehicles, or physical AI workflows
Experience building ML tooling, data management, and evaluation frameworks from scratch

Skills

TorchservevLLMTritonPineconeLancePgvectorAWSGCPDistributed SystemsLLMsPeftSft

Similar roles

ML Engineering jobs

Foxglove

Applied ML Engineer

Designs, deploys, and scales ML infrastructure for production robotics data platform, including inference pipelines, vector databases for semantic search over multimodal data, and training/evaluation workflows. Requires hands-on experience with model serving, cloud optimization, and retrieval systems.

183k – 275kSan Francisco, CAML EngineeringOn-siteAWSGCP

Zoox

Software Engineer - Tools & Automation

Builds and maintains AI-powered diagnostic tools using agentic systems, data pipelines, and ML models. Requires BS degree, 4+ years experience, strong Python, and AI/ML familiarity.

184k – 231kFoster City, CAML EngineeringHybrid4+ YOELLMsAI/ML

Earnin

Software Engineer (Gen AI)

Build and ship agent-driven GenAI workflows and chatbots for financial products, owning features end-to-end. Requires 3+ years software engineering experience, strong system design, and excitement for AI-assisted development tools.

181k – 222kMountain View, CAML EngineeringHybrid3+ YOELLMsCodex

OpenAI

ML Research Engineer - Hardware Codesign

Research-Hardware Codesign Engineer bridges ML research and silicon architecture, debugging performance gaps, writing quantization kernels, prototyping numerics in RTL, and analyzing system tradeoffs for AI-optimized hardware.

185k – 455kSan Francisco, CAML EngineeringHybridC++JAX

OpenAI

Software Engineer, AI Safety

Build and maintain anti-abuse and content moderation infrastructure to ensure AI safety. Collaborate with engineers on AI alignment techniques, incident response, and risk mitigation using Python and cloud tools.

185k – 325kSan Francisco, CAML EngineeringOn-siteGoC++