Skip to content

Applied ML Engineer

Designs, deploys, and scales ML infrastructure for production robotics data platform, including inference pipelines, vector databases for semantic search on multimodal data, and training/evaluation workflows. Requires hands-on experience with model serving, cloud infra, and retrieval systems.

183k – 275kSan Francisco, CAML EngineeringOnsite

About the role

Key Responsibilities

  • Deploy and operate inference infrastructure for production ML workloads, including model serving, scaling, and cost optimization
  • Build and maintain vector database integrations and embedding applications to support semantic search over multimodal (image, video, point cloud, and timeseries) robotics data
  • Design and implement evaluation and training infrastructure, to help us iterate quickly on model performance
  • Own cloud architecture decisions and tooling that affect inference latency, throughput, cost, and reliability at scale
  • Collaborate with product engineers to ship application-driven ML features tailored to developers building the cutting edge of robotics and physical AI, not prototype experiments
  • Identify the right off-the-shelf solutions and adapt them for production, and know when to build vs. buy

What We're Looking For

  • Strong hands-on experience in production ML infrastructure: cloud inference, model serving optimization frameworks (e.g., TorchServe, vLLM, Triton), and cost management
  • Experience with the technologies used in building retrieval systems, including vector databases (e.g., Pinecone, Lance, turbopuffer, pgvector) and text-image embedding models
  • Solid engineering fundamentals: distributed systems, cloud infrastructure (AWS/GCP), and production reliability
  • A bias toward application and product impact over research; you’re excited by shipping things that work, not writing papers
  • Proven ability to operate independently, make good tradeoffs, and move fast in a high-ownership environment
  • Excellent communication skills; you can explain ML tradeoffs to non-ML engineers

Bonus Points

  • Familiarity with fine-tuning and domain adaptation techniques for LLMs or embedding models (i.e. SFT, PEFT)
  • Experience with data mining or hybrid search workflows, especially as applied in robotics autonomous vehicles, or physical AI workflows
  • Experience building ML tooling, data management, and evaluation frameworks from scratch

Skills

TorchservevLLMTritonPineconeLancePgvectorAWSGCPDistributed SystemsLLMsPeftSft

Similar roles

ML Engineering jobs

Applied ML Engineer

Designs, deploys, and scales ML infrastructure for production robotics data platform, including inference pipelines, vector databases for semantic search over multimodal data, and training/evaluation workflows. Requires hands-on experience with model serving, cloud optimization, and retrieval systems.

183k – 275kSan Francisco, CAML EngineeringOn-siteAWSGCP

Software Engineer - Tools & Automation

Builds and maintains AI-powered diagnostic tools using agentic systems, data pipelines, and ML models. Requires BS degree, 4+ years experience, strong Python, and AI/ML familiarity.

184k – 231kFoster City, CAML EngineeringHybrid4+ YOELLMsAI/ML

Software Engineer (Gen AI)

Build and ship agent-driven GenAI workflows and chatbots for financial products, owning features end-to-end. Requires 3+ years software engineering experience, strong system design, and excitement for AI-assisted development tools.

181k – 222kMountain View, CAML EngineeringHybrid3+ YOELLMsCodex

ML Research Engineer - Hardware Codesign

Research-Hardware Codesign Engineer bridges ML research and silicon architecture, debugging performance gaps, writing quantization kernels, prototyping numerics in RTL, and analyzing system tradeoffs for AI-optimized hardware.

185k – 455kSan Francisco, CAML EngineeringHybridC++JAX

Software Engineer, AI Safety

Build and maintain anti-abuse and content moderation infrastructure to ensure AI safety. Collaborate with engineers on AI alignment techniques, incident response, and risk mitigation using Python and cloud tools.

185k – 325kSan Francisco, CAML EngineeringOn-siteGoC++