Skip to content

Agentic AI Engineer

Builds production-ready agentic AI systems including runtimes, orchestration, reliability, observability, and integrations with LLMs/APIs. Requires strong backend experience, shipped agent/LLM systems, and production reliability expertise.

187k – 253kSeattle, WAWashingtonML EngineeringRemote

About the role

What You Will Do

  • Build agent runtimes and orchestration systems (planning, tool use, memory, coordination)
  • Make agents reliable (retries, failure handling, state management)
  • Make agents observable (tracing, debugging, evaluation)
  • Make agents cheap (cost-aware execution, performance optimization)
  • Make agents useful in production (not demos - real systems people depend on)
  • Integrate LLMs, APIs, and external data into coherent, working systems
  • Define how developers build, debug, and extend agent behavior

What You’ll Bring

Strong Signals

  • You’ve shipped real systems that people depend on
  • You’ve taken ambiguous problems and turned them into working products
  • You’ve built or deeply explored agent systems, LLM pipelines, or automation beyond simple demos
  • You understand system behavior in production - failure modes, tradeoffs, reliability
  • You move fast and iterate - you don’t wait for perfect specs

Technical Foundation

  • Strong backend / systems experience (distributed systems, APIs, infrastructure)
  • Proficiency in at least one core language (Python, Go, TypeScript, Rust, etc.)
  • Experience with reliability patterns (state, retries, observability)
  • Comfort working across the stack when needed

Nice To Have

  • Experience with agent frameworks (LangChain, AutoGen, CrewAI, or custom systems)
  • Experience building multi-agent or orchestration systems
  • Familiarity with LLM evaluation, prompting, or performance tuning
  • Experience with workflow engines (Temporal, Airflow, Dagster)
  • Experience building developer platforms, SDKs, or internal tools
  • Exposure to retrieval systems, vector databases, or memory architectures
  • Interest in crypto, distributed systems, or verifiable compute

Compensation

Target salary range: $187,000 - $253,000. Competitive salary and non-cash compensation (tokens and equity). World class benefits package (medical/dental/vision), remote work setup stipend, flexible hours, flexible time off, 401(k) + match, monthly wellness benefit, yearly off-sites, paid parental leave.

Skills

PythonGoTypeScriptRustLLMsLangChainAutogenCrewaiTemporalAirflowDagsterDistributed SystemsAPIs

Similar roles

ML Engineering jobs

Software Engineer, Machine Learning Platform

Build and operate Chime's ML platform on AWS, including distributed training systems, feature stores, data pipelines, and CI/CD tooling. Partner with ML teams to improve reliability, observability, and developer experience for production models.

187k – 259kSan Francisco, CAML EngineeringHybrid5+ YOEGoAWS

Software Engineer, Inference

Develops and optimizes inference engines for multimodal AI models, integrating new architectures, building scheduling systems, and managing large-scale GPU deployments. Requires strong Python, model serving frameworks like PyTorch/vLLM, and Kubernetes expertise.

188k – 395kPalo Alto, CAML EngineeringHybridvLLMLinux

Software Engineer, ML Platform

Builds foundational ML platform infrastructure including model serving pipelines, GPU scheduling systems, and CI/CD for large-scale multimodal AI models. Requires 5+ years in distributed systems with expertise in Python, Kubernetes, and AWS.

188k – 395kPalo Alto, CAML EngineeringHybrid5+ YOES3AWS

Research Scientist / Engineer – Training Infrastructure

Builds and optimizes distributed training infrastructure for large-scale multimodal AI models across thousands of GPUs. Requires deep expertise in PyTorch, CUDA, parallelization techniques, and GPU clusters.

188k – 395kPalo Alto, CAML EngineeringHybridMpiCUDA

Machine Learning Engineer

Design, train, and deploy large-scale ML recommendation systems and models that power personalization and discovery on Reddit. Requires a Master's degree and 3+ years building production ML systems.

188k – 260kSan Francisco, CAML EngineeringRemote3+ YOES3AWS