Skip to content

Member of Technical Staff - Engineering

Builds infrastructure and simulation engines for training autonomous AI agents in complex environments using reinforcement learning. Requires strong engineering skills in distributed systems, containerization, networking, and data systems.

200k – 350kSan Francisco, CADevOps / SREOnsite

About the role

Responsibilities

  • Develop an advanced environment simulation engine for training & evaluating autonomous AI agents
  • Build scalable infrastructure to run thousands of simulation environments in parallel
  • Optimize the performance of complex, stateful simulation environments
  • Create tooling to improve environment and task creation processes
  • Publish research

Requirements

  • Strong engineering fundamentals and prolific use of AI tools
  • Experience with infrastructure, containerization, and networking
  • Experience with large scale distributed systems and data systems
  • Interest in reinforcement learning environments

Skills

Reinforcement LearningDistributed SystemsContainerizationNetworkingSimulation EnginesAI AgentsData SystemsInfrastructureKubernetesDocker

Similar roles

DevOps / SRE jobs

Staff Software Engineer, Infrastructure

Hands-on Infrastructure Tech Lead building and scaling AWS cloud infrastructure from scratch for an AI-driven enterprise analytics platform. Owns architecture, IaC, security/compliance (SOC 2), and operational excellence.

200k – 300kSan Francisco, CADevOps / SREHybrid7+ YOEAWSGCP

Member of Technical Staff, DevOps

The Member of Technical Staff, DevOps will own progressive delivery, GitOps, and on-demand environment tooling to improve deployment safety and speed for engineering teams. This role requires a platform-as-a-product mindset and experience with infrastructure as code and CI/CD pipelines.

200k – 270kSan Francisco, CADevOps / SREHybrid5+ YOEGoEKS

Member of Technical Staff, Site Reliability Engineer

Vapi is seeking a Site Reliability Engineer to drive 99.99% call completion for their Voice AI platform. This role involves running incident command, owning SLOs and error budgets, building reliability culture, and shipping code for platform services in Go or TypeScript.

200k – 270kSan Francisco, CADevOps / SREHybrid5+ YOEGoKeda

Staff Platform Engineer, Interoperability

Staff Platform Engineer building developer tooling, CI/CD automation, and scalable web applications using NodeJS, React, and AWS. Requires 10+ years experience and expertise in Temporal, Terraform, PostgreSQL, and Snowflake.

200k – 272kSan Francisco, CADevOps / SREHybrid10+ YOEGitJest

Member of Technical Staff - System Engineering

Builds and operates core distributed systems, infrastructure, and service architecture for Phylo's agentic AI platform, ensuring reliable scaled execution across cloud and enterprise environments. Requires 3+ years in backend/infrastructure engineering with Kubernetes and cloud expertise.

200k – 300kSouth San Francisco, CADevOps / SREOn-siteGoAWS