Skip to content

Member of Technical Staff - System Engineering

Builds and operates core distributed systems, infrastructure, and service architecture for Phylo's agentic AI platform, ensuring reliable scaled execution across cloud and enterprise environments. Requires 3+ years in backend/infrastructure engineering with Kubernetes and cloud expertise.

200k – 300kSouth San Francisco, CADevOps / SREOnsite

About the role

Responsibilities

  • Design and build production systems that orchestrate agent execution and power AI-driven scientific workloads.
  • Build and operate scalable, reliable infrastructure across cloud, hybrid, and on-prem enterprise environments.
  • Develop systems for sandboxed execution, secure task isolation, and controlled compute environments.
  • Design and implement security, access control, and compliance foundations suitable for enterprise deployments.
  • Partner closely with ML and science teams to translate computational workflows into robust, production-grade distributed systems.

Requirements

  • 3+ years of industry experience in backend, infrastructure, or distributed systems engineering.
  • Strong proficiency in at least one programming language (Python, Go, Rust, or similar).
  • Experience designing and operating distributed systems in production.
  • Deep hands-on experience with containerization and Kubernetes.
  • Experience with infrastructure-as-code tooling (Terraform, Pulumi, or equivalent).
  • Experience operating systems on at least one major cloud provider (AWS, GCP, or Azure).
  • Comfort owning systems end-to-end in fast-moving, high-autonomy environments.

Nice to Haves

  • Experience building enterprise SaaS platform that supports single-tenant, customer hosted deployment patterns.
  • Experience in building R&D infrastructure at Pharma/Biotech.
  • Experience with job orchestration, task scheduling, or workflow engines.
  • Experience with sandboxed or isolated execution frameworks (gVisor, Kata Containers, Firecracker).
  • Familiarity with distributed storage, observability systems, or high-performance compute environments.

Compensation & Benefits

  • Competitive salary and equity share.
  • Full medical, dental, and vision coverage, including free therapy sessions and eyewear stipend.
  • 401(k).
  • Unlimited PTO (US only).
  • Lunch and snacks when in office.
  • Regular team offsites and company events.

Skills

KubernetesTerraformPythonGoRustAWSGCPAzurePulumiDistributed Systems

Similar roles

DevOps / SRE jobs

Staff Software Engineer, Infrastructure

Hands-on Infrastructure Tech Lead building and scaling AWS cloud infrastructure from scratch for an AI-driven enterprise analytics platform. Owns architecture, IaC, security/compliance (SOC 2), and operational excellence.

200k – 300kSan Francisco, CADevOps / SREHybrid7+ YOEAWSGCP

Member of Technical Staff, DevOps

The Member of Technical Staff, DevOps will own progressive delivery, GitOps, and on-demand environment tooling to improve deployment safety and speed for engineering teams. This role requires a platform-as-a-product mindset and experience with infrastructure as code and CI/CD pipelines.

200k – 270kSan Francisco, CADevOps / SREHybrid5+ YOEGoEKS

Member of Technical Staff, Site Reliability Engineer

Vapi is seeking a Site Reliability Engineer to drive 99.99% call completion for their Voice AI platform. This role involves running incident command, owning SLOs and error budgets, building reliability culture, and shipping code for platform services in Go or TypeScript.

200k – 270kSan Francisco, CADevOps / SREHybrid5+ YOEGoKeda

Staff Platform Engineer, Interoperability

Staff Platform Engineer building developer tooling, CI/CD automation, and scalable web applications using NodeJS, React, and AWS. Requires 10+ years experience and expertise in Temporal, Terraform, PostgreSQL, and Snowflake.

200k – 272kSan Francisco, CADevOps / SREHybrid10+ YOEGitJest

Member of Technical Staff - Engineering

Builds infrastructure and simulation engines for training autonomous AI agents in complex environments using reinforcement learning. Requires strong engineering skills in distributed systems, containerization, networking, and data systems.

200k – 350kSan Francisco, CADevOps / SREOn-siteDockerAI Agents