Skip to content

Infrastructure Engineer, Sandboxing

300k – 405kSan Francisco, CANew York, NYSeattle, WAHybrid5+ YOE
Summary

Builds and scales secure sandboxed execution environments for AI research, focusing on distributed systems, container orchestration, and cloud infrastructure to ensure safe AI code experimentation.

About the role

Responsibilities

  • Design, build, and operate distributed backend systems that power secure sandboxed execution environments
  • Scale infrastructure to meet growing research and product demands while maintaining reliability and performance
  • Implement and maintain serverless architectures and container orchestration systems
  • Collaborate with research teams to understand requirements and translate them into robust infrastructure solutions
  • Develop monitoring, alerting, and observability systems to ensure operational excellence
  • Participate in on-call rotations and incident response to maintain system reliability
  • Contribute to infrastructure automation and tooling that improves developer productivity
  • Partner with security teams to ensure sandboxing infrastructure maintains appropriate isolation guarantees

You may be a good fit if you

  • Have 5+ years of experience building and operating backend infrastructure at scale
  • Have deep expertise in distributed systems design and implementation
  • Have strong operational experience, including debugging complex production issues
  • Are proficient with cloud platforms, particularly GCP/GCS (experience with AWS or Azure is also valuable)
  • Have experience with containerization technologies (Docker, Kubernetes) and understand their security implications
  • Are comfortable working with infrastructure as code and modern DevOps practices
  • Have strong programming skills in languages such as Python, Go, or Rust
  • Are results-oriented with a bias towards flexibility and impact
  • Care about the societal impacts of your work and are motivated by Anthropic's mission

Strong candidates may also have experience with

  • Serverless architectures and functions-as-a-service platforms (Cloud Functions, Cloud Run, Lambda)
  • Designing and implementing secure multi-tenant systems
  • High-performance computing environments or ML infrastructure
  • Linux systems internals, including namespaces, cgroups, and seccomp
  • Network security and isolation techniques
  • Building systems that support research workflows and rapid iteration

Compensation

Annual Salary: $300,000 — $405,000 USD

Skills
Distributed SystemsKubernetesDockerGoogle CloudPythonGoRustServerless ArchitecturesInfrastructure as CodeLinux
Similar roles at this salary range
All DevOps / SRE jobs →
Sentry

Staff Software Engineer, AI Developer Tooling

Own AI-assisted coding tooling at Sentry. Build harnesses, context systems, and API integrations so AI agents can operate across the full software development lifecycle.

240k – 320kSan Francisco, CADevOps / SREHybridCI/CDPython
Together AI

Staff Engineer, Distributed Storage and HPC & AI Infrastructure

Design and operate multi-petabyte distributed storage systems for large-scale AI training and inference, integrating parallel filesystems and building Kubernetes-native storage platforms.

250k – 300kSan Francisco, CADevOps / SREOn-siteGoCeph
Anthropic

Staff Software Engineer, Infrastructure Asset Systems

As a Staff Software Engineer, you will build and extend systems for tracking, governing, and reporting on infrastructure assets. This involves designing data models, workflow engines, and integrations with financial and procurement systems, ensuring compliance and auditability.

320k – 405kSan Francisco, CA +1DevOps / SREHybridGoSQL
Zoox

Staff Site Reliability Engineer

Zoox is seeking a Staff Site Reliability Engineer to lead source control, owning the technical strategy and roadmap for their Git-based monorepo. This role involves migrating from GitHub Enterprise to GitHub Cloud, building developer tooling, and partnering with various teams to enhance source control as a strategic asset.

250k – 300kFoster City, CADevOps / SREHybridBuckCI/CD
Crusoe

Senior Staff Network Engineer, Automation

Senior technical leader owning Crusoe's network automation platform, source of truth, intent-based config systems, and self-healing workflows across hyperscale multi-vendor fabrics. Requires 12+ years of production network automation experience with deep expertise in Python/Go, model-driven telemetry, and observability at 10K+ device scale.

245k – 295kSan Francisco, CADevOps / SREOn-siteGogNMI