Skip to content

Staff Software Engineer, Developer Productivity

405k – 485kSan Francisco, CANew York, NYDevOps / SREHybrid7+ YOE
Summary

Staff-level engineer to own end-to-end development environments at Anthropic, focusing on container lifecycle, cold-start optimization, environment isolation, and pre-push validation for AI researchers and engineers.

About the role

Key responsibilities

  • Own the local and hosted development environment end to end — container lifecycle, dependency provisioning, hot reload, and the single command an engineer runs to start working
  • Drive down cold-start time for fresh development environments and keep it low as the codebase grows
  • Design and implement the environment isolation model (sandboxes, ephemeral environments, namespace separation) that lets engineers experiment freely without risk to shared systems
  • Build and maintain the pre-push validation surface so failures are caught on the engineer's machine, not in CI
  • Partner with platform, delivery infrastructure, and tooling teams to shape the repository and service topology that best supports a fast inner loop
  • Act as a technical lead across team boundaries — gathering requirements, building consensus, and advocating for the approach that's right for engineers across Anthropic

Minimum qualifications

  • Significant professional software engineering experience in backend or developer infrastructure domains
  • Proficiency in Python
  • Hands-on experience with containers (Docker or equivalent), Kubernetes, and pod-level operations
  • Prior ownership of a developer environment, build system, or paved-path workflow used by a multi-team engineering organization, with demonstrable adoption
  • Experience working across team boundaries to deliver infrastructure that other engineers depend on
  • Daily, hands-on use of AI coding assistants as part of your own development workflow

Preferred qualifications

  • 7+ years of backend or developer infrastructure engineering experience
  • Experience with Rust or Go
  • A track record of reducing cold-start or boot time on a complex multi-service stack to under a minute, with before/after measurements
  • Prior design of environment isolation models such as ephemeral environments, sandboxes, or isolated namespaces
  • Experience leading (or making the case against) a monorepo extraction, repo split, or comparable scope-boundary migration from the developer-tooling side
  • Familiarity with Bazel, Buck, Nix, or similar hermetic build systems
  • Experience operating as a platform tech lead — broad context across the stack and a history of cross-team influence

Representative projects

  • Rebuilding the dev container image pipeline so a new engineer goes from git clone to a running environment in under 60 seconds
  • Designing an ephemeral environment system that gives every branch its own isolated copy of downstream services
  • Shipping a pre-push hook framework that runs the relevant subset of tests and lints locally, cutting CI failure rate for first-attempt pushes
  • Instrumenting the inner loop to produce a live dashboard of p50/p95 edit-build-test latency across the engineering org
  • Authoring the design doc and migration plan for how development environments should evolve alongside a major repository restructure
Skills
PythonDockerKubernetesRustGoBazelBuckNixContainersBuild Systems
Similar roles at this salary range
All DevOps / SRE jobs →
Thinking Machines Lab

Reliability Engineer, Supercomputing

Ensure reliability of large GPU supercomputing clusters by diagnosing hardware/firmware/OS issues, automating monitoring, driving firmware rollouts, and working directly with vendors.

350k – 475kSan Francisco, CADevOps / SREOn-siteBMCRust
Thinking Machines Lab

Network Engineer, Supercomputing

Own and debug multi-thousand-GPU network fabric (RDMA/RoCE, NVLink/NVSwitch) for large-scale AI training and inference. Requires backend language proficiency, large-scale cluster experience, and cross-stack ownership.

350k – 475kSan Francisco, CADevOps / SREOn-siteRustRDMA
Anthropic

Staff Software Engineer, Developer Productivity

Staff-level IC role owning end-to-end CI/CD, merge queue, and deploy pipelines for Anthropic's engineering org. Focus on AI-assisted review, test reliability, and progressive delivery at monorepo scale.

405k – 485kSan Francisco, CA +1DevOps / SREHybrid7+ YOEGoRust
Anthropic

Staff Software Engineer, Node Infra

Own technical strategy and roadmap for node lifecycle management, health automation, and scaling AI clusters across clouds and accelerators. Requires deep distributed systems expertise, ML accelerator experience, and 12+ years leading complex multi-team infrastructure initiatives.

405k – 485kSan Francisco, CA +2DevOps / SREHybrid12+ YOEGoAWS
Anthropic

Staff Software Engineer, Kubernetes Platform

Senior-level engineer to own and scale Anthropic's massive Kubernetes control plane and scheduler for training frontier AI models across hundreds of thousands of nodes. Requires deep Kubernetes internals experience and 12+ years building production distributed systems.

405k – 485kSan Francisco, CA +2DevOps / SREHybrid12+ YOEGoC++