Backend Infrastructure Engineer

175k – 280kSan Francisco, CABellevue, WANew York, NYDevOps / SREOnsite5+ YOEDec 30

Summary

Builds developer infrastructure tools including CI/CD pipelines, build systems, and self-serve frameworks to enable fast, reliable engineering workflows. Requires 5+ years experience with Python/TypeScript, automation focus, and dev tooling expertise.

About the role

Responsibilities

Own the developer experience from onboarding through deploy — dev environments, build systems, CI/CD, test infrastructure, and deploy tooling
Build frameworks and tooling that make CI fast, reliable, and effortless. Engineers shouldn't have to think about it — it should just work
Design self-serve tooling that lets product and ML engineers move quickly without filing tickets or waiting on you
Give teams clear, reliable visibility into what's in production, when it shipped, and how it's performing
Explore what it means to bring AI-assisted intelligence into developer tooling — this is a genuinely exciting frontier, and we're approaching it thoughtfully
Keep the complex systems that support rapid development healthy: build systems, dependency management, test infrastructure, and developer environments all need ongoing care
Think creatively about what a performance-sensitive stack — GPUs, co-located servers, real-time requirements — means for the developer environment. There's no established playbook here, and that's part of the appeal

Required Qualifications

You're a thoughtful software engineer who also loves infrastructure. You write clean, well-designed code and care about the experience of the people using what you build
You've worked on CI/CD systems and have real experience solving the hard problems — slow builds, flaky tests, painful merges — with durable solutions
You've built developer tooling that other engineers genuinely wanted to use: self-serve frameworks, CLI tools, build systems, environment setup, or debugging tools
You're comfortable in Python and/or TypeScript — our monorepo leans on both
You have 5+ years in software engineering, with meaningful time spent on developer infrastructure or tooling
Automation is your default. Doing something manually twice feels like a problem worth solving
You think from the perspective of the engineers using your work. Developer experience isn't a buzzword to you — it's the whole job

Preferred Qualifications

Monorepo tooling and build systems (e.g. Bazel, Pants, Moonrepo, Nx, or similar approaches)
Cloud-based development environments (e.g. Coder, Codespaces, devcontainers)
CI platform depth — going beyond workflow files into managing runners, optimising costs, and building custom tooling
Containerised builds and local/CI parity (e.g. Docker, Dagger)
Deploy pipeline patterns: canary deployments, feature flags, automated rollbacks, GitOps
AI coding tools — not just using them, but configuring, customising, or building tooling around them
Security engineering in developer workflows: dependency scanning, secrets management, supply chain controls
GCP experience

Skills

PythonTypeScriptCI/CDBazelPantsDockerGitOpsGCPKubernetesMonorepo

Similar roles at this salary range

All DevOps / SRE jobs →

Plaid

Jun 19

Staff Site Reliability Engineer, Release Engineering

Staff SRE on the Release Engineering team defining and scaling reliability practices, architecting SLO/error-budget programs, and driving progressive delivery and automated safety gates across product engineering.

208k – 274kNew York, NYDevOps / SREHybrid8+ YOEGoSLO

Fivetran

Jun 18

Senior Site Reliability Engineer

Senior SRE responsible for production infrastructure reliability, incident response, deployment automation, and scaling SaaS systems on Kubernetes and major cloud platforms.

175k – 210kOakland, CADevOps / SREHybrid5+ YOEAWSGCP

Dropbox

Jun 18

Senior Infrastructure Software Engineer, Storage Core

Senior engineer building and operating Dropbox's exabyte-scale distributed storage systems. Focus on replication, erasure coding, performance, and reliability in Go/Rust.

180k – 274kUnited StatesDevOps / SRERemote9+ YOEGoC++

Okta

Jun 17

Staff Site Reliability Engineer - Observability

Staff SRE focused on building and scaling a comprehensive observability platform on GCP using Terraform, Splunk, and Grafana. Requires 5+ years GCP observability experience and strong coding skills in Python or Go.

194k – 267kBellevue, WA +4DevOps / SREHybrid5+ YOEGoGKE

Cribl

Jun 17

Sr Software Engineer, Storage

Senior Software Engineer on the Storage team building autoscaling, self-healing infrastructure-as-code systems that manage petabyte-scale telemetry storage on AWS.

175k – 205kUnited StatesDevOps / SRERemote5+ YOEGoS3

Apply