Skip to content

Senior Software Engineer, Site Reliability

Senior SRE measures software performance, defines SLOs/SLAs, optimizes infrastructure with Temporal/Kubernetes/AWS, handles on-call, and improves developer experience/scalability for growing B2B workflows. Requires 5+ years SRE/DevOps experience.

180k – 200kNew York, NYSan Francisco, CADevOps / SREHybrid5+ YOE

About the role

Responsibilities

  • Monitor core business-logic software via on-call and non-urgent situations, define SLOs/SLAs.
  • Extend and monitor infrastructure stack for business-critical B2B usage.
  • Maintain mental and reified models of systems for risk estimation, project planning, and debugging.
  • Collaborate with engineering team and leadership on site reliability expertise.
  • Work on core orchestration logic using Temporal to run thousands of workflows.
  • Advise on major backend projects from planning to release.
  • Optimize services for scalability, stability, and observability.
  • Improve developer experience, engineering processes, especially with LLM coding agents.

Requirements

  • 5+ years of SRE, DevOps, or Platform engineering experience.
  • Proven record building efficient, performant, extensible systems.
  • Experience with quantitative metrics, SLOs/SLAs, navigating tradeoffs.
  • Familiarity with containerization/orchestration (Docker, Kubernetes), Linux, backend services.
  • Experience implementing/managing AWS infrastructure.
  • Strong communicator, team player.

Nice to Haves

  • Experience with Temporal.
  • Building AI platforms/tooling.
  • Kubernetes/Helm (Amazon EKS).
  • IaaS tools for production cloud.
  • Managing CI/CD pipelines, developer experience.
  • Cloud storage/data modeling products.
  • Early stage startups.

Compensation

Salary Range: $180,000 - $200,000 (SF/NY base, plus equity, health benefits).

Skills

TemporalKubernetesDockerAWSAmazon EksLinuxCI/CDHelmSLOsSlas

Similar roles

DevOps / SRE jobs

Senior Software Engineer, Infrastructure

Senior Infrastructure Engineer responsible for building and operating platform primitives including Kubernetes, CI/CD, observability, and developer tooling at a high-growth AI and data platform company.

180k – 250kBoston, MADevOps / SREHybrid5+ YOEGoGCP

Senior Infrastructure Engineer

As a Senior Infrastructure Engineer, you will own and evolve the foundational systems powering Casca's AI-driven lending platform. You will build and maintain infrastructure, CI/CD pipelines, cloud infrastructure, deployment automation, and observability systems, ensuring the platform is secure, compliant, and highly available.

180k – 215kSan Francisco, CADevOps / SREOn-site5+ YOEGoAWS

Senior Release Engineer

As a Senior Release Engineer, you will design, maintain, and improve deployment processes, build and scale CI/CD systems, and manage production environments. You will partner with engineering teams to eliminate bottlenecks and enhance platform resilience.

180k – 200kUnited StatesDevOps / SRERemote4+ YOEHelmLinux

Sr. Software Engineer, DevOps

Builds and scales reliable infrastructure for SaaS applications using Kubernetes, Terraform, and GitHub CI/CD. Focuses on observability with Grafana/Prometheus, automation to reduce toil, production troubleshooting, and cross-team collaboration. Requires 5+ years Python experience.

180k – 220kSouth San Francisco, CADevOps / SREHybrid5+ YOEAWSGit

Senior DevOps Engineer (Infrastructure & MLOps)

Designs and manages scalable AWS infrastructure, builds CI/CD pipelines for web and ML applications, implements MLOps practices, and ensures reliability through monitoring. Requires 5+ years DevOps experience, AWS expertise, containerization, and Python scripting.

180k – 225kUnited StatesDevOps / SRERemote5+ YOEAWSECS