Skip to content

Staff Software Engineer (SRE)

Staff SRE manages and scales Kubernetes clusters on AWS EKS, automates infrastructure with IaC tools, optimizes performance, maintains blockchain nodes and databases, and improves system reliability using monitoring tools. Requires 5+ years SRE experience with strong Kubernetes and AWS proficiency.

200k – 250kUnited StatesDevOps / SRERemote5+ YOE

About the role

Responsibilities

  • Kubernetes Ownership: Manage and scale Kubernetes clusters on AWS EKS, ensuring reliability, performance, and security.
  • Infrastructure Automation: Implement and maintain Infrastructure-as-Code (Terraform/Pulumi) to automate infrastructure provisioning and management.
  • Performance Optimization: Monitor and optimize system performance, scalability, and resource utilization.
  • Blockchain Infrastructure: Configure and maintain crypto nodes across multiple blockchains to support our wallet’s operations.
  • Database Scaling: Optimize and scale database infrastructure to handle terabytes of blockchain data efficiently.
  • System Reliability: Continuously improve system uptime, monitoring, and observability using tools like Datadog and OpenTelemetry.
  • Collaboration: Work closely with backend and product teams to support feature development and system scaling.

Qualifications

  • 5+ years in a SRE or Software Engineer role.
  • Strong hands-on experience with Kubernetes (EKS) in production environments.
  • Proficiency with AWS infrastructure and services (EC2, S3, RDS, IAM).
  • Solid experience with Docker and Infrastructure-as-Code tools like Terraform or Pulumi.
  • Monitoring and observability experience using tools like Datadog or OpenTelemetry.

Benefits

  • Competitive salary and equity
  • Eligible to participate in the Company's performance bonus program
  • Comprehensive insurance (medical/dental/vision) — 100% covered
  • Stipend for your ideal remote set-up
  • Flexible hours and a supportive remote environment
  • Unlimited vacation
  • 401(k) retirement plan
  • Monthly wellness benefit
  • Weekly meal benefit
  • Global off-sites

Target base salary: $200,000 to $250,000 with equity and benefits.

Skills

KubernetesAws EksTerraformPulumiDockerDatadogOpenTelemetryAws Ec2Aws S3Aws Rds

Similar roles

DevOps / SRE jobs

Staff Software Engineer, Infrastructure

Hands-on Infrastructure Tech Lead building and scaling AWS cloud infrastructure from scratch for an AI-driven enterprise analytics platform. Owns architecture, IaC, security/compliance (SOC 2), and operational excellence.

200k – 300kSan Francisco, CADevOps / SREHybrid7+ YOEAWSGCP

Member of Technical Staff, DevOps

The Member of Technical Staff, DevOps will own progressive delivery, GitOps, and on-demand environment tooling to improve deployment safety and speed for engineering teams. This role requires a platform-as-a-product mindset and experience with infrastructure as code and CI/CD pipelines.

200k – 270kSan Francisco, CADevOps / SREHybrid5+ YOEGoEKS

Member of Technical Staff, Site Reliability Engineer

Vapi is seeking a Site Reliability Engineer to drive 99.99% call completion for their Voice AI platform. This role involves running incident command, owning SLOs and error budgets, building reliability culture, and shipping code for platform services in Go or TypeScript.

200k – 270kSan Francisco, CADevOps / SREHybrid5+ YOEGoKeda

Staff Platform Engineer, Interoperability

Staff Platform Engineer building developer tooling, CI/CD automation, and scalable web applications using NodeJS, React, and AWS. Requires 10+ years experience and expertise in Temporal, Terraform, PostgreSQL, and Snowflake.

200k – 272kSan Francisco, CADevOps / SREHybrid10+ YOEGitJest

Member of Technical Staff - Engineering

Builds infrastructure and simulation engines for training autonomous AI agents in complex environments using reinforcement learning. Requires strong engineering skills in distributed systems, containerization, networking, and data systems.

200k – 350kSan Francisco, CADevOps / SREOn-siteDockerAI Agents