Skip to content

Senior DevOps Engineer

Senior DevOps Engineer building and operating Kubernetes-based ephemeral environments and cloud infrastructure on AWS to improve developer productivity and platform reliability.

153k – 231kUnited StatesDevOps / SRERemote4+ YOE

About the role

How you’ll make an impact

  • Design, build, and operate Kubernetes-based ephemeral environments that enable engineers to develop, test, and validate software efficiently.
  • Improve the reliability, scalability, performance, and usability of the Ephemeral Infrastructure platform through automation and platform enhancements.
  • Partner with product engineering, platform, security, and reliability teams to integrate infrastructure capabilities and improve developer workflows.
  • Build infrastructure automation, tooling, and self-service capabilities that reduce operational toil and accelerate software delivery.
  • Enhance observability, incident response, and operational practices to improve platform health and engineer productivity.
  • Contribute to the long-term architecture and technical direction of Upstart’s developer platform and cloud infrastructure ecosystem.

Minimum Qualifications

  • Bachelor’s degree in Computer Science, Engineering, Mathematics, or a related field (or equivalent practical experience) and 4+ years of software engineering experience.
  • 4+ years of experience designing, deploying, and operating production Kubernetes environments.
  • Experience building and operating cloud infrastructure on AWS, including services such as EKS, EC2, IAM, and networking components.
  • Experience with Kubernetes operators, controllers, and cloud-native platform architecture.
  • Experience developing software and automation using Go or a comparable programming language.
  • Experience implementing infrastructure-as-code, CI/CD pipelines, and automated operational workflows in production environments.

Preferred Qualifications

  • Certified Kubernetes Administrator/Architect (CKA/CKAD) or equivalent certification.
  • Experience with Terraform, Helm, GitOps practices, and tools such as ArgoCD.
  • Experience operating distributed systems with a focus on observability, reliability, and incident response.
  • Ability to influence platform adoption and collaborate effectively across multiple engineering teams.
  • Experience building internal developer platforms, ephemeral environment solutions, or developer productivity tooling.

Skills

KubernetesAWSEKSEC2IAMGoInfrastructure As CodeCI/CDTerraformHelmArgo CDGitOps

Similar roles

DevOps / SRE jobs

Senior Data Engineer, Sentinel (Pacific Time Zone)

Senior Infrastructure Engineer building and operating AWS cloud infrastructure for healthcare data platform. Requires Python, Terraform, CI/CD expertise, and big data tools experience.

153k – 210kUnited StatesDevOps / SRERemote5+ YOEAWSVpc

Senior Manager, DevOps

Lead DevOps strategy and team to improve engineering velocity, platform reliability, and operational efficiency across multi-cloud (AWS/GCP) environments. Drive IaC, Kubernetes delivery, observability, AI-powered tooling adoption, and cross-functional collaboration.

155k – 185kUnited StatesDevOps / SRERemote6+ YOEGoAWS

Senior Asset Pipeline Engineer

Design and own the OpenUSD-based asset pipeline for a high-fidelity sensor simulation platform. Build automated DCC-to-engine pipelines, custom schemas, material conversion, and validation systems at library scale.

151k – 230kSunnyvale, CADevOps / SREOn-site5+ YOEMdlCI/CD

Senior Platform Engineer, Interoperability

Lead development of scalable platform systems and infrastructure tools that enable internal and external developers to build faster, more reliable applications. Requires 5+ years of software engineering experience with 3+ years in Node.js.

151k – 205kSan Francisco, CADevOps / SREHybrid5+ YOEEs6Git

Team Lead, Site Reliability Engineering - Storage Layer Service

Leads a team of SREs for MongoDB's Storage Layer Services, defining SLOs, capacity plans, and roadmaps for multi-tenant distributed storage systems underpinning Atlas. Requires 10+ years in distributed systems and 2+ years managing teams, with expertise in Kubernetes and IaC tools.

151k – 297kBoston, MA +5DevOps / SREHybrid10+ YOEAWSGCP