Skip to content

Sr. Site Reliability Engineer

Senior SRE responsible for reliability, scalability, and performance of AWS and Azure cloud systems. Requires 5+ years SRE experience, strong cloud infrastructure skills, and automation expertise.

170k – 196kSunnyvale, CADevOps / SREOnsite5+ YOE

About the role

Responsibilities

  • Monitor system performance, application health, and infrastructure metrics using monitoring and logging services, and implement proactive measures to optimize performance and availability
  • Oncall duty for production uptime and support for customer escalations
  • Release upgrades and maintenance activities including hotfixes and infrastructure updates
  • Lead incident response and resolution efforts, conducting root cause analysis, implementing corrective actions, and documenting post-incident reviews
  • Implement security best practices and controls in the cloud environments to protect data, applications, and infrastructure, and ensure compliance with regulatory requirements
  • Drive continuous improvement initiatives to enhance reliability, scalability, and efficiency of infrastructure and services, leveraging automation and emerging technologies

Requirements

  • Bachelor’s degree in computer science, Engineering, or related field; or equivalent work experience
  • 5+ years of experience working as a Site Reliability Engineer (SRE) or similar role, with a focus on AWS and/or Azure cloud platform
  • Hands-on experience in designing, deploying, and managing AWS and/or Azure infrastructure, including compute, storage, networking, and security services
  • Proficiency in scripting and programming languages such as PowerShell, Python, or Go for automation and infrastructure management tasks
  • Strong understanding of CI/CD principles and experience with tools such as Azure DevOps, Jenkins, or GitLab CI/CD
  • Excellent analytical, problem-solving, and communication skills, with the ability to collaborate effectively with cross-functional teams

Nice-to-Haves

  • Experience with containerization technologies (e.g., Docker, Kubernetes) and microservices architecture in AWS and Azure environments
  • AWS or Azure certifications such as AWS/Azure Solutions Architect, Azure DevOps Engineer, or Azure Security Engineer

Skills

AWSAzurePythonGoPowerShellCI/CDJenkinsGitlab Ci/CdAzure DevOpsDockerKubernetes

Similar roles

DevOps / SRE jobs

Senior Software Engineer, Developer Productivity Cloud Infrastructure

Senior engineer focused on developer productivity and cloud infrastructure. Designs scalable internal tools, re-architects build systems, and improves CI/CD workflows using Terraform, Go/Python/C++.

170k – 240kSan Mateo, CADevOps / SREHybrid5+ YOEGoC++

Senior Software Engineer - Observability and Reliability

Build observability platforms and tools (metrics, logging, tracing, alerting) using Go, OpenTelemetry, and Kubernetes. Requires 5+ years experience building production software and strong CS fundamentals.

170k – 240kNew York, NYDevOps / SREOn-site5+ YOEGoGCP

Senior Software Engineer - Observability and Reliability

Build observability tools and platforms (metrics, logging, tracing, alerting) using Go, OpenTelemetry, and Kubernetes. Requires 5+ years experience building high-quality software that other engineers use.

170k – 240kSan Francisco, CADevOps / SREOn-site5+ YOEGoGCP

Sr. Site Reliability Engineer

Senior SRE responsible for reliability, scalability, and performance of AWS and Azure cloud infrastructure. Requires 5+ years SRE experience, strong cloud platform skills, and automation expertise.

170k – 196kSunnyvale, CADevOps / SREOn-site5+ YOEGoAWS

Senior Software Engineer, Infrastructure

Senior Infrastructure Engineer responsible for re-architecting Kubernetes infrastructure, improving continuous deployment, and making code changes across the stack to support drone platform needs.

170k – 258kSan Mateo, CADevOps / SREHybrid4+ YOEGoPython