Skip to content

Sr. Site Reliability Engineer

Senior Site Reliability Engineer ensures reliability, scalability, and performance of AWS and Azure cloud infrastructure. Monitors systems, handles oncall and incidents, automates improvements, and implements security best practices. Requires 5+ years SRE experience and cloud expertise.

170k – 196kSunnyvale, CADevOps / SREOnsite5+ YOE

About the role

Responsibilities

  • Monitor system performance, application health, and infrastructure metrics using monitoring and logging services, and implement proactive measures to optimize performance and availability
  • Oncall duty for production uptime and support for customer escalations
  • Release upgrades and maintenance activities including hotfixes and infrastructure updates
  • Lead incident response and resolution efforts, conducting root cause analysis, implementing corrective actions, and documenting post-incident reviews
  • Implement security best practices and controls in the cloud environments to protect data, applications, and infrastructure, and ensure compliance with regulatory requirements
  • Drive continuous improvement initiatives to enhance reliability, scalability, and efficiency of infrastructure and services, leveraging automation and emerging technologies

Requirements

  • Bachelor’s degree in computer science, Engineering, or related field; or equivalent work experience
  • 5+ years of experience working as a Site Reliability Engineer (SRE) or similar role, with a focus on AWS and/or Azure cloud platform
  • Hands-on experience in designing, deploying, and managing AWS and/or Azure infrastructure, including compute, storage, networking, and security services
  • Proficiency in scripting and programming languages such as PowerShell, Python, or Go for automation and infrastructure management tasks
  • Strong understanding of CI/CD principles and experience with tools such as Azure DevOps, Jenkins, or GitLab CI/CD

Nice-to-Haves

  • Experience with containerization technologies (e.g., Docker, Kubernetes) and microservices architecture in AWS and Azure environments
  • AWS or Azure certifications such as AWS/Azure Solutions Architect, Azure DevOps Engineer, or Azure Security Engineer

Skills

AWSAzureKubernetesDockerPythonPowerShellGoCI/CDJenkinsAzure DevOps

Similar roles

DevOps / SRE jobs

Senior Software Engineer, Developer Productivity Cloud Infrastructure

Senior engineer focused on developer productivity and cloud infrastructure. Designs scalable internal tools, re-architects build systems, and improves CI/CD workflows using Terraform, Go/Python/C++.

170k – 240kSan Mateo, CADevOps / SREHybrid5+ YOEGoC++

Senior Software Engineer - Observability and Reliability

Build observability platforms and tools (metrics, logging, tracing, alerting) using Go, OpenTelemetry, and Kubernetes. Requires 5+ years experience building production software and strong CS fundamentals.

170k – 240kNew York, NYDevOps / SREOn-site5+ YOEGoGCP

Senior Software Engineer - Observability and Reliability

Build observability tools and platforms (metrics, logging, tracing, alerting) using Go, OpenTelemetry, and Kubernetes. Requires 5+ years experience building high-quality software that other engineers use.

170k – 240kSan Francisco, CADevOps / SREOn-site5+ YOEGoGCP

Sr. Site Reliability Engineer

Senior SRE responsible for reliability, scalability, and performance of AWS and Azure cloud infrastructure. Requires 5+ years SRE experience, strong cloud platform skills, and automation expertise.

170k – 196kSunnyvale, CADevOps / SREOn-site5+ YOEGoAWS

Sr. Site Reliability Engineer

Senior SRE responsible for reliability, scalability, and performance of AWS and Azure cloud systems. Requires 5+ years SRE experience, strong cloud infrastructure skills, and automation expertise.

170k – 196kSunnyvale, CADevOps / SREOn-site5+ YOEGoAWS