Skip to content

Site Reliability Engineer

127k – 249kBoston, MAMiami, FLNew JerseyNew York, NYDevOps / SREHybrid6+ YOE
Summary

Senior or Staff Site Reliability Engineer focused on continuous delivery infrastructure using Argo Workflows, ArgoCD, and Kubernetes. Owns deployment tooling, onboarding flows, and participates in 24/7 on-call. Requires 6+ years building and operating distributed systems.

About the role

Responsibilities

  • Contribute to developing a world-class continuous deployment experience, enabling the rapid and reliable shipment of MongoDB products
  • Contribute to open-source projects or engineer software-based approaches like Kubernetes operators to streamline processes
  • Own the onboarding flow other engineering teams follow when launching a new product or service
  • Collaborate with other teams within Platform Engineering to ensure a consistent service-onboarding experience
  • Provide internal support for our deployment systems, including answering questions and addressing issues
  • Participate in a 24/7 on-call rotation to resolve issues involving the deployment infrastructure

Requirements

  • 6+ years of experience in software development and operating distributed systems
  • Proficiency in Python, Go, or a similar language
  • Proven experience building and operating large-scale continuous integration and continuous deployment (CI/CD) pipelines
  • Customer-focused mindset
  • Value efficiency in processes and operations
  • Prefer automation over manual process (“allergic to ops work”)
  • Experience using and extending containerization technologies, particularly Kubernetes, to enhance application agility, optimize resource utilization, and accelerate time-to-market
  • Expertise in cloud infrastructure platforms, including AWS, Google Cloud Platform (GCP), or Azure
  • Understanding of Linux operating system internals and networking concepts (e.g., TCP/IP, DNS, TLS, routing)
Skills
PythonGoKubernetesAWSGoogle Cloud PlatformAzureArgo WorkflowsArgoCDCI/CDLinuxTCP/IPDNSTLS
Similar roles at this salary range
All DevOps / SRE jobs →
Northwood Space

Senior Network Engineer

Design, deploy, and operate enterprise network infrastructure for corporate facilities and hybrid cloud environments with zero-trust architecture and compliance requirements. Requires 5+ years enterprise networking experience and ability to obtain TS/SCI clearance.

133k – 215kLos Angeles, CA +1DevOps / SREOn-site5+ YOEAWSVLAN
Pinterest

Site Reliability Engineer II

Operate and scale a cloud-native CTV advertising platform on AWS and Kubernetes. Focus on reliability, GitOps workflows, infrastructure automation, observability, and incident response.

114k – 235kSan Francisco, CADevOps / SRERemote4+ YOEAWSEKS
Forterra

Senior Software Engineer-Internal Tools

Senior Software Engineer on the DevOps and Tooling team building internal tools. Requires 3-5+ years experience, Rust or strong systems background, TypeScript/React, Linux, Docker, and CI/CD.

125k – 140kArlington, VA +1DevOps / SREOn-site5+ YOEAWSRust
Beacon AI

Software Engineer, Cloud Infrastructure

Build and operate AWS cloud infrastructure and LLM platform services including RAG pipelines, vector search, model endpoints, and data ingestion for an aviation AI company.

135k – 260kSan Carlos, CADevOps / SREHybrid4+ YOEAWSGlue
CommandLink

Senior Network Engineer

Senior Network Engineer building and supporting carrier interconnects, private circuits, NNIs, and cloud connectivity for a managed network services provider. Requires hands-on service provider experience with Layer 2/3 protocols and direct carrier coordination.

120k – 160kUnited StatesDevOps / SRERemote5+ YOEBGPVRF