# Team Lead, Site Reliability Engineering - Storage Layer Service
**Company:** [MongoDB](https://hotfix.jobs/companies/mongodb)
**Location:** Boston, MA, Charlotte, NC, New York, NY, Philadelphia, PA, Pittsburgh, PA, Washington, DC
**Salary:** $151K-$297K
**Experience:** 10+ years
**Skills:** Kubernetes, Terraform, Crossplane, AWS, GCP, Azure, Distributed Systems, Containerization, Iac, Storage Systems, MongoDB, Operators
**Posted:** 2026-03-25
> Leads a team of SREs for MongoDB's Storage Layer Services, defining SLOs, capacity plans, and roadmaps for multi-tenant distributed storage systems underpinning Atlas. Requires 10+ years in distributed systems and 2+ years managing teams, with expertise in Kubernetes and IaC tools.
## Job Description
## Responsibilities

- Build and lead a team of 6-8 engineers, fostering a positive culture, handling career growth and performance conversations, and proactively removing blockers
- Define and drive a clear technical vision and comprehensive roadmap for our multi-tenant distributed storage systems, balancing long-term strategic infrastructure goals with immediate engineering needs
- Contribute through hands-on technical work, such as leading architectural design reviews, reviewing PRs, and stepping in to guide the team through complex operational challenges
- Act as the primary liaison for the Storage Layer Services SRE team, collaborating closely with other engineering leaders to ensure platform alignment and manage stakeholder expectations

## Requirements

- 10+ years of experience working on software and operating distributed systems, with 2+ years managing engineering teams
- Customer-focused mindset, treating internal developers as your primary users
- Value efficiency in processes and operations, and have a track record of optimizing team workflows
- Prefer automation over manual processes, fostering a culture of building software solutions to eliminate toil
- Deep technical familiarity with **Kubernetes** ecosystems, containerization technologies, and modern IaC tooling (e.g., **Terraform**, **Crossplane**, or Operators)
- Operated or supported stateful storage or database systems at scale and comfortable with durability, consistency and recovery trade-offs
- Excel at translating complex business and engineering requirements into actionable, phased technical roadmaps
- High level of empathy, responsibility, ownership, and accountability
- Excellent verbal and written technical communication skills

## Nice-to-Haves

- Leading major architectural shifts, such as moving from legacy storage stacks to new multi-tenant storage architectures, including planning and executing large-scale data and workload migrations with tight availability and durability requirements
- Managing and scaling infrastructure across multi-cloud environments (**AWS**, **GCP**, or **Azure**)
- Designing secure, multi-tenant runtime environments at scale
**Apply:** https://hotfix.jobs/jobs/team-lead-site-reliability-engineering-storage-layer-service-at-mongodb-6461e530-5c59-4056-9aae-42cfd3e730c1
**Canonical:** https://hotfix.jobs/jobs/team-lead-site-reliability-engineering-storage-layer-service-at-mongodb-6461e530-5c59-4056-9aae-42cfd3e730c1