Sr. Staff Technical Program Manager - Reliability

Leads reliability strategy and execution for Databricks' multi-cloud infrastructure, partnering with senior engineering leaders to drive roadmaps, programs, and best practices. Requires 10+ years in cloud/SRE with hyperscale experience.

190k – 256k | Mountain View, CA | San Francisco, CA | Technical Program Management | Onsite | 10+ YOE

About the role

Responsibilities

Lead Reliability Strategy + Multi-Quarter Roadmaps

  • Partner with senior engineering leadership to define the long-term Reliability roadmap and ensure alignment across Platform Engineering, Compute Fleet Management, SRE, Security, and Cloud Partnerships.

Drive Execution of Critical Reliability Programs

  • Own end-to-end program execution: planning, risk management, dependency mapping, trade-off decisions, status reporting, and delivery.
  • Identify process/architecture gaps and drive improvements with TLs.

Partner Deeply with Engineering & Influence Technical Direction

  • Use infrastructure/SRE background to guide design and prioritization.
  • Facilitate cross-functional alignment.
  • Apply systems thinking to improve scalability, fault tolerance, automation, and tooling.

Elevate Reliability Culture

  • Drive adoption of best practices: error budgets, incident reviews, design-for-resilience, operational readiness.
  • Implement governance, processes, metrics, and documentation.
  • Evangelize reliability expectations and engineer-empowering processes.

Required Experience & Qualifications

  • 10+ years managing large-scale technical programs in cloud infrastructure, distributed systems, SRE, or platform engineering.
  • Experience with 2+ hyperscale clouds (AWS, Azure, GCP), multi-AZ/region architecture, control/data plane patterns.
  • Track record of leading Reliability Programs: availability, failover, operational excellence, incident reduction.
  • Strong understanding of infrastructure/distributed systems/SRE; engineering/SRE experience preferred.
  • Experience partnering with senior leadership on strategy and multi-team initiatives.
  • Ability to translate ambiguous goals into actionable plans with milestones, KPIs, and metrics.
  • Ability to manage cross-org dependencies, risks, and multi-quarter timelines.
  • Experience delivering programs across multiple clouds and large-scale services.
  • Experience building and scaling engineering processes and frameworks.

Preferred Qualifications

  • Background in distributed systems engineering, SRE, platform infrastructure, cloud services.
  • Experience with compute fleets, container orchestration, autoscaling, and control planes.
  • Familiarity with SLOs, error budgets, chaos engineering, failure analysis, incident management.
  • Expertise with Jira or equivalent.
  • Bachelor's in CS/Engineering or related; advanced degree preferred.

Skills

AWS, Azure, GCP, Distributed Systems, SRE, Cloud Infrastructure, Jira, SLOs, Error Budgets, Chaos Engineering

Program Manager

Oversee implementation programs and ensure a world-class experience for the largest strategic accounts on a healthcare SaaS platform. Requires 10+ years of customer-facing experience and 5+ years leading complex technical implementations.

190k – 240k | Boston, MA | Technical Program Management | Hybrid | 10+ YOE | CRM Integration | ERP Integration

Lead Program Manager

Leads cross-functional technical programs in a health tech company, driving planning, execution, risk mitigation, and continuous improvement across teams. Requires 10+ years of program management experience with deep PDLC/SDLC knowledge.

190k – 230k | United States | Technical Program Management | Remote | 10+ YOE | PMP | PDLC

Staff Technical Program Manager

Lead high-impact technical programs for the Insurance platform as the sole TPM, driving cross-functional execution, data rearchitecture, and operational improvements that deliver business-critical outcomes.

190k – 238k | United States | Technical Program Management | Remote | 7+ YOE | Risk Management | Project Planning

Staff Product Operations Manager

Leads end-to-end delivery of high-impact R&D programs and projects, driving cross-functional alignment, defining OKRs, managing dependencies and risks, and establishing operational best practices. Requires 10+ years of experience in product operations or program management, strong stakeholder influence, and expertise with collaboration tools.

192k – 292k | San Francisco, CA | Technical Program Management | Hybrid | 10+ YOE | Jira | OKRs

Senior Staff Program Manager

Leads engineering operations for global software teams, streamlining processes, managing OKRs/KPIs, and driving enterprise-wide initiatives to boost efficiency and execution. Requires 8+ years in tech operations with automation, data analysis, and cross-functional leadership skills.

192k – 240k | Oakland, CA | Technical Program Management | Hybrid | 8+ YOE | AI | AWS