Skip to content

Sr. Staff Technical Program Manager - Reliability

Leads reliability strategy and execution for Databricks' multi-cloud infrastructure, partnering with senior engineering leaders to drive roadmaps, programs, and best practices. Requires 10+ years in cloud/SRE with hyperscale experience.

180k – 243kBellevue, WASeattle, WATechnical Program ManagementHybrid10+ YOE

About the role

Responsibilities

Lead Reliability Strategy + Multi-Quarter Roadmaps

  • Partner with senior engineering leadership to define long-term Reliability roadmap and ensure alignment across Platform Engineering, Compute Fleet Management, SRE, Security, and Cloud Partnerships.

Drive Execution of Critical Reliability Programs

  • Own end-to-end program execution: planning, risk management, dependency mapping, trade-off decisions, status reporting, and delivery.
  • Identify process/architecture gaps and drive improvements with Tech Leads.

Partner Deeply with Engineering & Influence Technical Direction

  • Leverage infrastructure/SRE background to guide design and prioritization.
  • Facilitate cross-functional alignment and apply systems thinking to improve scalability, fault tolerance, automation, and tooling.

Elevate Reliability Culture

  • Drive adoption of best practices: error budgets, incident reviews, design-for-resilience, operational readiness.
  • Implement governance, processes, metrics, and documentation.

Required Experience & Qualifications

  • 10+ years managing large-scale technical programs in cloud infrastructure, distributed systems, SRE, or platform engineering.
  • Experience with 2+ hyperscale clouds (AWS, Azure, GCP), multi-AZ/region architecture.
  • Success leading Reliability Programs (availability, failover, incident reduction).
  • Strong understanding of infrastructure/distributed systems/SRE; engineering/SRE experience preferred.
  • Partnering with senior leadership on strategy and multi-team initiatives.
  • Translate ambiguous goals into plans with milestones/KPIs.
  • Manage cross-org dependencies, risks, multi-quarter timelines.
  • Delivering programs across multiple clouds/cloud-native services.
  • Building/scaling engineering processes and frameworks.

Preferred Qualifications

  • Background in distributed systems engineering, SRE, platform infrastructure, or cloud services.
  • Experience with compute fleets, container orchestration, autoscaling, control-plane.
  • Familiarity with SLOs, error budgets, chaos engineering, incident management.
  • Expertise with Jira or equivalent.
  • Bachelor’s in CS/Engineering or related; advanced degree preferred.

Skills

AWSAzureGCPDistributed SystemsSREJiraSLOsError BudgetsChaos EngineeringContainer Orchestration

Staff Technical Program Manager- Unity Catalog

Leads complex, high-visibility cross-functional programs for Databricks' Unity Catalog and core platform experiences, driving end-to-end delivery, stakeholder alignment, and enterprise adoption. Requires 10+ years in enterprise software program management with strong execution and communication skills.

180k – 248kMountain View, CA +1Technical Program ManagementOn-site10+ YOESQLAWS

Senior / Staff Program Engineering Manager

Lead program management and delivery for future autonomous vehicle programs at Zoox. Drive cross-functional planning, scheduling, milestone tracking, and issue resolution for complex hardware-software vehicle development projects.

180k – 250kFoster City, CATechnical Program ManagementOn-site10+ YOEJiraConfluence

Senior/Staff Technical Program Manager - System Safety for Cross Functional Initiatives

Leads cross-functional verification and validation programs for autonomous vehicle safety, coordinating across software, hardware, and operations teams to meet performance and reliability requirements. Requires 7+ years in technical program management, preferably in automotive or robotics.

179k – 246kFoster City, CATechnical Program ManagementOn-site7+ YOEJiraStatistics

Staff Technical Program Manager, Hardware

Co-leads ambitious hardware/software initiatives, driving cross-functional execution, risk management, and strategic alignment for complex programs at scale. Requires 10+ years TPM experience in hardware/software environments and bachelor's in technical field.

176k – 304kSunnyvale, CATechnical Program ManagementOn-site10+ YOERoboticsRisk Management

Senior/Staff Technical Program Manager - Autonomous Test Fleet Data Strategy & Mileage Accumulation

Lead end-to-end strategy and execution of test fleet mileage accumulation (physical and simulation) to enable geofence expansion. Drive cross-functional alignment, resolve bottlenecks, and deliver data collection outcomes for autonomous vehicle validation.

186k – 284kFoster City, CATechnical Program ManagementHybrid7+ YOEData AcquisitionAutonomous Vehicles