Skip to content

Technical Program Manager, Data Center Operations

Own end-to-end data center site handover, SOPs, and incident management programs. Drive cross-functional coordination across design, construction, and ops teams in mission-critical environments.

200k – 270kAustin, TXNew York, NYSan Francisco, CATechnical Program ManagementOnsite5+ YOE

About the role

Role Scope

  • Own the end-to-end site handover framework: define the gates, acceptance criteria, and sign-off procedures that move a new facility from construction to live operations without dropped terms or late surprises.
  • Embed into design, construction, and due diligence teams early enough to shape maintainability requirements before they become field problems.
  • Drive the cross-functional handover rhythm across training, documentation, systems access, and knowledge transfer, surfacing blockers weeks before they hit the go-live schedule.
  • Build and maintain the SOPs that govern critical datacenter operations across the fleet, with metrics that track adoption, execution quality, and efficiency at each site.
  • Lead incident management and stability improvement programs, including post-incident reviews with root cause analysis, corrective action tracking, and preventive maintenance oversight that reduces unplanned outages across the global footprint.
  • Produce the dashboards and reporting that give leadership visibility into stability metrics and incident trends, and run the CAPA programs that turn that data into durable fixes.

What We're Looking For

  • Run program management in mission-critical environments where a delayed handover or missed SOP had real operational consequences, not just schedule slippage.
  • Designed operational frameworks from scratch: handover gates, SOP libraries, incident management programs built without a legacy system to copy from.
  • Quarterback across design, construction, supply chain, and site ops teams simultaneously, and other teams call you when a cross-functional workstream is stuck.
  • Write clearly enough to distill a complex operational issue into a decision and a next action for a site lead, an executive, or a counterparty who was not in the room.
  • Track incident trends and CAPA status in live dashboards and follow corrective actions through to closure, not just to initial assignment.
  • Personally built or maintained SOPs and measured whether they were actually followed, not just whether they existed.

Bonus

  • ITIL, PMP, or PgMP certification.
  • Hyperscale or large colo operator experience.
  • Familiarity with ASHRAE, Uptime Institute, or TIA-942 standards.
  • Exposure to datacenter construction and commissioning processes.

Salary & Benefits

  • Competitive total compensation package (salary + equity)
  • Retirement or pension plan, in line with local norms
  • Health, dental, and vision insurance
  • Generous PTO policy, in line with local norms
  • The base salary range for this position is $200,000 - $270,000 per year, depending on experience, skills, qualifications, and location. Total compensation may also include equity in the form of stock options.

Skills

Program ManagementSop DevelopmentIncident ManagementRoot Cause AnalysisCapaCross-Functional CoordinationDashboard ReportingItilPMPPgmpAshraeTia-942

Technical Program Manager, Data

Leads data collection and annotation initiatives for audio/ML projects, collaborating with research/product teams to gather requirements, manage resources/vendors, and ensure high-quality delivery. Requires 5+ years with ML teams and data labeling experience.

200k – 260kSan Francisco, CATechnical Program ManagementOn-site5+ YOELLMsData Labeling

Technical Program Manager, Quality and Reliability

Own product quality and reliability as Technical Program Manager, leading release management, incident response, and cross-team initiatives to improve test coverage, observability, and metrics like MTTR in a fast-scaling SaaS environment. Requires 5+ years TPM experience, QA background, and bachelor's in technical field.

200k – 275kSan Francisco, CATechnical Program ManagementHybrid5+ YOECI/CDDatadog

Technical Program Manager, Deployments

Leads end-to-end delivery of data center infrastructure and AI cluster deployments, coordinating construction, networking, hardware, and operations teams across sites. Requires 5+ years in data center programs, AI/GPU expertise, and cross-functional influence in ambiguous environments.

200k – 275kNew York, NY +2Technical Program ManagementOn-site5+ YOEIctJira

Operations Program Manager - Robotics Data Acquisition

Own daily operations in robotics data collection facilities, tracking metrics, removing bottlenecks, and rolling out new hardware/processes with engineering and operations teams. Requires 3-5 years in manufacturing/operations and a technical bachelor's degree.

207k – 285kSan Francisco, CATechnical Program ManagementOn-site3+ YOELeanSix Sigma

Technical Program Manager, Safety Systems Engineering

Leads safety systems engineering programs at OpenAI, managing risks, data/compute infrastructure, and stakeholder coordination to ensure safe model deployments. Requires technical expertise, project delivery track record, and knowledge of content moderation in fast-paced environments.

207k – 335kSan Francisco, CATechnical Program ManagementHybridAgi SafetyRisk Management