Skip to content

Technical Program Manager, Quality and Reliability

Own product quality and reliability as Technical Program Manager, leading release management, incident response, and cross-team initiatives to improve test coverage, observability, and metrics like MTTR in a fast-scaling SaaS environment. Requires 5+ years TPM experience, QA background, and bachelor's in technical field.

200k – 275kSan Francisco, CATechnical Program ManagementHybrid5+ YOE

About the role

What You'll Do

  • Own release management end-to-end, ensuring on-time, high-quality product releases through coordination across all teams in a fast-paced environment.
  • Introduce and enforce change safety standards, such as risk assessments, rollback procedures, feature flags, and bug bashes to reduce regressions and customer impact.
  • Lead horizontal reliability initiatives focused on improving test coverage, observability, and incident response readiness.
  • Define, measure, and report on reliability metrics (e.g. change failure rate, MTTR, SLI), and drive accountability for sustained improvement.
  • Identify systemic gaps in release processes, testing, monitoring, and incident response; convert findings into structured improvement plans with clear owners and timelines.
  • Drive rapid triage and resolution of customer-reported issues in partnership with Product and User Operations, ensuring timely followup and continuous improvements.
  • Own and improve the incident management lifecycle: Facilitate rigorous post-incident reviews, ensuring root causes are identified and corrective and preventative actions are tracked to completion.
  • Oversee vendor reliability and SLA compliance, including performance monitoring, incident escalation, and periodic business reviews.

What You Have

  • 5+ years of experience in technical program management or release management, ideally within SaaS or fast-moving tech companies.
  • Prior experience working as Software QA or Test Engineer, preferably with SaaS products.
  • Strong understanding of engineering workflows, including CI/CD, release cycles, and infrastructure planning.
  • Experience partnering with engineering and product leadership to achieve cross-team quality and reliability objectives.
  • Excellent communication skills—you can distill complexity into clarity for both technical and non-technical audiences.
  • A track record of building systems and processes that scale with growth.
  • Comfort in ambiguity and eagerness to build structure where there is none.
  • Bachelor’s degree in Computer Science, Engineering, or related technical field.

Bonus Points

  • Familiarity with incident management tooling (PagerDuty, Incident.io), monitoring stacks (Datadog, Prometheus, Grafana), and test automation frameworks (Playwright, Cypress, Selenium).

Compensation

$200,000 - $275,000 USD

Skills

CI/CDPagerdutyIncident.IoDatadogPrometheusGrafanaPlaywrightCypressSeleniumRelease ManagementIncident ManagementTest Automation

Technical Program Manager, Data Center Operations

Own end-to-end data center site handover, SOPs, and incident management programs. Drive cross-functional coordination across design, construction, and ops teams in mission-critical environments.

200k – 270kAustin, TX +2Technical Program ManagementOn-site5+ YOEPMPCapa

Technical Program Manager, Data

Leads data collection and annotation initiatives for audio/ML projects, collaborating with research/product teams to gather requirements, manage resources/vendors, and ensure high-quality delivery. Requires 5+ years with ML teams and data labeling experience.

200k – 260kSan Francisco, CATechnical Program ManagementOn-site5+ YOELLMsData Labeling

Technical Program Manager, Deployments

Leads end-to-end delivery of data center infrastructure and AI cluster deployments, coordinating construction, networking, hardware, and operations teams across sites. Requires 5+ years in data center programs, AI/GPU expertise, and cross-functional influence in ambiguous environments.

200k – 275kNew York, NY +2Technical Program ManagementOn-site5+ YOEIctJira

Operations Program Manager - Robotics Data Acquisition

Own daily operations in robotics data collection facilities, tracking metrics, removing bottlenecks, and rolling out new hardware/processes with engineering and operations teams. Requires 3-5 years in manufacturing/operations and a technical bachelor's degree.

207k – 285kSan Francisco, CATechnical Program ManagementOn-site3+ YOELeanSix Sigma

Technical Program Manager, Safety Systems Engineering

Leads safety systems engineering programs at OpenAI, managing risks, data/compute infrastructure, and stakeholder coordination to ensure safe model deployments. Requires technical expertise, project delivery track record, and knowledge of content moderation in fast-paced environments.

207k – 335kSan Francisco, CATechnical Program ManagementHybridAgi SafetyRisk Management