# Sr. Technical Program Manager (TPM)
**Company:** [Together AI](https://hotfix.jobs/companies/together-ai)
**Location:** San Francisco, CA
**Salary:** $225K-$265K
**Experience:** 5+ years
**Skills:** AWS, GCP, Azure, Kubernetes, Docker, Observability, Distributed Systems, AI/ML, DevOps, SRE
**Posted:** 2026-04-07
> Leads development and scaling of global GPU infrastructure for AI, owning product roadmaps for observability, storage, networking, and security. Requires 5+ years in AI/ML infrastructure, cloud platforms, and cross-functional leadership in a fast-paced startup.
## Job Description
## Responsibilities

- **Product Development**: Design and build products for AI researchers, developers, and enterprise customers, translating technical requirements into product features and collaborating with research, engineering, and design teams. Develop and execute strategic plans for the **Observability, Storage, Network Engineering, and Security** infrastructure teams.
- **End-to-End Product Ownership**: Own a comprehensive product roadmap, detailing key features, enhancements, and releases. Drive end-to-end product development, manage development and testing, and lead launches.
- **Stakeholder Engagement**: Engage with stakeholders to understand their needs, pain points, and feedback. Drive initiatives to enhance customer satisfaction and loyalty through product improvements and innovative solutions.
- **Cross-Functional Execution**: Lead and align diverse cross-functional teams—including **Research, Engineering, DevOps, SRE, and Go-to-Market**—to ensure seamless project delivery and organizational success.

## Requirements

- **ML Product or Infrastructure Experience**: 5+ years of experience building and scaling **AI/ML-powered products and infrastructure**, specifically collaborating with research and engineering teams.
- Proven experience with large-scale technology deployments, including **cloud computing platforms**, **decentralized cloud infrastructure**, and **distributed systems** (e.g., **containerization and orchestration tools**).
- Familiarity with the technical domains of **Observability, Storage, Network Engineering, and Security** for infrastructure.
- Experience with **cloud-based technologies** (e.g., **AWS, Google Cloud, or Azure**).
- **Technical Foundation**: **Bachelor's or Master's degree** in **Machine Learning, Computer Science, Engineering**, or a related field.
- Exceptional analytical and problem-solving skills, with a demonstrated ability to identify and proactively mitigate technical risks.
- Experience using **AI tools**, such as **ClaudeCode** or similar, to accelerate analytical progress.
- **Executive and Organizational Acumen**:
  - Proven ability to thrive in a fast-paced, ambiguous startup environment, prioritizing complex tasks and managing multiple simultaneous projects.
  - Strong organizational abilities to build cross-functional alignment and establish clear, focused priorities.
  - A proactive and collaborative team-oriented approach, demonstrating a willingness to drive necessary outcomes across the company.
  - Excellent communication and program management skills for effective collaboration with both internal stakeholders and external vendors.

## Compensation

US base salary range: **$225k to $265k + equity + benefits**. Salary determined by location, level, role, experience, skills, and job-related knowledge.
**Apply:** https://hotfix.jobs/jobs/sr-technical-program-manager-tpm-at-together-ai-124cf8de-1ade-41f4-ba5f-9aed580aa0d2
**Canonical:** https://hotfix.jobs/jobs/sr-technical-program-manager-tpm-at-together-ai-124cf8de-1ade-41f4-ba5f-9aed580aa0d2