Skip to content

Product Manager, Public Sector GenAI Test & Evaluation (T&E)

Defines vision and roadmap for GenAI test & evaluation infrastructure in public sector, traversing engineering orgs to resolve bottlenecks and drive execution for government agentic applications. Requires 3+ years technical experience in engineering or program management with AI evaluation expertise.

154k – 257kSan Francisco, CASt. Louis, MONew York, NY+1 moreProduct ManagementOnsite3+ YOE

About the role

Minimum Qualifications

  • Engineering Depth: 3+ years of experience in software engineering, systems architecture, or highly technical program management. Must be able to read code, understand system architecture, and participate in technical design reviews alongside engineering teams.
  • Evaluation Systems Expertise: Proven experience designing, owning the roadmap for, or operating the infrastructure required to continuously measure, improve, and show the performance of AI applications.
  • Problem Distillation: Demonstrated experience taking a vaguely defined problem (e.g., "our evaluation cycles are too slow") and delivering a technical roadmap, resource requirements, and measurable success metrics within a narrow time window.
  • Ambiguity Management: Proven track record of taking a project from "stalled/undefined" to "shipped" in a high-pressure environment. Point to at least two instances where you inherited a failing project and saw it through to production.
  • Cross-Functional Leadership: Led multiple projects that required direct alignment between at least three distinct engineering organizations (e.g., Infrastructure, ML Research, and Product).
  • Operational Execution: Experience using technical project management frameworks (e.g., Linear) to provide consistent weekly reporting on delivery velocity and blockers to executive stakeholders.

Preferred Qualifications

  • Security Clearance: Active Secret, Top Secret, or TS/SCI clearance.
  • GenAI Implementation: Practical experience developing or evaluating features built specifically on LLMs, RAG, or autonomous agent workflows.
  • Technical Rigor: Advanced degree in Computer Science, Engineering, or a related field.
  • Public Sector Expertise: 2+ years of experience working with DoD, IC, or Civil agencies on mission-critical software deployments.

Compensation

Base salary range varies by location:

  • San Francisco, New York: $205,600—$257,000 USD
  • Hawaii, Washington DC, Texas, Colorado: $184,800—$231,000 USD
  • St. Louis: $154,400—$193,000 USD

Includes equity, comprehensive health/dental/vision, retirement, learning stipend, PTO, commuter stipend.

Skills

LLMsRAGAutonomous AgentsSoftware EngineeringSystems ArchitectureAi EvaluationTechnical Project ManagementLinearMl ResearchInfrastructure

Technical Product Manager - AI & Data Analytics

Own AI/ML and analytics product vision, co-develop predictive models and data features with enterprise clients, and drive GTM for customer-facing SaaS data products. Requires 3+ years in PM, analytics, or data engineering plus strong SQL and BI skills.

155k – 160kUnited StatesProduct ManagementRemote3+ YOERSQL

Product Manager II - Application Performance Monitoring

Product Manager II for APM at Datadog, defining and delivering AI-driven observability features including distributed tracing, performance analysis, and intelligent troubleshooting for developer-focused SaaS.

155k – 195kNew York, NYProduct ManagementHybrid3+ YOEData AnalysisDeveloper Tools

Product Manager II - Identity Security

Build identity security features for Datadog's Cloud Security Management, focusing on inventory, risk detection, and permissions analysis for human/non-human identities in multi-cloud environments. Requires 3+ years enterprise software experience, cloud IAM expertise, and strong cross-functional collaboration.

155k – 215kNew York, NYProduct ManagementHybrid3+ YOEScpsClaude

Product Manager II - Model Lab

Lead 0→1 development of Model Lab, Datadog's experiment tracking platform for AI/ML teams. Define vision, conduct discovery with research and engineering teams, and drive execution requiring 4+ years PM experience in developer tools or ML systems.

155k – 190kNew York, NYProduct ManagementHybrid4+ YOEJAXAPIs

Product Manager, Integrations

Own the integrations platform roadmap, connector strategy, and 3rd-party API integrations for an identity governance platform. Drive reliability, self-service experience, and data quality across 250+ connectors.

156k – 210kUnited StatesProduct ManagementRemote5+ YOEETLB2B SaaS