Skip to content

Senior Product Operations Manager, Evaluation

150k – 210kSan Francisco, CAProduct OperationsHybrid4+ YOE
Summary

Build and scale evaluation systems for AI models at Harvey. Own workflows, tooling, and governance to ensure model accuracy and reliability across jurisdictions.

About the role

What You'll Do

  • Build and scale the systems that power model and product evaluations across Harvey
  • Run intake, triage, and prioritization for the evaluation request queue, routing capacity to the highest-value coverage gaps
  • Embed evaluation workflows and readiness checkpoints into the product development lifecycle
  • Create the single source of truth for evaluation status, results, history, and launch readiness
  • Turn Expert-designed evaluation methodologies into scalable, repeatable operational processes
  • Manage human data providers and stand up our internal contract-attorney pipeline, ensuring evaluation quality meets legal standards
  • Work with Engineering and Research to improve evaluation tooling, automation, and dashboards
  • Drive evaluation readiness for major product and model launches across geographies and jurisdictions
  • Document and operationalize evaluation governance as complexity increases
  • Help define how Harvey ensures model accuracy, reliability, and trust at global scale

What You Have

  • 4–7+ years in technical program management, product operations, research operations, or evaluation/benchmarking roles
  • Experience working with ML/AI evaluations, benchmarking frameworks, or scientific workflows
  • Comfort with statistical methodologies and SQL or Python, or similar tools to interpret evaluation data (either natively or with AI tool support)
  • Strong business acumen with an ability to apply an ROI-focused mindset to scaling
  • Ability to work deeply with legal experts and operationalize complex evaluation methodologies
  • Strong cross-functional coordination skills across Product, Engineering, Research, and data providers/vendors
  • High attention to detail and a bias toward clarity, rigor, and reproducibility
  • Ability to navigate an evolving landscape and bring order to complex systems
  • Strong communication skills and comfort translating technical nuance for diverse stakeholders
  • Desire to do whatever it takes to make evaluation systems successful—from writing documentation to diagnosing pipeline issues
Skills
SQLPythonML/AI evaluationsbenchmarking frameworksstatistical methodologiestechnical program managementproduct operationsresearch operationsevaluation workflowscross-functional coordination
Similar roles at this salary range
All Product Operations jobs →
VGS

Product Operations and Performance Lead

Founding Product Operations Lead defining operational backbone for Product org. Owns planning systems, performance measurement, and execution cadences across Product, Engineering, and GTM teams.

180k – 240kUnited StatesProduct OperationsRemote8+ YOEPM Tool StackRoadmap Planning
Okta

Staff Product Analyst

Staff Product Analyst optimizing end-to-end Revenue Recognition and Order-to-Cash processes using Zuora RevPro and NetSuite. Requires 8+ years experience with hands-on configuration, integration troubleshooting, and financial compliance.

148k – 204kSan Francisco, CAProduct OperationsOn-site8+ YOEBoomiIPaaS
Tennr

AI Operations Lead

Owns internal AI strategy and roadmap, runs discovery across functions, builds and deploys AI tools/workflows, manages tool stack and governance, and drives adoption and enablement.

170k – 220kNew York, NYProduct OperationsOn-site1+ YOESlackNotion
Confido Legal

Product Deployment Specialist

Serves as the bridge between customers, product, and market for an AI-powered deductions platform. Builds internal processes, engages customers, and informs product strategy for CPG finance workflows.

120k – 160kNew York, NYProduct OperationsOn-site3+ YOEMarket ResearchProduct Strategy
Permitflow

Product Operations Manager

Partner with Product, Engineering, and GTM teams to design processes, dashboards, and AI-driven workflows that scale product development and execution at a high-growth AI startup.

120k – 170kNew York, NYProduct OperationsHybrid2+ YOEAI ToolsData Analysis