Skip to content

Data Scientist, Safety Systems

Lead data-driven safety evaluation for AI production systems by defining metrics, implementing statistical methods, building dashboards, and analyzing real-world impacts. Requires 5+ years in quantitative roles with leadership and strong stats expertise.

255k – 405kSan Francisco, CAData ScienceOnsite5+ YOE

About the role

In this role, you will:

  • Lead efforts in understanding and measuring real-world safety impacts of OpenAI’s products
  • Uncover new ways to improve measuring and mitigating harm and abuse
  • Develop and implement statistical methods to operationalize safety-related metrics
  • Provide direction, guidance, and coordination of projects
  • Establish a data-driven culture by defining, tracking, and operationalizing metrics
  • Create and disseminate dashboards, reports, and tools for safety questions
  • Develop safety data flywheel and provide production insights for research

You might thrive in this role if you have:

  • 5+ years experience in a quantitative role in ambiguous environments, ideally as a founding data scientist or team lead at a hyper-growth company
  • Proven leadership skills, including leading data scientists and cross-functional teams
  • Expertise in defining and implementing metrics, operationalizing new ones from scratch
  • Excellent communication skills with product managers, engineers, and executives
  • Strategic insights beyond traditional statistical testing

You could be an especially great fit if you have:

  • Experience in trust and safety, integrity, anti-abuse, or related fields
  • Prior experience in NLP, large language models, or generative AI
  • Strong statistical background, including sampling, regression, causal analysis

Skills

StatisticsMetricsDashboardsSQLPythonNLPLLMsGenerative AICausal AnalysisRegressionSamplingData AnalysisMachine Learning

Similar roles

Data Science jobs

Data Science Manager, Integrity

Leads a data science team focused on trust & safety, fraud prevention, and risk analysis for AI integrity. Drives analytical strategy, scales team operations, and partners cross-functionally to mitigate evolving threats using advanced DS techniques.

255k – 490kSan Francisco, CAData ScienceOn-siteLLMsmetrics

Economist

Economist (up to 5 years post-PhD) conducting empirical research on AI’s economic impacts using large datasets, causal inference, and structural modeling. Requires PhD and strong econometrics/SQL/Python skills.

266k – 385kSan Francisco, CAData ScienceHybrid3+ YOERSQL

Data Scientist, Safety

Data Scientist focused on safety at OpenAI, building analytics to measure harmful behavior, detect fraud, evaluate safety systems, and inform critical decisions on AI deployment. Requires strong SQL/Python, statistical reasoning, and experience with causal analysis.

230k – 325kSan Francisco, CA +1Data ScienceHybridSQLPython

Data Scientist, Financial Engineering

Owns analytics and experimentation for checkout, payments, subscriptions, and pricing to boost revenue, reduce churn, and scale globally. Requires 5+ years in data science or product analytics with SQL/Python fluency and A/B testing expertise in high-growth/fintech settings.

230k – 385kSan Francisco, CAData ScienceHybrid5+ YOESQLCUPED

Data Scientist, Codex

Data Scientist measures and accelerates product-market fit for AI developer tools by defining developer productivity metrics, running A/B tests on coding models and UX, and building dashboards. Requires 5+ years quantitative experience, SQL/Python fluency, and experiment design skills.

230k – 385kSan Francisco, CAData ScienceHybrid5+ YOESQLNLP