Skip to content

Data Scientist, Codex

Data Scientist measures and accelerates product-market fit for AI developer tools by defining developer productivity metrics, running A/B tests on coding models and UX, and building dashboards. Requires 5+ years quantitative experience, SQL/Python fluency, and experiment design skills.

230k – 385kSan Francisco, CAData ScienceHybrid5+ YOE

About the role

Responsibilities

  • Embed with the Codex product team to discover opportunities that improve developer outcomes and growth
  • Design and interpret A/B tests and staged rollouts of new coding models and product features
  • Define and operationalize metrics such as suggestion acceptance, edit distance, compile/test pass rates, task completion, latency, and session productivity
  • Build dashboards and analyses that help the team self-serve answers to product questions (by language, framework, repo size, task type)
  • Diagnose failure modes and partner with Research on targeted improvements (model quality signals, user feedback, evals)

Requirements

  • 5+ years in a quantitative role at a developer-facing or high-growth product
  • Fluency in SQL and Python; comfort with experiment design and causal inference
  • Experience defining product metrics tied to user value
  • Ability to communicate clearly with PM, Eng, and Design—and to influence product direction

Nice-to-Haves

  • Strong programming background; ability to prototype, run simulations, and reason about code quality
  • Familiarity with IDE/extension telemetry or developer tooling analytics
  • Prior experience with NLP/LLMs, code models, or evaluations for generative coding

Skills

SQLPythonA/B TestingCausal InferenceProduct MetricsDashboardsNLPLLMsCode Models

Similar roles

Data Science jobs

Data Scientist, Safety

Data Scientist focused on safety at OpenAI, building analytics to measure harmful behavior, detect fraud, evaluate safety systems, and inform critical decisions on AI deployment. Requires strong SQL/Python, statistical reasoning, and experience with causal analysis.

230k – 325kSan Francisco, CA +1Data ScienceHybridSQLPython

Data Scientist, Financial Engineering

Owns analytics and experimentation for checkout, payments, subscriptions, and pricing to boost revenue, reduce churn, and scale globally. Requires 5+ years in data science or product analytics with SQL/Python fluency and A/B testing expertise in high-growth/fintech settings.

230k – 385kSan Francisco, CAData ScienceHybrid5+ YOESQLCuped

Data Scientist, Platform and B2B Products

Drive data-driven decisions for OpenAI's API and B2B platform by defining metrics, running A/B tests, building dashboards, and partnering with PMs and engineers to enhance developer experience and product impact. Requires 5+ years in quantitative roles with strong SQL/Python skills.

230k – 385kSan Francisco, CAData ScienceHybrid5+ YOESQLNLP

Data Scientist, Infrastructure

Data Scientist shapes infrastructure scaling for OpenAI's AI models and products by building datasets, metrics, forecasting/optimization models, and dashboards. Partners with engineering, research, and product teams; requires 5+ years experience, SQL/Python expertise.

230k – 385kSan Francisco, CAData ScienceHybrid5+ YOESQLNLP

Data Scientist, Product

Data Scientist embedding with product teams to define metrics, run A/B tests, build dashboards, and drive data-informed decisions for consumer and enterprise AI products. Requires 5+ years quantitative experience with SQL/Python in hyper-growth environments.

230k – 385kSan Francisco, CA +1Data ScienceHybrid5+ YOESQLNLP