Skip to content

Data Scientist, Preparedness

Data Scientist evaluates and improves AI mitigation systems, focusing on classifiers and detection pipelines for risks like biosecurity and cybersecurity. Builds monitoring frameworks and drives cross-functional impact using Python, SQL, and analytical expertise.

347k – 400kSan Francisco, CACaliforniaData ScienceOnsite

About the role

What You’ll Do

  • Evaluate and improve mitigation systems, including classifiers and detection pipelines across domains (e.g., biosecurity, cybersecurity, emerging risk areas).
  • Diagnose false positives and false negatives with deep error analysis, root cause investigation, and clear recommendations.
  • Build monitoring and measurement frameworks to track mitigation effectiveness over time and across user segments/use cases.
  • Identify trends in over-blocking vs. under-blocking, quantify customer impact, and propose prioritized interventions.
  • Develop insights from customer feedback, complaints, and usage patterns to detect shifts in adversarial behavior and system failure modes.
  • Expand risk monitoring into new areas, including cybersecurity threats and model loss-of-control/sabotage scenarios, partnering with domain experts.
  • Communicate results to technical/executive stakeholders with crisp narratives, decision-ready metrics, and clear tradeoffs.

Qualifications

  • Significant experience in data science or applied analytics in high-stakes domains (e.g., security, trust & safety, abuse prevention, fraud, platform integrity, reliability).
  • Strong foundations in experimentation, causal thinking, observational inference; design robust measurement under imperfect data.
  • Fluency in SQL and Python (or equivalent) for analysis, modeling, monitoring workflows.
  • Experience building metrics, dashboards, operational monitoring that changes outcomes.
  • Track record driving cross-functional impact with engineering, product, research partners.

Nice-to-haves:

  • Cybersecurity data science experience (threat modeling, adversarial dynamics, abuse patterns, security telemetry).
  • Classifier evaluation, calibration, thresholding, error analysis at scale; familiarity with detection systems in adversarial settings.
  • Trust & Safety experience.
  • Interest in AI safety, alignment, catastrophic risk prevention.

Skills

PythonSQLData ScienceMachine LearningClassifiersExperimentationCausal InferenceDashboardsMonitoringCybersecurity

Similar roles

Data Science jobs

Research Economist, Economic Research

Measures AI's economic impacts through the Anthropic Economic Index using econometrics, ML, and novel data. Conducts empirical research on labor markets, productivity, inequality; requires PhD in Economics and strong empirical track record.

320k – 405kSan Francisco, CAData ScienceHybridRSQL

Data Scientist, Core Experimentation

Leads evolution of OpenAI's core experimentation platform, driving statistical strategy, designing methodologies, and building scalable Python/Spark pipelines to ensure reliable, trustworthy experiments at massive scale. Requires deep stats expertise, causal inference, and production experimentation experience.

293k – 325kBellevue, WA +1Data ScienceHybridSparkCuped

Data Scientist, Integrity Measurement

Owns measurement, metrics, and analysis for trust & safety harms including prevalence estimation and response gaps using AI-first methods. Requires strong statistics, data programming (Python/R/SQL), and trust/safety experience.

293k – 385kSan Francisco, CA +1Data ScienceHybridRSQL

Data Scientist, Supply

Data Scientist focused on compute allocation and causal inference to optimize AI infrastructure decisions and connect supply choices to user outcomes. Requires strong Python/SQL skills and experience with constrained optimization and production systems.

285k – 460kSan Francisco, CA +1Data ScienceOn-site5+ YOESQLPython

Economist

Economist (up to 5 years post-PhD) conducting empirical research on AI’s economic impacts using large datasets, causal inference, and structural modeling. Requires PhD and strong econometrics/SQL/Python skills.

266k – 385kSan Francisco, CAData ScienceHybrid3+ YOERSQL