Skip to content

Data Science Lead

179k – 200kUnited StatesRemote6+ YOE
Summary

Lead data science strategy and architecture for scalable analytics and ML systems. Design data architectures, evaluate novel data sources, establish analytical methodologies, and bridge R&D to production.

About the role

Data Architecture & Scalable Engineering

  • Design and oversee the evolution of scalable data architectures that support advanced analytics, ML modeling, and real-time processing.

R&D & Novel Data Source Evaluation

  • Scout, evaluate, and pressure-test new internal, external, and alternative data sources (e.g., synthetic data, IoT streams, third-party APIs) for predictive power and commercial viability.
  • Lead the ideation and feature engineering for these data sources and document alignment to current and future data architecture designs.
  • Lead rapid prototyping and PoCs to validate new technologies, algorithms, and data structures before scaling to production.
  • Perform technical vetting of data vendors and partners to ensure data quality, density, and seamless integration capabilities.

Methodology & Analytical Rigor

  • Define and document the organization's gold-standard methodologies for statistical analysis, experimental design (A/B testing), and ML modeling.
  • Establish rigorous validation protocols and evaluation metrics (e.g., precision/recall, drift detection, bias/fairness audits) to ensure model and data integrity.
  • Translate academic research and emerging industry trends into practical business methodologies.

Ingestion & Solution Integration

  • Serve as the bridge between R&D and Production, ensuring complex analytical models and data sources are seamlessly ingested into core business products and solutions.
  • Oversee data delivery contracts between the DS ecosystem and downstream software applications to ensure clean, well-documented APIs.

Key Deliverables (First 12 Months)

  • Data Source Playbook: formalized framework for scoring, vetting, and onboarding new data assets.
  • Methodology Registry: centralized repository of approved statistical models, evaluation metrics, and ingestion protocols.
  • Feature Importance Registry & Feature Engineering Roadmap: centralized repository connecting current data sources to product value and impact of removal/substitutes.
  • Architectural Roadmap: 12-month to 3-year vision aligning data science infrastructure with corporate scaling goals.

Technical Stack

  • Python, R, SQL, Cloud Platforms (AWS/GCP/Azure), Big Data tech (Spark, Kafka), Orchestration (Airflow), MLOps tools.

Expertise

  • Deep understanding of data modeling, schema design (SQL/NoSQL), statistical evaluation, and MLOps deployment patterns, especially in R&D functions that bridge research with production.

Soft Skills

  • Exceptional ability to translate complex technical architectures into strategic business value for non-technical stakeholders.
Skills
PythonRSQLAWSGCPAzureSparkKafkaAirflowMLOps
Similar roles at this salary range
All Data Science jobs →
Harvey

Data Scientist, Product

Product Data Scientist embedded with Product and Engineering to define metrics, run experiments, and turn product usage data into actionable recommendations for AI platform decisions.

155k – 240kSan Francisco, CAData ScienceHybrid5+ YOESQLPython
Plaid

Senior Data Scientist

First Data Scientist on Plaid's Embedded Insights team building analytics and measurement frameworks for ML models. Partners with product, engineering, and ML teams to drive data-informed decisions and evaluate model performance.

191k – 263kSan Francisco, CA +2Data ScienceHybrid5+ YOESQLDBT
Gusto

Senior Data Scientist, Risk

Staff Data Scientist on the Risk team driving experimentation, statistical inference, and causal analysis to inform strategic decisions. Requires 7-10 years of DS experience, strong SQL/Python, and proven impact influencing leadership.

186k – 230kSan Francisco, CA +1Data ScienceHybrid7+ YOESQLPython
Findigs

Data Scientist

Own feature engineering, model iteration, and A/B testing for an AI underwriting engine that influences rental decisions. Build production risk models and partner with Product and Engineering on high-stakes decisioning systems.

160k – 185kNew York, NYData ScienceHybrid4+ YOESQLdbt
Coinbase

Senior Data Scientist, CX Analytics

Senior Data Scientist on the CX Analytics team owning revenue calibration models, causal inference experiments, and LLM-powered classification pipelines to connect customer support interactions to business outcomes.

180k – 212kUnited StatesData ScienceRemote5+ YOERSQL