Skip to content

Validation Test Engineer, Test Set Curation

152k – 220kFoster City, CAHybrid3+ YOE
Summary

Develops and curates large-scale test sets from driving logs to validate autonomous driving system performance. Requires Master's/PhD in data-related field, 3+ years experience, Python proficiency, and expertise in building test sets for statistical analysis.

About the role

Responsibilities

  • Generate test sets of driving logs to evaluate the performance of the autonomous driving system
  • Validate and optimize the test sets to ensure they meet requirements and reflect the statistical distribution of our targeted robotaxi service
  • Collaborate with cross-functional stakeholders to gather requirements and integrate test sets with our testing frameworks
  • Develop and maintain data pipelines and tools for efficient test set creation and management
  • Define and document best practices for the design, curation, and maintenance of large-scale test sets
  • Present findings and test-set characteristics to both technical and non-technical audiences to inform system development

Qualifications

  • Master's Degree in Data Engineering, Data Science, Quantitative Science or a related field and at least 3 years of relevant industry experience or PhD in Data Engineering, Data Science, Quantitative Science or a related field
  • Proficiency in Python for data analysis and scripting
  • Experience building and curating large test sets for statistical studies or performance analysis
  • Strong communication and collaboration skills for working with cross-functional teams

Bonus Qualifications

  • Experience with PySpark
  • Experience with big data platforms such as Databricks or AWS EMR
  • Professional experience in the autonomous driving domain
Skills
PythonPySparkDatabricksAWS EMRdata pipelinesdata analysisbig data platformstest set curation
Similar roles at this salary range
All Data Engineering jobs →
Loop Financial

Analytics Engineer

Build and own core data models, ETL pipelines, and analytics infrastructure to enable data-driven decisions across the company and clients. Requires 2+ years building analytical products, strong SQL/Python, and modern data stack experience.

135k – 155kChicago, ILData EngineeringOn-siteSQLdbt
Airbnb

Senior Data Engineer, People Analytics

Build and maintain data pipelines, tables, and AI-ready data foundations from HR systems to power People Analytics reporting, dashboards, and LLM tools. Requires 5+ years of data engineering experience with strong SQL, Python, Airflow, and data governance skills.

179k – 210kUnited StatesData EngineeringRemoteSQLAWS
Pinterest

Engineering Manager II, Big Data Storage

Staff-level engineer leading design and development of Pinterest’s exabyte-scale data lake storage platform using Iceberg and related big data technologies to support ML/AI workloads.

177k – 365kPalo Alto, CAData EngineeringHybridJavaTrino
Turquoise Health

Data Science Engineer, Analytics

Build data pipelines, models, dashboards, and analyses to support product and business decision-making. Requires 2+ years of Python/SQL experience with data modeling, ETL tools, and AWS.

145k – 160kSan Diego, CAData EngineeringRemoteSQLdbt
Skydio

Senior Autonomy Engineer, Data Curation

Senior engineer building data pipelines and tooling to curate large-scale autonomy datasets from drone logs and media for ML model training. Requires 5+ years experience, strong Python/C++ skills, and production data pipeline expertise.

170k – 240kSan Mateo, CAData EngineeringHybridC++ETL