Staff Data Engineer, tvScientific

156k – 320k | San Francisco, CA | Remote | 5+ YOE
Summary

Lead the design and implementation of scalable identity resolution and data governance platforms. Build pipelines for identity data management, ensure privacy compliance, and partner with teams to deliver reliable data services. Requires 5+ years of Spark/Scala experience.

About the role

What you'll do:

Identity Services:

  • Design and maintain a scalable identity resolution platform
  • Build pipelines and services to ingest, normalize, link, and version identity data across multiple sources
  • Ensure deterministic and probabilistic matching logic is transparent, auditable, and measurable
  • Partner with product and analytics teams to expose identity data through reliable, well-documented APIs and datasets
  • Build and operate batch and streaming pipelines using modern data stack tools
  • Create clear documentation, standards, and runbooks for identity and governance systems

Data Governance & Trust:

  • Own data governance foundations including data lineage, quality checks, schema enforcement, and access controls
  • Implement privacy-by-design principles (PII handling, consent enforcement, retention policies)
  • Collaborate with legal, privacy, and security teams to operationalize regulatory requirements (e.g., GDPR, CCPA)
  • Establish monitoring and alerting for data quality, freshness, and integrity

What we're looking for:

  • At least 5 years of data engineering experience, with a proven track record of building data infrastructure using Spark with Scala
  • Experience delivering significant technical initiatives and building reliable, large-scale services
  • Experience in delivering APIs backed by relationship-heavy datasets
  • Experience implementing data governance practices, including data quality, metadata management, and access controls
  • Strong understanding of privacy-by-design principles and handling of sensitive or regulated data
  • Familiarity with data lakes, cloud warehouses, and storage formats
  • Strong proficiency in AWS services
  • Successful design and implementation of scalable and efficient data infrastructure
  • High attention to detail in implementation of automated data quality checks
  • Effective collaboration with cross-functional teams
  • Excellent written and verbal communication skills
  • Bachelor's degree in Computer Science or a related field
Skills
Spark, Scala, AWS, Identity Resolution, Data Pipelines, Data Governance, Data Lakes, Streaming Pipelines, APIs, Data Quality