Staff Data Engineer, tvScientific

156k – 320k | San Francisco, CA | Remote | 5+ YOE | Feb 25

Summary

Lead the design and implementation of scalable identity resolution and data governance platforms. Build pipelines for identity data management, ensure privacy compliance, and partner with teams to deliver reliable data services. Requires 5+ years of Spark/Scala experience.

About the role

What you'll do:

Identity Services:

Design and maintain a scalable identity resolution platform
Build pipelines and services to ingest, normalize, link, and version identity data across multiple sources
Ensure deterministic and probabilistic matching logic is transparent, auditable, and measurable
Partner with product and analytics teams to expose identity data through reliable, well-documented APIs and datasets
Build and operate batch and streaming pipelines using modern data stack tools
Create clear documentation, standards, and runbooks for identity and governance systems

Data Governance & Trust:

Own data governance foundations including data lineage, quality checks, schema enforcement, and access controls
Implement privacy-by-design principles (PII handling, consent enforcement, retention policies)
Collaborate with legal, privacy, and security teams to operationalize regulatory requirements (e.g., GDPR, CCPA)
Establish monitoring and alerting for data quality, freshness, and integrity

What we're looking for:

At least 5 years of data engineering experience, with a proven track record of building data infrastructure using Spark with Scala
Experience delivering significant technical initiatives and building reliable, large-scale services
Experience in delivering APIs backed by relationship-heavy datasets
Experience implementing data governance practices, including data quality, metadata management, and access controls
Strong understanding of privacy-by-design principles and handling of sensitive or regulated data
Familiarity with data lakes, cloud warehouses, and storage formats
Strong proficiency in AWS services
Successful design and implementation of scalable and efficient data infrastructure
High attention to detail in implementation of automated data quality checks
Effective collaboration with cross-functional teams
Excellent written and verbal communication skills
Bachelor's degree in Computer Science or a related field

Skills

Spark, Scala, AWS, Identity Resolution, Data Pipelines, Data Governance, Data Lakes, Streaming Pipelines, APIs, Data Quality
