Senior Data Engineer - Data Engineering

Builds and owns scalable SQL/Python data pipelines, golden datasets, and workflows using dbt, Airflow, and Redshift for large-scale data (500TB+). Collaborates cross-functionally to enable data-driven decisions at Plaid. Requires 4+ years of data engineering experience.

191k – 287k | San Francisco, CA | Data Engineering | Hybrid | 4+ YOE

About the role

Responsibilities

Understand different aspects of the Plaid product and strategy to inform golden dataset choices, design and data usage principles.
Have data quality and performance top of mind while designing datasets.
Lead key data engineering projects that drive collaboration across the company.
Advocate for adopting industry tools and practices at the right time.
Own core SQL and Python data pipelines that power our data lake and data warehouse.
Deliver well-documented data with defined dataset quality, uptime, and usefulness.

Qualifications

4+ years of dedicated data engineering experience, solving complex data pipeline issues at scale.
Experience building data models and data pipelines on top of large datasets (500TB to petabytes).
Value SQL as a flexible tool and are comfortable with modern SQL data orchestration tools like dbt, Mode, and Airflow.
Experience with performant warehouses and data lakes: Redshift, Snowflake, Databricks.
Experience building and maintaining batch and real-time pipelines using Spark and Kafka.
Appreciation for schema design, evolving analytics schema on top of unstructured data.
Excited to try new technologies, produce proof-of-concepts balancing technical advancement and adoption.
Get deep into managing, deploying, and improving low-level data infrastructure.
Empathetic with stakeholders, listen and collaborate on solutions balancing infra and business needs.
Champion for data privacy and integrity.

Skills

SQL, Python, dbt, Airflow, Redshift, Snowflake, Databricks, Spark, Kafka, Data Pipelines

Similar roles

Data Engineering jobs

Senior Analytics Engineer

Lead analytics engineering for Reddit's Sales and Marketing teams, building scalable data pipelines, ETLs, dashboards, and self-service tools to empower data-driven decision making. Requires 4-5+ years experience with large-scale ETL systems, Python/SQL, and data modeling; advanced quantitative degree required.

191k – 267k | United States | Data Engineering | Remote | 5+ YOE | D3 | SQL

Plaid

Senior Software Engineer - Data Infrastructure

Builds and scales data infrastructure including warehouses, lakehouses, Spark pipelines, streaming, and orchestration to enable data-driven decisions and ML at Plaid. Requires 5+ years experience in data platforms, strong system design, and leadership.

191k – 287k | San Francisco, CA | Data Engineering | Hybrid | 5+ YOE | ETL | Spark

Sentry

Senior Software Engineer, Events Analytics Platform

Senior backend/infrastructure engineer expanding Sentry's time-series data platform (Snuba/ClickHouse) to handle petabyte-scale events with sub-second latency. Requires 4+ years experience and distributed storage expertise.

190k – 280k | San Francisco, CA | Data Engineering | Hybrid | 4+ YOE | Redis | Kafka

Jellyfish

Senior Data Engineer

Jellyfish is seeking a Senior Data Engineer to build, automate, and execute the next generation of their data platform. The role involves maintaining end-to-end data pipelines, modernizing orchestration, and automating data infrastructure.

190k – 240k | United States | Data Engineering | Remote | SQL | dbt

Sage

Lead Data Product Engineer

Leads development and architecture of a client-facing data platform using Palantir Foundry in a low/no-code environment. Collaborates with Product and Design teams and applies software engineering best practices. Requires 7+ years of experience and a bachelor's degree in a quantitative field.

190k – 225k | New York, NY | Data Engineering | Hybrid | 7+ YOE | HIPAA | Python