Senior Data Engineer - Data Engineering

Builds and owns scalable SQL/Python data pipelines, golden datasets, and workflows using DBT, Airflow, and Redshift for large-scale data (500TB+). Collaborates cross-functionally to enable data-driven decisions at Plaid. Requires 4+ years of data engineering experience.

191k – 287k | San Francisco, CA | Data Engineering | Hybrid | 4+ YOE

About the role

Responsibilities

  • Understand different aspects of the Plaid product and strategy to inform golden dataset choices, design and data usage principles.
  • Have data quality and performance top of mind while designing datasets.
  • Lead key data engineering projects that drive collaboration across the company.
  • Advocate for adopting industry tools and practices at the right time.
  • Own core SQL and Python data pipelines that power our data lake and data warehouse.
  • Deliver well-documented data with defined dataset quality, uptime, and usefulness.

Qualifications

  • 4+ years of dedicated data engineering experience, solving complex data pipeline issues at scale.
  • Experience building data models and data pipelines on top of large datasets (500TB to petabytes).
  • Value SQL as a flexible tool and are comfortable with modern SQL data orchestration tools like DBT, Mode, and Airflow.
  • Experience with performant warehouses and data lakes: Redshift, Snowflake, Databricks.
  • Experience building and maintaining batch and real-time pipelines using Spark and Kafka.
  • Appreciation for schema design, evolving analytics schema on top of unstructured data.
  • Excited to try new technologies, produce proof-of-concepts balancing technical advancement and adoption.
  • Get deep into managing, deploying, and improving low-level data infrastructure.
  • Empathetic with stakeholders; listen and collaborate on solutions that balance infra and business needs.
  • Champion for data privacy and integrity.

Skills

SQL, Python, dbt, Airflow, Redshift, Snowflake, Databricks, Spark, Kafka, Data Pipelines

Senior Analytics Engineer

Lead analytics engineering for Reddit's Sales and Marketing teams, building scalable data pipelines, ETLs, dashboards, and self-service tools to empower data-driven decision making. Requires 4-5+ years experience with large-scale ETL systems, Python/SQL, and data modeling; advanced quantitative degree required.

191k – 267k | United States | Data Engineering | Remote | 5+ YOE | D3 | SQL

Senior Software Engineer - Data Infrastructure

Builds and scales data infrastructure including warehouses, lakehouses, Spark pipelines, streaming, and orchestration to enable data-driven decisions and ML at Plaid. Requires 5+ years experience in data platforms, strong system design, and leadership.

191k – 287k | San Francisco, CA | Data Engineering | Hybrid | 5+ YOE | ETL | Spark

Senior Software Engineer, Events Analytics Platform

Senior backend/infrastructure engineer expanding Sentry's time-series data platform (Snuba/ClickHouse) to handle petabyte-scale events with sub-second latency. Requires 4+ years experience and distributed storage expertise.

190k – 280k | San Francisco, CA | Data Engineering | Hybrid | 4+ YOE | Redis | Kafka

Senior Data Engineer

Jellyfish is seeking a Senior Data Engineer to build, automate, and execute the next generation of their data platform. The role involves maintaining end-to-end data pipelines, modernizing orchestration, and automating data infrastructure.

190k – 240k | United States | Data Engineering | Remote | SQL | dbt

Lead Data Product Engineer

Leads development and architecture of a client-facing data platform using Palantir Foundry in a low/no-code environment. Collaborates with Product and Design teams and applies software engineering best practices. Requires 7+ years of experience and a bachelor's degree in a quantitative field.

190k – 225k | New York, NY | Data Engineering | Hybrid | 7+ YOE | HIPAA | Python