Staff Data Platform Engineer

180k – 275kNew York, NYHybrid8+ YOEApr 16

Summary

Leads evolution of data platform, defining contracts, APIs, governance, and access patterns for lakehouse/warehouse ecosystems. Requires 8+ years data/platform engineering expertise in Python, SQL, Spark, Snowflake, and Kafka for cross-functional teams including AI use cases.

About the role

Responsibilities

Define and evolve data contract standards, including schema enforcement, versioning, and validation patterns.
Design interoperable ingestion and publishing frameworks for upstream producers to integrate with the data platform.
Build and standardize APIs, libraries, or SDKs for event logging, schema validation, and contract compliance.
Establish best practices for schema registry usage and distributed schema validation across streaming and batch systems (e.g., Kafka).
Design patterns for data lake vs. warehouse usage, curated layers exposure, and safe downstream access.
Lead reverse ETL and activation architecture for operational use cases.
Define and enforce access control, governance, and compliance standards (e.g., PHI/PII handling, DEID boundaries, RBAC).
Partner with Product Engineering, Security, Compliance, Analytics Engineering, and Infrastructure teams.
Mentor engineers and influence engineering culture around data quality, ownership, and contracts.
Drive adoption of AI-assisted development practices and design guardrails for AI access to data systems.

Requirements

8+ years of experience in data engineering, platform engineering, or backend platform development.
Demonstrated experience designing data contracts, schema governance, or producer/consumer standards at scale.
Strong expertise in Python and SQL, with hands-on experience building scalable data frameworks.
Experience with distributed data systems such as Spark (Databricks or EMR) and modern lakehouse architectures (Delta Lake / Iceberg).
Experience with data warehouses such as Snowflake and strong understanding of performance and access patterns.
Familiarity with schema registry systems and schema evolution in streaming systems (e.g., Kafka).
Experience building APIs, shared libraries, or platform services adopted by multiple teams.
Strong understanding of access control, RBAC, and compliance constraints in regulated environments.
Proven ability to lead cross-functional architectural initiatives.
Clear communication skills and track record of influencing standards.
Experience with AI-assisted development tools or cloud-based coding environments.
Strong understanding of governance for GenAI systems.

Nice-to-haves

Experience designing reverse ETL frameworks or operational activation pipelines.
Familiarity with metadata and governance platforms (e.g., Unity Catalog, Collibra, OpenMetadata).
Experience building internal developer platforms for event logging or data publishing.
Experience in regulated environments involving PHI/PII.
Experience integrating streaming systems (Kafka/Kinesis) with warehouse and lakehouse ecosystems.

Skills

PythonSQLSparkSnowflakeDelta LakeIcebergKafkaDatabricksUnity Catalogschema registry

Similar roles at this salary range

All Data Engineering jobs →

Headway

Jun 12

Staff Data Infrastructure Engineer

Staff-level Data Infrastructure Engineer to architect and evolve the data platform (Snowflake, ingestion, orchestration, CI/CD, AWS infra) serving analytics, product, and ML teams. Requires 10+ years building scalable data platforms and proven technical leadership.

212k – 265kNew York, NY +2Data EngineeringHybrid10+ YOEAWSSQL

Headway

Jun 12

Senior Manager, Data Engineering

Lead and scale Headway's data engineering team, owning architecture for data warehouse, pipelines, dbt transformations, and orchestration to power analytics, ML, and operations. Requires 8+ years data engineering experience and 3+ years managing teams.

212k – 265kNew York, NY +2Data EngineeringHybrid8+ YOEdbtData Modeling

Jellyfish

Jun 12

Staff Data Engineer

Staff Data Engineer to define architecture and build scalable data pipelines, integrations, and workflow orchestration systems. Requires deep Python expertise, IaC fluency, and technical leadership across AI-driven data infrastructure.

200k – 260kUnited StatesData EngineeringRemote7+ YOECI/CDPython

Airbnb

Jun 11

Senior Software Engineer, Data Infrastructure

Design, build, and operate Airbnb's next-generation big data compute platform using Spark, Trino, and related technologies. Requires 5+ years of data infrastructure experience and strong distributed systems expertise.

191k – 225kUnited StatesData EngineeringRemote5+ YOEHiveJava

Pinecone

Jun 9

Senior Data Platform Engineer

Senior Data Platform Engineer to own ingestion, transformation, orchestration, and metrics layers powering business analytics. Requires 4+ years building production data pipelines with strong SQL, BigQuery, Airflow/Kubernetes, and AI coding tools experience.

176k – 200kNew York, NYData EngineeringHybrid4+ YOESQLDBT

Apply