Staff Data Platform Engineer
Leads the evolution of the data platform, defining contracts, APIs, governance, and access patterns for lakehouse and warehouse ecosystems. Requires 8+ years of data or platform engineering experience, with expertise in Python, SQL, Spark, Snowflake, and Kafka, in support of cross-functional teams and AI use cases.
Responsibilities
- Define and evolve data contract standards, including schema enforcement, versioning, and validation patterns.
- Design interoperable ingestion and publishing frameworks for upstream producers to integrate with the data platform.
- Build and standardize APIs, libraries, or SDKs for event logging, schema validation, and contract compliance.
- Establish best practices for schema registry usage and distributed schema validation across streaming and batch systems (e.g., Kafka).
- Design patterns for data lake versus warehouse usage, exposure of curated layers, and safe downstream access.
- Lead reverse ETL and activation architecture for operational use cases.
- Define and enforce access control, governance, and compliance standards (e.g., PHI/PII handling, de-identification (DEID) boundaries, RBAC).
- Partner with Product Engineering, Security, Compliance, Analytics Engineering, and Infrastructure teams.
- Mentor engineers and influence engineering culture around data quality, ownership, and contracts.
- Drive adoption of AI-assisted development practices and design guardrails for AI access to data systems.
Requirements
- 8+ years of experience in data engineering, platform engineering, or backend platform development.
- Demonstrated experience designing data contracts, schema governance, or producer/consumer standards at scale.
- Strong expertise in Python and SQL, with hands-on experience building scalable data frameworks.
- Experience with distributed data systems such as Spark (Databricks or EMR) and modern lakehouse architectures (Delta Lake / Iceberg).
- Experience with data warehouses such as Snowflake and strong understanding of performance and access patterns.
- Familiarity with schema registry systems and schema evolution in streaming systems (e.g., Kafka).
- Experience building APIs, shared libraries, or platform services adopted by multiple teams.
- Strong understanding of access control, RBAC, and compliance constraints in regulated environments.
- Proven ability to lead cross-functional architectural initiatives.
- Clear communication skills and a track record of influencing standards.
- Experience with AI-assisted development tools or cloud-based coding environments.
- Strong understanding of governance for GenAI systems.
Nice-to-haves
- Experience designing reverse ETL frameworks or operational activation pipelines.
- Familiarity with metadata and governance platforms (e.g., Unity Catalog, Collibra, OpenMetadata).
- Experience building internal developer platforms for event logging or data publishing.
- Experience in regulated environments involving PHI/PII.
- Experience integrating streaming systems (Kafka/Kinesis) with warehouse and lakehouse ecosystems.
Staff Data Infrastructure Engineer
Staff-level Data Infrastructure Engineer to architect and evolve the data platform (Snowflake, ingestion, orchestration, CI/CD, AWS infrastructure) serving analytics, product, and ML teams. Requires 10+ years of experience building scalable data platforms and proven technical leadership.
Senior Manager, Data Engineering
Lead and scale Headway's data engineering team, owning the architecture for the data warehouse, pipelines, dbt transformations, and orchestration that power analytics, ML, and operations. Requires 8+ years of data engineering experience and 3+ years of experience managing teams.
Staff Data Engineer
Staff Data Engineer to define architecture and build scalable data pipelines, integrations, and workflow orchestration systems. Requires deep Python expertise, IaC fluency, and technical leadership across AI-driven data infrastructure.
Senior Software Engineer, Data Infrastructure
Design, build, and operate Airbnb's next-generation big data compute platform using Spark, Trino, and related technologies. Requires 5+ years of data infrastructure experience and strong distributed systems expertise.
Senior Data Platform Engineer
Senior Data Platform Engineer to own the ingestion, transformation, orchestration, and metrics layers powering business analytics. Requires 4+ years of experience building production data pipelines, strong SQL, BigQuery, and Airflow/Kubernetes skills, and experience with AI coding tools.