Staff Data Engineer, Analytics Data Engineering
Leads design of shared data models, standardizes analytics pipelines, and modernizes orchestration for Dropbox's analytics platform. Requires 12+ years in data engineering, expertise in SQL/Python/Airflow/dbt, and cross-team leadership.
Responsibilities
- Lead the design and implementation of shared, reusable data models, defining shared fact tables, conformed dimensions, and a semantic/metrics layer that serves as the single source of truth across analytics functions
- Drive standardization of data engineering practices across ADE and functional analytics teams, including pipeline patterns, CI/CD workflows, naming conventions, and data modeling standards
- Partner with Data Infrastructure to modernize orchestration, improve pipeline decomposition, and establish secure dev/test environments with production data access
- Architect and implement a shift-left data governance strategy, working with upstream data producers to establish data contracts, SLOs, and code-enforced quality gates that catch issues before production
- Collaborate with Data Science leads and Product Management to translate metric definitions into reliable, certified data pipelines that power executive dashboards, WBR reporting, and growth measurement
- Reduce operational burden by improving pipeline granularity, observability, and failure recovery, establishing runbooks and alerting standards that make on-call sustainable
- Evaluate and integrate AI-native tooling into the data development lifecycle, enabling conversational data exploration with guardrails and AI-assisted pipeline development
Requirements
- BS degree in Computer Science or related technical field, or equivalent technical experience
- 12+ years of experience in data engineering or analytics engineering with increasing scope and technical leadership
- 12+ years of SQL experience, including complex analytical queries, window functions, and performance optimization at scale (Spark SQL)
- 8+ years of Python development experience, including building and maintaining production data pipelines
- Deep expertise in dimensional data modeling, schema design, and scalable data architecture, with hands-on experience building shared data models across multiple business domains
- Strong experience with orchestration tools (Airflow strongly preferred) and dbt, including pipeline design, scheduling strategies, and failure recovery patterns
- Demonstrated ability to drive cross-team technical alignment, establishing standards, influencing without authority, and working across Data Engineering, Data Science, Data Infrastructure, and Product Engineering boundaries
Preferred Qualifications
- Experience with Databricks (Unity Catalog, Delta Lake) and modern lakehouse architectures
- Experience leading orchestration or platform modernization efforts at scale
- Familiarity with data governance and observability tools such as Atlan, Monte Carlo, Great Expectations, or similar
- Experience building or contributing to a metrics/semantic layer (dbt MetricFlow, Databricks Metric Views, or equivalent)
- Track record of establishing data engineering standards and best practices in a federated analytics organization
Software Engineer, Data Platform
Build and maintain data infrastructure processing petabytes of data. Own end-to-end projects for data ingestion, transformation, and serving systems. Requires 3+ years of software engineering experience.
Staff Analytics Engineer
Design and maintain a robust business data layer in dbt to enable trusted GTM sales analytics, reporting, data science, and AI capabilities. Requires 8+ years in analytics engineering with advanced SQL and dbt expertise.
Data Engineer
Own and extend customer data ingestion platform and large-scale pipelines powering AI workers. Build data lake, retrieval layer, and infrastructure for syncing, enriching, and querying customer data across CRMs and third-party systems.
Staff Software Engineer, Data Platform
Staff Software Engineer building and scaling high-volume, low-latency distributed data platform services and analytics infrastructure using Java, Kinesis, Flink, Snowflake, and Kubernetes. Requires 8+ years experience and U.S. Person status for FedRAMP access.