Senior Software Engineer - Distributed Data Systems
Senior engineer building distributed data systems like Apache Spark and Delta Lake to handle big data processing, ETL, and data science workloads. Requires 5+ years in Java/Scala/C++ and expertise in distributed systems.
Responsibilities
- Develop Apache Spark, the open-source standard for big data.
- Build reliable, high-performance storage services for cloud backends like AWS S3 and Azure Blob Storage.
- Implement Delta Lake for scalable data lakes with ACID transactions and time travel.
- Create Delta Pipelines to orchestrate thousands of data pipelines.
- Engineer query optimizers and execution engines for speed and scalability.
Requirements
- BS (or higher) in Computer Science or equivalent.
- 5+ years of production experience in Java, Scala, or C++.
- Strong foundation in algorithms, data structures, and distributed systems.
- Experience with databases and big data systems (Apache Spark, Hadoop).
- Comfortable working toward a multi-year vision while delivering customer impact.
Staff Data Platform Engineer
Build and lead AWS-native data platform architecture, orchestration, governance, and AI-readiness for analytics and ML workloads. Requires 8-10+ years of experience with AWS data systems and strong technical leadership.
Analytics Engineer
Build and own core data models, ETL pipelines, and analytics infrastructure to enable data-driven decisions across the company and clients. Requires 2+ years building analytical products, strong SQL/Python, and modern data stack experience.
Senior Data Engineer, People Analytics
Build and maintain data pipelines, tables, and AI-ready data foundations from HR systems to power People Analytics reporting, dashboards, and LLM tools. Requires 5+ years of data engineering experience with strong SQL, Python, Airflow, and data governance skills.