Staff Software Engineer - Distributed Data Systems

192k – 260kSan Francisco, CAOnsite8+ YOEJan 30

Summary

Develops distributed data systems like Apache Spark and Delta Lake at massive scale, ensuring high performance and reliability for exabyte-scale workloads. Requires 8+ years in Java/Scala/C++ and deep distributed systems expertise.

About the role

Key Projects

Apache Spark™: Develop the open source standard for big data processing.
Data Plane Storage: Build services for cloud storage like AWS S3 and Azure Blob Store.
Delta Lake: Create storage layer with ACID transactions and time travel.
Delta Pipelines: Orchestrate thousands of data pipelines.
Performance Engineering: Optimize query engines for speed and scalability.

Requirements

BS in Computer Science or equivalent.
8+ years production experience in Java, Scala, or C++.
Strong algorithms, data structures, and distributed systems knowledge.
Experience with databases and big data systems (Apache Spark™, Hadoop).

Nice-to-Haves

MS or PhD in databases or distributed systems.
Comfortable with multi-year visions and customer impact.

Skills

Apache SparkJavaScalaC++Distributed SystemsDelta LakeAWS S3Azure Blob StorageHadoopAlgorithms

Similar roles at this salary range

All Data Engineering jobs →

Honor

Jun 8

Staff Data Platform Engineer

Staff Data Platform Engineer building and leading AWS-native data platform architecture, orchestration, governance, and AI-readiness for analytics and ML workloads. Requires 8-10+ years experience with AWS data systems and strong technical leadership.

194k – 220kUnited StatesData EngineeringRemotedbtPython

Instacart

Jun 8

Senior Data Engineer II, Finance

Senior data engineer building and owning financial data pipelines, models, and ETL/ELT systems for accounting, billing, and revenue reporting at Instacart.

183k – 232kUnited StatesData EngineeringRemoteSQLdbt

Justworks

Jun 8

Manager, Data Engineering

Lead and mentor a team of data engineers building scalable data pipelines and platform infrastructure. Hands-on coding, operational excellence, and cross-functional collaboration with analytics, data science, and business teams.

205k – 262kNew York, NYData EngineeringHybridSQLAWS

Airbnb

Jun 8

Senior Data Engineer, People Analytics

Build and maintain data pipelines, tables, and AI-ready data foundations from HR systems to power People Analytics reporting, dashboards, and LLM tools. Requires 5+ years of data engineering experience with strong SQL, Python, Airflow, and data governance skills.

179k – 210kUnited StatesData EngineeringRemoteSQLAWS

Nuance Labs

Jun 5

Member of Technical Staff — ML Data Infra

Build and operate large-scale multimodal data pipelines for AI avatar model training. Design production-grade systems for petabyte-scale video, audio, and text data.

200k – 300kSeattle, WAData EngineeringOn-siteRayDVC

Apply