Skip to content

Senior Software Engineer - Distributed Data Systems

Develop distributed data systems including Apache Spark and Delta Lake to handle big data workloads efficiently. Requires 5+ years in Java/Scala/C++ and expertise in distributed systems.

166k – 225kMountain View, CAData EngineeringOnsite5+ YOE

About the role

Responsibilities

  • Develop Apache Spark, the open source standard for big data.
  • Build reliable, high-performance storage services for cloud backends (AWS S3, Azure Blob Store).
  • Implement Delta Lake for ACID transactions, time travel, and data lake efficiency.
  • Create Delta Pipelines for orchestrating thousands of data pipelines.
  • Engineer query optimizers and execution engines for speed and scalability.

Requirements

  • BS (or higher) in Computer Science or equivalent.
  • 5+ years production experience in Java, Scala, or C++.
  • Strong foundation in algorithms, data structures, and distributed systems.
  • Experience with databases and big data systems (Apache Spark, Hadoop).

Skills

SparkJavaScalaC++Distributed SystemsDelta LakeAws S3Azure Blob StoreAlgorithmsData Structures

Senior Software Engineer, Data Governance & Foundations

Instacart is seeking a Senior Software Engineer for the Data Governance & Foundations team. This role involves defining and driving architecture roadmaps for large-scale data ingestion and processing infrastructure, owning end-to-end platform initiatives, and optimizing infrastructure spend.

166k – 210kUnited StatesData EngineeringRemote5+ YOESQLdbt

Senior Analytics Engineer

Owns the data warehouse, semantic layer, and ingestion pipelines using Snowflake, dbt, and Looker. Architects reliable data models, integrates new sources, enables AI workflows, and sets company-wide metrics standards. Requires 5+ years in analytics/data engineering with strong SQL, dbt, and Python.

166k – 224kSan Francisco, CAData EngineeringOn-site5+ YOESQLdbt

Senior Software Engineer - Distributed Data Systems

Senior engineer building distributed data systems like Apache Spark and Delta Lake to handle big data processing, ETL, and data science workloads. Requires 5+ years in Java/Scala/C++ and expertise in distributed systems.

166k – 225kSan Francisco, CAData EngineeringOn-site5+ YOEC++Java

Senior Software Engineer, Data Products

Senior engineer building performant user-facing data products from internal datasets using Python, Databricks, and Postgres while collaborating with platform teams.

165k – 235kUnited StatesData EngineeringRemote5+ YOESQLPython

Senior Data Engineer

Designs and runs massive-scale data pipelines for ingestion, normalization, enrichment, and delivery across 80M+ companies and 800M+ people. Manages data operations, BPO vendors, partnerships, monitoring, and cost optimization using Python, Dagster, and DuckDB.

165k – 250kNew York, NYData EngineeringOn-siteELTETL