
Staff Software Engineer - Data Platform

218k – 260k | Mountain View, CA | McLean, VA | Data Engineering | Onsite | 8+ YOE

Summary

Leads design, build, and operation of core data infrastructure including event streaming, relational databases, and batch processing systems. Ensures scalability, reliability, and compliance while mentoring engineers and optimizing pipelines for analytics and ML.

About the role

Responsibilities

  • Own and operate core data infrastructure, including event streaming, relational database, and batch processing platforms.
  • Design and implement highly reliable, observable, and scalable data systems that enable real-time and batch data processing.
  • Develop automation and guardrails for data governance, retention, and compliance, ensuring auditability and consistency across services.
  • Partner with application, platform, and SRE teams to improve data access patterns, reliability SLAs, and recovery processes.
  • Establish standards for data infrastructure monitoring, alerting, and capacity planning, ensuring proactive issue detection.
  • Drive operational excellence by improving resilience, reducing toil, and implementing self-healing or automated recovery mechanisms.
  • Evolve and optimize data pipelines that support downstream analytics, identity verification, and machine learning systems.
  • Evaluate, implement, and operate event-driven and batch data platforms such as Kafka, Google Pub/Sub, Dataflow, or Temporal.
  • Lead incident response and root cause analysis for production data systems, contributing to postmortems and platform improvements.
  • Mentor engineers and advocate for reliability-focused engineering culture across teams.
  • Design and build the data lake storage and compute topology (object storage, partitioning, lifecycle, tiering) to support batch and streaming workloads.

Minimum Qualifications

  • Bachelor's or graduate degree in Computer Science, Software Engineering, or a related technical field.
  • 8+ years of professional experience in data engineering, software engineering, or distributed systems development.
  • 6+ years of programming experience in one or more languages such as Go, Python, or Java, with emphasis on automation and data system integration.

Preferred Qualifications

  • Deep expertise in building and operating data systems—including relational databases, streaming, and batch platforms—in production environments.
  • Hands-on experience administering and optimizing PostgreSQL or other relational databases in the cloud (AWS RDS, CloudSQL, or AlloyDB).
  • Solid understanding of reliability engineering principles, including observability, SLOs, capacity management, and operational readiness.
  • Experience managing cloud infrastructure (AWS or GCP) using infrastructure-as-code and orchestration tools such as Terraform, Kubernetes, or Helm.
  • Experience operating event streaming platforms such as Kafka or Google Pub/Sub.
  • Experience with batch processing, stream processing, and workflow orchestration systems such as Dataflow, Temporal, or Airflow.
  • Strong knowledge of data pipeline orchestration, change data capture, and schema management.
  • Background in automation, incident response, and data platform observability.
  • Familiarity with data governance and regulatory compliance frameworks (e.g., FedRAMP, GDPR, NIST).
  • Contributions to open-source data infrastructure projects or strong community engagement in the data reliability space.
  • Passion for performance engineering, system design, and mentoring others to deliver operational excellence at scale.
  • AI-assisted development — Demonstrable experience using AI developer tools (e.g., code generation, test generation, query synthesis).
  • Data-aware LLM usage — Ability to safely use large language models for tasks such as SQL generation, data lineage summarization, and runbook drafting.

Skills

Go, Python, Java, PostgreSQL, Kafka, Google Pub/Sub, Dataflow, Temporal, Airflow, Terraform, Kubernetes, Helm, AWS, GCP