Skip to content

Software Engineer, Data Infrastructure

185k – 385kSan Francisco, CAHybrid4+ YOE
Summary

Builds and operates scalable data infrastructure including compute fleets, storage systems, and streaming platforms to support OpenAI's AI products, research, and analytics. Requires 4+ years in data or infrastructure engineering with expertise in Spark, Kafka, and distributed systems.

About the role

Responsibilities

  • Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, machine learning infrastructure while ensuring scalability, reliability, and security.
  • Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient.
  • Accelerate company productivity by empowering your fellow engineers & teammates with excellent data tooling and systems.
  • Collaborate with product, research and analytics teams to build the technical foundations capabilities that unlock new features and experiences.
  • Own the reliability of the systems you build, including participation in an on-call rotation for critical incidents.

Requirements

  • 4+ years in data infrastructure engineering OR 4+ years in infrastructure engineering with a strong interest in data.
  • Experience supporting Spark, Kafka, Flink, Airflow, Trino, or Iceberg as platforms.
  • Well-versed in infrastructure tooling like Terraform.
  • Experienced in debugging large-scale distributed systems.

Nice-to-Haves

  • Comfortable with ambiguity and rapid change.
  • Intrinsic desire to learn and fill in missing skills, and talent for sharing learnings clearly.
Skills
SparkKafkaFlinkAirflowTrinoIcebergTerraformDelta LakeKubernetesChronon
Similar roles at this salary range
All Data Engineering jobs →
Honor

Staff Data Platform Engineer

Staff Data Platform Engineer building and leading AWS-native data platform architecture, orchestration, governance, and AI-readiness for analytics and ML workloads. Requires 8-10+ years experience with AWS data systems and strong technical leadership.

194k – 220kUnited StatesData EngineeringRemotedbtPython
Instacart

Senior Data Engineer II, Finance

Senior data engineer building and owning financial data pipelines, models, and ETL/ELT systems for accounting, billing, and revenue reporting at Instacart.

183k – 232kUnited StatesData EngineeringRemoteSQLdbt
Justworks

Manager, Data Engineering

Lead and mentor a team of data engineers building scalable data pipelines and platform infrastructure. Hands-on coding, operational excellence, and cross-functional collaboration with analytics, data science, and business teams.

205k – 262kNew York, NYData EngineeringHybridSQLAWS
Airbnb

Senior Data Engineer, People Analytics

Build and maintain data pipelines, tables, and AI-ready data foundations from HR systems to power People Analytics reporting, dashboards, and LLM tools. Requires 5+ years of data engineering experience with strong SQL, Python, Airflow, and data governance skills.

179k – 210kUnited StatesData EngineeringRemoteSQLAWS
Nuance Labs

Member of Technical Staff — ML Data Infra

Build and operate large-scale multimodal data pipelines for AI avatar model training. Design production-grade systems for petabyte-scale video, audio, and text data.

200k – 300kSeattle, WAData EngineeringOn-siteRayDVC