Software Engineer, Data Infrastructure
Builds and operates scalable data infrastructure including compute fleets, storage systems, and streaming platforms to support OpenAI's AI products, research, and analytics. Requires 4+ years in data or infrastructure engineering with expertise in Spark, Kafka, and distributed systems.
Responsibilities
- Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, and machine learning infrastructure, while ensuring scalability, reliability, and security.
- Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient.
- Accelerate company productivity by empowering your fellow engineers and teammates with excellent data tooling and systems.
- Collaborate with product, research, and analytics teams to build the technical foundations and capabilities that unlock new features and experiences.
- Own the reliability of the systems you build, including participation in an on-call rotation for critical incidents.
Requirements
- 4+ years in data infrastructure engineering OR 4+ years in infrastructure engineering with a strong interest in data.
- Experience supporting Spark, Kafka, Flink, Airflow, Trino, or Iceberg as platforms.
- Well-versed in infrastructure tooling like Terraform.
- Experienced in debugging large-scale distributed systems.
Nice-to-Haves
- Comfortable with ambiguity and rapid change.
- Intrinsic desire to learn and fill in missing skills, and talent for sharing learnings clearly.
Staff Data Platform Engineer
Builds and leads AWS-native data platform architecture, orchestration, governance, and AI-readiness for analytics and ML workloads. Requires 8-10+ years of experience with AWS data systems and strong technical leadership.
Manager, Data Engineering
Leads and mentors a team of data engineers building scalable data pipelines and platform infrastructure. Involves hands-on coding, operational excellence, and cross-functional collaboration with analytics, data science, and business teams.
Senior Data Engineer, People Analytics
Build and maintain data pipelines, tables, and AI-ready data foundations from HR systems to power People Analytics reporting, dashboards, and LLM tools. Requires 5+ years of data engineering experience with strong SQL, Python, Airflow, and data governance skills.