Skip to content

Software Engineer, Data Platform

Build and operate the identity data platform that ingests, transforms, and serves high-volume identity data to power all Lumos products. Own ingestion pipelines, service layers, APIs, and observability for correctness and reliability.

170k – 220kUnited StatesData EngineeringRemote3+ YOE

About the role

Responsibilities

  • Design, develop, and operate systems that transform high volumes of identity data from third-party integrations to product consumers with correctness, freshness, and operational rigor.
  • Build the shared primitives and interfaces (APIs, services, materialized models) that abstract raw identity data into the building blocks that power our products.
  • Contribute to the technical vision for a world-class data infrastructure that empowers engineering, product, and AI teams, enabling seamless access to high-quality data.
  • Establish SLOs, observability, and operational tooling that catch and remediate failures in identity data systems before they reach customers.
  • Promote and implement software engineering best practices for building scalable, reliable, and secure data-centric applications.

Requirements

  • 3-7 years of experience as a backend or platform engineer building production data systems other teams depend on, either ingestion and sync pipelines (Dagster, Airflow, or comparable orchestration) or service layers in front of a transactional database (MySQL/Postgres) that abstract storage from internal consumers.
  • Strong backend development skills in Python, Go, or TypeScript, with a focus on clean API design, testability, and observability.
  • Strong instinct for data correctness, observability, and SLOs in systems where downstream products take action on the output.
  • Experience designing service or API interfaces over a transactional datastore (MySQL/Postgres) defining contracts that let producers and consumers evolve independently.
  • Familiarity with identity and access governance (IGA) data (e.g. identities, accounts, entitlements, group memberships) and how its correctness, freshness, and traceability shape downstream governance outcomes at scale.

Skills

PythonGoTypeScriptMySQLPostgresDagsterAirflowAPI DesignObservabilitySLOs

Data Engineer, Machine Learning

Build and maintain production data pipelines that prepare conversational, voice, and multimodal data for ML model training and evaluation. Partner closely with ML engineers to deliver high-quality, versioned datasets and infrastructure.

170k – 240kSan Francisco, CAData EngineeringOn-site5+ YOESQLETL

Data Engineer

Own and extend customer data ingestion platform and large-scale pipelines powering AI workers. Build data lake, retrieval layer, and infrastructure for syncing, enriching, and querying customer data across CRMs and third-party systems.

170k – 200kUnited StatesData EngineeringRemote4+ YOEPythonAirbyte

Lead Data Engineer

Leads design and implementation of scalable data pipelines using dbt, Airflow, and SQL. Mentors data engineers, optimizes AWS infrastructure with Terraform, and builds integrations with MySQL/SQL Server/OLAP for analytics. Requires strong SQL/Python and production pipeline expertise.

170k – 210kNew York, NYData EngineeringRemoteSQLAWS

Software Development Engineer - Data Acquisition & Normailization

Builds and maintains data connectors, pipelines, and normalization services to acquire and validate identity attributes from various sources for the Identity Trust Graph. Requires 4+ years in OOP languages, containerized cloud systems like GCP/Kubernetes, and data integration experience.

169k – 193kMountain View, CAData EngineeringOn-site4+ YOEGoGCP

Forward Deployed Data Engineer

Forward Deployed Data Engineer embedding with strategic enterprise accounts to design and deploy bespoke intelligence applications combining ZoomInfo's third-party data with customer first-party data. Own engagements end-to-end from discovery through production deployment.

172k – 270kUnited StatesData EngineeringRemoteSQLPython