Skip to content

Senior Data Infrastructure Engineer

160k – 190kUnited StatesData EngineeringRemote5+ YOE
Summary

Own and evolve Voltus' data pipelines, orchestration, modeling, and governance to deliver reliable analytics and AI-ready infrastructure.

About the role

What You’ll Do

  • Own and maintain our data pipeline architectures (e.g., critical data ingestion services, ETL pipelines, database mirroring and warehousing), ensuring they are reliable, monitored, and meet SLAs.
  • Manage and evolve our data modeling environments and provide a smooth, well-documented workflow for analysts and engineers.
  • Operate and improve our orchestration systems (Dagster), ensuring jobs run reliably and are observable.
  • Evaluate and rationalize data tooling from Databricks and notebooks (Marimo, Jupyter) to BI/analytics platforms (Redash and alternatives) and guide Voltus toward a sustainable, coherent data platform.
  • Implement observability for data systems (logging, alerting, metrics) so issues are detected early and data quality is continuously monitored.
  • Champion data governance and documentation, making datasets well-defined, trustworthy, and easy to navigate.
  • Collaborate with analysts, data scientists, and platform engineers to ensure the infrastructure you build is intuitive, scalable, and solves real-world problems.
  • Lay the groundwork for advanced applications by making Voltus’ data reliably accessible via well-documented interfaces, positioning us to adapt to future ML and AI use cases.

Who You Are

  • Proven experience in a data engineering or infrastructure role, with responsibility for production-grade pipelines and data systems.
  • Skilled in a programming language such as Python (Bonus Go)
  • Deep experience with ETL/ELT pipelines, dbt, and integrating disparate data sources into warehouses/lakes.
  • Familiarity with cloud data platforms (AWS, GCP) and modern data tooling. We are running on AWS.
  • Experienced in workflow orchestration (Airflow, Dagster, or similar).
  • Comfortable evaluating tradeoffs across notebook and analysis platforms (Jupyter, Marimo, Databricks) and recommending sustainable solutions.
  • Knowledge of BI/analytics tools (Redash, Looker, Mode, Superset, etc.) and how to support or migrate to them.
  • Strong understanding of data quality, governance, and observability.
  • A clear communicator who can work across technical and non-technical teams to define requirements and deliver solutions.
  • Comfortable taking ownership end-to-end of critical data infrastructure and serving as a point person for reliability and direction.
  • Familiarity with observability/monitoring tools (e.g., Datadog, Prometheus).

Bonus Points

  • Experience and Familiarity with Delta Lake, Databricks, DuckDB
  • Exposure with LLM-based applications and toolchains (LangChain, LlamaIndex, lite-llm).
  • Experience with vector databases (Pinecone, Qdrant, Chroma).
Skills
PythonGodbtETLELTAWSGCPDagsterAirflowDatabricksJupyterRedashDelta LakeDuckDBLangChain
Similar roles at this salary range
All Data Engineering jobs →
Coast

Lead Data Engineer

Lead the data engineering team to build and evolve a modern cloud-native data platform, drive data culture, and deliver high-impact data products for risk, customer acquisition, and financial services.

170k – 220kNew York, NYData EngineeringOn-site7+ YOEETLELT
Discord

Software Engineer, Data Platform

Build and maintain data infrastructure processing petabytes of data. Own end-to-end projects for data ingestion, transformation, and serving systems. Requires 3+ years of software engineering experience.

160k – 200kUnited StatesData EngineeringOn-site3+ YOEGoSQL
Twilio

Staff Analytics Engineer

Design and maintain a robust business data layer in dbt to enable trusted GTM sales analytics, reporting, data science, and AI capabilities. Requires 8+ years in analytics engineering with advanced SQL and dbt expertise.

156k – 229kUnited StatesData EngineeringRemote8+ YOESQLdbt
11x

Data Engineer

Own and extend customer data ingestion platform and large-scale pipelines powering AI workers. Build data lake, retrieval layer, and infrastructure for syncing, enriching, and querying customer data across CRMs and third-party systems.

170k – 200kUnited StatesData EngineeringRemote4+ YOEPythonAirbyte
Reddit

Software Engineer - Data Movement Platform

Software engineer building and maintaining scalable data movement infrastructure using Spark, Flink, and Airflow to support ML and analytics workloads processing 100B+ daily events.

164k – 230kUnited StatesData EngineeringRemote2+ YOEGoJava