Skip to content

Data Engineer

Builds and maintains reliable data pipelines and infrastructure using SQL, Python, Airflow, and Redshift to support analytics for Risk, Marketing, Finance stakeholders. Leads projects, reviews code, and scales platform as business grows.

Philadelphia, PAData EngineeringOnsite

About the role

Responsibilities

First Week:

  • Introduce yourself on Slack and meet your team.
  • Get oriented in the warehouse, walk through most-used pipelines, read codebase to ask useful questions.
  • Ship your first commit to production (bug fix, documentation update, or small improvement).

First Month:

  • Get fluent in Airflow setup, Redshift conventions, and DataHub governance tooling.
  • Complete first end-to-end pipeline or pipeline change in collaboration with a downstream team (Risk, Marketing, Finance, etc.).
  • Start reviewing PRs from other data engineers, providing pragmatic feedback.

First Three Months:

  • Be technical lead on a meaningful project from design through implementation, testing, and rollout.
  • Participate in architecture discussions and contribute to platform direction.
  • Build stakeholder trust so they come to you directly for data issues.

First Year:

  • Own a slice of infrastructure roadmap as product scales.
  • Become go-to expert on one or more platform areas.
  • Help hire and onboard new data engineers.
  • Enable teammates to ship projects independently on your services.

Requirements

  • Experience with healthy data engineering culture, data quality, pipeline reliability, stakeholder communication.
  • Led data pipeline or platform design decisions, including trade-offs.
  • Comfort across SQL, Python, orchestration, and infrastructure-as-code.
  • Favorite SQL patterns, modeling approaches, or data craft.

What You'll Learn

  • Perpay's payroll-deduction business model and data implications.
  • Building reliable data products: scoping, modeling, code review, testing, observability, lineage.
  • Data team roadmap and business alignment.
  • Stakeholders across Risk, Commerce, Marketing, Ops, Finance.

Compensation & Benefits

  • Meaningful compensation and equity.
  • Premium medical benefits (fully paid base plan).
  • 4% employer 401k match.
  • Unlimited PTO.
  • Remote weeks around major holidays + extra holidays.
  • High quality catered lunch 4 days/week.
  • Gym subsidy.
  • Paid cell phone + plan.
  • Student loan repayment.
  • Relocation assistance.
  • Generous team member discounts.

Skills

SQLPythonAirflowRedshiftDatahubData PipelinesInfrastructure As CodeData ModelingObservabilityData Lineage

Healthcare Data Analyst

Create advanced SQL/Spark SQL queries and prompt-engineered LLM workflows to transform healthcare claims data into clinical insights and automated policy tools. Requires 3-5 years SQL plus 2-3 years healthcare experience.

140k – 170kUnited StatesData EngineeringRemote3+ YOESQLClaude

Analytics Engineer

Build and maintain data models, pipelines, and dashboards that power customer experience and compliance operations. Partner with CX and compliance teams to deliver trusted, self-serve analytics.

152k – 179kUnited StatesData EngineeringRemote3+ YOESQLdbt

Data Engineer

Senior Data Engineer building scalable data pipelines and infrastructure on AWS using Spark, Metaflow, and container orchestration. Requires 5+ years of experience designing distributed data systems.

145k – 190kUnited StatesData EngineeringRemote5+ YOEAWSSQL

Software Engineer, Sensor Integration

Build and maintain ingestion pipelines that convert large-scale geospatial sensor data (LiDAR, imagery) into standardized formats for ML training and product use. Requires strong Python skills, comfort with undocumented formats, and distributed systems experience.

San Francisco, CAData EngineeringHybridC++Gdal

Data Engineer

Design, build, and maintain data pipelines for biomedical and clinical research datasets. Work with scientists and researchers to deliver accessible, well-governed data products using Python, SQL, and ETL/ELT processes.

Rockville, MDData EngineeringOn-siteSQLETL