Clinical Data Engineer

Builds and maintains ETL pipelines for clinical studies with biometric sensor data, validates hardware algorithms through statistical analysis, and develops tools for data visualization and cross-team collaboration at a sleep tech company.

110k – 130kBoston, MAData EngineeringHybrid2+ YOE

Apply

About the role

Data Engineering & Infrastructure

Build and maintain scalable ETL pipelines using Python, SQL, and APIs to ingest and process large-scale biometric and sensor data
Design data models and workflows that support clinical studies, internal tools, and downstream analytics
Manage data storage, retrieval, and archival systems in AWS, including handling long-term access and data restore workflows
Ensure data integrity, reproducibility, and proper versioning across evolving datasets and analyses
Leverage AI-assisted tools to accelerate data analysis, debugging, and code development, improving iteration speed and reducing manual effort

Clinical Analytics & Algorithm Validation

Analyze sleep, physiological, and behavioral datasets to evaluate product performance and validate new features
Perform statistical analyses (e.g., correlation, error metrics, bootstrapping, validation frameworks) to assess algorithm accuracy and clinical outcomes
Develop evaluation pipelines for metrics like HR/HRV accuracy, presence detection, and sleep staging
Build tools and structured datasets to support training and validation of machine learning models, integrating multiple data sources for supervised learning
Investigate edge cases, sensor issues, and data anomalies to improve model robustness

Internal Tooling & Visualization

Maintain and extend Python-based applications for visualizing and annotating biometric data
Develop interactive tools for researchers and engineers to inspect sessions, validate signals, and debug algorithms
Streamline workflows for clinical teams to reduce manual effort and improve reproducibility

Cross-Functional Collaboration & Communication

Partner with Machine Learning, Hardware, Firmware, and Product teams to build algorithms and test prototypes
Work with Growth and Product teams to explore user behavior and inform feature development
Synthesize findings into reports, dashboards, and presentations for internal teams and external audiences
Contribute to abstracts, posters, and conference presentations; communicate uncertainty, methodology, and tradeoffs clearly to guide decision-making

What You’ll Need to Succeed

2+ years of data engineering experience with health/physiology data in a research context — you’ve built ETL pipelines around messy, real-world biometric or sensor datasets, not just clean CSVs
Advanced Python and SQL proficiency — Pandas, NumPy, time-series analysis, and production-quality scripting are daily tools, not occasional ones
Intermediate-to-advanced signal processing and biometric data experience — you’ve worked directly with heart rate, HRV, sleep staging, or similar physiological signals from wearable or embedded sensors
Intermediate-to-advanced statistical modeling and validation skills — you can design and execute correlation analyses, error metrics, bootstrapping, and validation frameworks independently
Working proficiency with AWS and Snowflake — you’ve built or maintained cloud-based data storage, retrieval, and archival systems, not just queried them

Bonus Points

Experience with clinical or regulatory trial data, familiarity with GCP/ICH guidelines, or prior work supporting FDA submissions
Background in ML model validation or building structured training datasets for supervised learning
Fluency with AI-assisted development tools (Claude, Cursor, ChatGPT, Copilot) as part of your daily workflow
Domain knowledge in sleep science, biometrics, or wearable/embedded sensor data
Experience integrating internal and third-party APIs into unified data pipelines
Strong cross-functional communication skills — ability to translate complex analyses into clear insights for non-technical stakeholders

Compensation Target Base Salary: $110,000 – $130,000. Compensation is based on experience, qualifications, and market benchmarks for the Boston metro area. Equity and performance-based incentives are a significant component of total compensation.

Skills

PythonSQLpandasNumPyAWSSnowflakeETLSignal ProcessingTime Series AnalysisMachine LearningAPIsStatistical Modeling

Similar roles

Data Engineering jobs

PrizePicks

Data Engineer

Entry-level Data Engineer building and maintaining data pipelines on a medallion lakehouse architecture using Python and SQL. Works under guidance on ingestion, transformation, orchestration, and monitoring tasks.

105k – 140kUnited StatesData EngineeringRemoteEntry levelSQLGCP

GovWell

Data Solutions Engineer

Owns end-to-end customer data migrations and GIS onboarding for government clients, ensuring data quality by mapping schemas, validating, and troubleshooting legacy data using SQL and ETL tools. Requires 2-3+ years experience with strong data instincts and customer-facing skills.

100k – 145kNew York, NYData EngineeringHybrid2+ YOESQLHex

Garner Health

Data Engineer II

Builds, optimizes, and maintains scalable data pipelines using Python, SQL, and modern data stack in AWS to support BI, marketing, and data science. Requires 2+ years experience, expertise in data modeling and orchestration tools like Airflow and Snowflake.

125k – 165kNew York, NYData EngineeringHybrid2+ YOESQLAWS

Talkiatry

Data Engineer

As a Data Engineer, you will build and maintain data pipelines, dbt models, and infrastructure on AWS and Snowflake. You will partner with BI/Analytics Engineering, take operational responsibility, and mentor junior team members.

130k – 145kUnited StatesData EngineeringRemote2+ YOESQLAWS

Attentive

Software Engineer II, Customer Lifecycle Engineering

Software Engineer building data models, pipelines, and systems for measuring, monetizing, and optimizing customer lifecycle at an AI marketing platform.

130k – 170kUnited StatesData EngineeringRemote2+ YOESQLETL