Skip to content

Clinical Data Engineer

Builds and maintains ETL pipelines for clinical studies with biometric sensor data, validates hardware algorithms through statistical analysis, and develops tools for data visualization and cross-team collaboration at a sleep tech company.

110k – 130kBoston, MAData EngineeringHybrid2+ YOE

About the role

Data Engineering & Infrastructure

  • Build and maintain scalable ETL pipelines using Python, SQL, and APIs to ingest and process large-scale biometric and sensor data
  • Design data models and workflows that support clinical studies, internal tools, and downstream analytics
  • Manage data storage, retrieval, and archival systems in AWS, including handling long-term access and data restore workflows
  • Ensure data integrity, reproducibility, and proper versioning across evolving datasets and analyses
  • Leverage AI-assisted tools to accelerate data analysis, debugging, and code development, improving iteration speed and reducing manual effort

Clinical Analytics & Algorithm Validation

  • Analyze sleep, physiological, and behavioral datasets to evaluate product performance and validate new features
  • Perform statistical analyses (e.g., correlation, error metrics, bootstrapping, validation frameworks) to assess algorithm accuracy and clinical outcomes
  • Develop evaluation pipelines for metrics like HR/HRV accuracy, presence detection, and sleep staging
  • Build tools and structured datasets to support training and validation of machine learning models, integrating multiple data sources for supervised learning
  • Investigate edge cases, sensor issues, and data anomalies to improve model robustness

Internal Tooling & Visualization

  • Maintain and extend Python-based applications for visualizing and annotating biometric data
  • Develop interactive tools for researchers and engineers to inspect sessions, validate signals, and debug algorithms
  • Streamline workflows for clinical teams to reduce manual effort and improve reproducibility

Cross-Functional Collaboration & Communication

  • Partner with Machine Learning, Hardware, Firmware, and Product teams to build algorithms and test prototypes
  • Work with Growth and Product teams to explore user behavior and inform feature development
  • Synthesize findings into reports, dashboards, and presentations for internal teams and external audiences
  • Contribute to abstracts, posters, and conference presentations; communicate uncertainty, methodology, and tradeoffs clearly to guide decision-making

What You’ll Need to Succeed

  • 2+ years of data engineering experience with health/physiology data in a research context — you’ve built ETL pipelines around messy, real-world biometric or sensor datasets, not just clean CSVs
  • Advanced Python and SQL proficiency — Pandas, NumPy, time-series analysis, and production-quality scripting are daily tools, not occasional ones
  • Intermediate-to-advanced signal processing and biometric data experience — you’ve worked directly with heart rate, HRV, sleep staging, or similar physiological signals from wearable or embedded sensors
  • Intermediate-to-advanced statistical modeling and validation skills — you can design and execute correlation analyses, error metrics, bootstrapping, and validation frameworks independently
  • Working proficiency with AWS and Snowflake — you’ve built or maintained cloud-based data storage, retrieval, and archival systems, not just queried them

Bonus Points

  • Experience with clinical or regulatory trial data, familiarity with GCP/ICH guidelines, or prior work supporting FDA submissions
  • Background in ML model validation or building structured training datasets for supervised learning
  • Fluency with AI-assisted development tools (Claude, Cursor, ChatGPT, Copilot) as part of your daily workflow
  • Domain knowledge in sleep science, biometrics, or wearable/embedded sensor data
  • Experience integrating internal and third-party APIs into unified data pipelines
  • Strong cross-functional communication skills — ability to translate complex analyses into clear insights for non-technical stakeholders

Compensation Target Base Salary: $110,000 – $130,000. Compensation is based on experience, qualifications, and market benchmarks for the Boston metro area. Equity and performance-based incentives are a significant component of total compensation.

Skills

PythonSQLpandasNumPyAWSSnowflakeETLSignal ProcessingTime Series AnalysisMachine LearningAPIsStatistical Modeling

Data Engineer

Entry-level Data Engineer building and maintaining data pipelines on a medallion lakehouse architecture using Python and SQL. Works under guidance on ingestion, transformation, orchestration, and monitoring tasks.

105k – 140kUnited StatesData EngineeringRemoteEntry levelSQLGCP

Data Solutions Engineer

Owns end-to-end customer data migrations and GIS onboarding for government clients, ensuring data quality by mapping schemas, validating, and troubleshooting legacy data using SQL and ETL tools. Requires 2-3+ years experience with strong data instincts and customer-facing skills.

100k – 145kNew York, NYData EngineeringHybrid2+ YOESQLHex

Data Engineer II

Builds, optimizes, and maintains scalable data pipelines using Python, SQL, and modern data stack in AWS to support BI, marketing, and data science. Requires 2+ years experience, expertise in data modeling and orchestration tools like Airflow and Snowflake.

125k – 165kNew York, NYData EngineeringHybrid2+ YOESQLAWS

Data Engineer

As a Data Engineer, you will build and maintain data pipelines, dbt models, and infrastructure on AWS and Snowflake. You will partner with BI/Analytics Engineering, take operational responsibility, and mentor junior team members.

130k – 145kUnited StatesData EngineeringRemote2+ YOESQLAWS

Software Engineer II, Customer Lifecycle Engineering

Software Engineer building data models, pipelines, and systems for measuring, monetizing, and optimizing customer lifecycle at an AI marketing platform.

130k – 170kUnited StatesData EngineeringRemote2+ YOESQLETL