Skip to content

Clinical Data Manager

Leads end-to-end clinical research data management, building scalable research databases, automating EHR-to-EDC integrations via APIs, ensuring data quality through QC and cleaning, and performing light statistical analysis. Requires 5+ years experience with SQL/Python, EHR datasets, and data standardization.

127k – 183kUnited StatesData EngineeringRemote5+ YOE

About the role

Key Responsibilities

Research Database Build & Full Data Management

  • Configure/maintain EDC forms and eCRF specifications as needed to support ingestion and downstream analysis, focusing on scalability and standardization.
  • Own end-to-end Data Management for Evidence Generation studies: build, validate, and maintain research databases from ingest → cleaning → QC → data lock.
  • Develop and run data cleaning workflows (queries, reconciliation, audit trails), and ensure inspection-ready documentation.

EHR → EDC Auto-Import & Data Model Strategy

  • Design and operate data model strategy for automated ingestion of EHR-level RWE into the EDC.
  • Work hands-on with APIs/integration endpoints and unify disparate data models (site/EHR variability, mapping logic, versioning, schema evolution).

EHR Systems Fluency / Site Data Reality

  • Partner with site IT/informatics and the Product team to understand EHR constraints, extract structures, and change management.
  • Translate EHR data realities into feasible study data capture and monitoring plans.

Collateral, SOPs, and Enablement

  • Create and maintain DMPs, SAPs, SOPs, runbooks, and training materials that operationalize the above processes across care pathways.

Statistical Analysis, as needed

  • Perform statistical analyses on cleaned datasets (descriptive, comparative, time-to-event where appropriate) and support evidence packages and reporting.

Qualifications

Required

  • 5–7+ years in clinical research/RWE data management with hands-on database build + cleaning/QC ownership.
  • SQL + Python (or R) for data transformation, QC checks, and reproducible pipelines.
  • Experience with EHR or EHR-derived datasets and understanding of common structures/coding systems (ICD-10, CPT, LOINC, RxNorm preferred).
  • Practical familiarity with API-based ingestion and integrating multiple data sources/models.
  • Experience building/owning DMPs/SOPs/runbooks and maintaining audit-ready documentation.
  • Familiarity with integrating leading AI techniques into your work product.

Preferred

  • Experience with EDC platforms (Medidata Rave, REDCap, Castor, Veeva, etc.).
  • CDISC familiarity (SDTM/ADaM) or strong equivalent standardization experience.
  • Stats experience in real-world/implementation studies (propensity methods, time-to-event, mixed models).

Skills

SQLPythonRAPIsEdcEhrMedidata RaveRedcapVeevaCdiscIcd-10CptLoincRxnormSas

Bioinformatics Engineer

Develop and optimize Nextflow-based bioinformatics pipelines for high-throughput sequencing analysis on Google Cloud Platform. Requires 3+ years of production pipeline experience, Nextflow proficiency, and strong genomics analysis skills.

125k – 150kRockville, MDData EngineeringOn-site3+ YOERGCP

Data & ML Pipeline Software Engineer

Builds large-scale data processing pipelines and ML infrastructure to automate data curation, model training, and iteration for autonomous vehicles using real-world and simulation data. Requires 3-5 years experience in data/ML infra, Python, and frameworks like Spark/Airflow/Kafka.

125k – 222kSunnyvale, CAData EngineeringOn-site3+ YOEETLSpark

Analytics Engineer

Builds and maintains scalable data pipelines, models, and warehouses to enable business intelligence and decision-making. Collaborates with teams on data integration, governance, and self-serve analytics using SQL, dbt, Python, and BI tools. Requires 3+ years experience.

125k – 215kNew York, NYData EngineeringHybrid3+ YOESQLdbt

Data Engineer

Builds and maintains scalable data pipelines for ingestion, transformation, and reliability using SQL, Python, dbt, and Fivetran to support data science, engineering, and product teams. Requires proven data engineering experience with modern data stack tools.

130k – 500kSan Francisco, CA +1Data EngineeringOn-siteSQLdbt

Software Engineer II, Big Data, tvScientific

Design and implement scalable data infrastructure using Spark/Scala on AWS. Build data pipelines, knowledge graphs, and APIs to support a high-growth CTV advertising platform.

124k – 255kSan Francisco, CAData EngineeringRemoteAWSSQL