Data & Semantic Model Architect

Designs and owns Common Data Models and semantic layers for scientific data interoperability in life sciences. Translates business goals into ontologies, ensures FAIR data for AI/ML, and empowers forward-deployed engineers with standardized contracts. Requires 7+ years in data architecture and expertise in CDM frameworks such as HL7 FHIR and OMOP.

United States · Data Engineering · Remote · 7+ YOE

About the role

Responsibilities

Common Data Model & Exchange Strategy

  • Architect the Exchange Layer: Design and own the Common Data Models (CDMs) that serve as the universal language for scientific data across our customer base. Move the platform from bespoke, one-off mappings to a standardized "exchange layer" that ensures interoperability.
  • Empower Forward Deployed Engineering: Create the data contracts and standardized definitions that FDEs rely on. Your models will be the toolkit that allows them to deploy faster and with higher confidence.
  • Standardization vs. Flexibility: Strike the strategic balance between rigid global standards (for cross-customer exchange) and local flexibility. Define the core "immutable" aspects of the model versus where extension is permitted.

Semantic Architecture & Implementation

  • The "Forest" – Business Alignment: Translate high-level business goals into concrete data modeling strategies. Ensure our semantic roadmap directly supports the scientific questions our customers need to answer.
  • The "Trees" – Hands-on Modeling: Design and implement complex ontologies and taxonomies. Model intricate scientific relationships with precision.
  • Software & Data Engineering Integration: Work directly with Engineering to architect the software systems that consume these models. Ensure that the ontology does not break query performance or system scalability.

Cross-Functional Leadership & Governance

  • Data Contracts & Governance: Establish the "rules of the road" for data quality and consistency. Define how data contracts are versioned, enforced, and evolved.
  • Scientific Translation: Partner with Scientific Business Analysts to decode the complexity of biopharma R&D. Turn ambiguous scientific requirements into rigorous, machine-readable data structures.
  • Interoperability: Architect models that ensure our data is FAIR (Findable, Accessible, Interoperable, Reusable) and ready for downstream AI/ML applications.

Skills & Competencies

  • Common Data Model Expertise: Proven ability to design shared data models that serve as an exchange format between different systems or organizations.
  • Data Contract Design: Experience defining and enforcing data contracts in a microservices or platform environment.
  • Architectural Versatility: The ability to switch context effortlessly between high-level system design and low-level entity relationship modeling.
  • Semantic Fluency: Deep, hands-on expertise with semantic web standards (RDF, OWL, SHACL, SPARQL) and property graph concepts (LPG).

Requirements

  • 7+ years of experience in data architecture, informatics, or technical product leadership, specifically within life sciences, healthcare, or manufacturing technology, or a demonstrated ability to unify complex, multi-domain data models and semantic layers.
  • CDM Framework Expertise: Direct, hands-on experience implementing and extending Common Data Model frameworks such as HL7 FHIR, OMOP (OHDSI), Allotrope, or CDISC.
  • Terminology & Standardization: Proven mastery in standardizing messy, heterogeneous data using both standard vocabularies (such as terminology standards and ontologies) and proprietary or custom vocabularies.
  • Platform & Exchange Experience: Experience building data platforms where standardization and reusability were key value drivers.
  • Technical Background: Strong proficiency in software development concepts; comfortable reading code, understanding API contracts, and discussing database internals.
  • Education: Bachelor's or Master's in a relevant field (e.g., Medical Informatics, Computer Science, Bioinformatics, Physics).

Skills

HL7 FHIR · OMOP · Allotrope · CDISC · RDF · OWL · SHACL · SPARQL · Semantic Web · Property Graph · Data Modeling · Ontologies · Taxonomies · Data Contracts · Common Data Model

Senior Data Engineer

Senior Data Engineer building ETL pipelines, data processing systems, and Lakehouse architecture on AWS to map decision-maker networks. Requires 6+ years experience, expert SQL, Spark, and data warehousing tools.

160k – 200k · New York, NY · Data Engineering · Hybrid · 6+ YOE · ETL · SQL

Senior Software Engineer, Data Products

Senior engineer building performant user-facing data products from internal datasets using Python, Databricks, and Postgres while collaborating with platform teams.

165k – 235k · United States · Data Engineering · Remote · 5+ YOE · SQL · Python

Sr. Software Engineer

Design and build scalable big data systems and ETL pipelines using Spark, Kafka, Hive and related technologies. Requires strong data modeling, SQL, and experience with AI coding assistants.

179k – 263k · San Francisco, CA · Data Engineering · Remote · 5+ YOE · ETL · SQL

Senior Software Engineer - Data Platform

Senior engineer designing and owning scalable data platform infrastructure, ETL/ELT pipelines, and data products that power analytics and operations at MNTN. Requires 5+ years building distributed data systems with Python/Java/Go, Spark, Airflow, and cloud platforms.

United States · Data Engineering · Remote · 5+ YOE · Go · SQL

Senior Data Engineer, People Analytics

Build and maintain data pipelines, tables, and AI-ready data foundations from HR systems (Workday, Greenhouse) to power People Analytics reporting, dashboards, and LLM tools. Requires 5+ years of data engineering experience with strong SQL, Python, Airflow, and data governance skills.

179k – 210k · United States · Data Engineering · Remote · 5+ YOE · SQL · AWS