Scientific Data Architect

Design and implement scientific data models and AI/ML solutions for biopharma customers. Engage onsite with scientists in the Indianapolis area to translate complex data workflows into scalable cloud-based applications.

Indianapolis, IN | Data Engineering | Hybrid | 8+ YOE

About the role

What You Will Do

  • Engage directly with customers onsite a couple of days per week in the Indianapolis area, building strong relationships, deeply understanding their scientific data challenges and requirements, and accelerating solutions.
  • Design and implement extensible, reusable data models that efficiently capture and organize scientific data for customer use cases, ensuring scalability and future adaptability.
  • Translate scientific data workflows into robust solutions leveraging the Tetra Data Platform.
  • Own, scope, prototype, and implement solutions including:
    • Data model design (tabular & JSON)
    • Python-based parser development
    • Lab software (e.g., ELN/LIMS) integration via APIs
    • Data visualization and app development in Python (using app frameworks like Streamlit and plotting tools like HoloViews and Plotly)
  • Collaborate with Scientific Business Analysts (SBAs), customer scientists, and applied AI engineers to develop and deploy models (ML, AI, mechanistic, statistical, hybrid).
  • Programmatically interrogate proprietary instrument output files.
  • Dynamically iterate with scientific end users and technical stakeholders to rapidly drive solution development and adoption through regular demos and meetings.
  • Proactively communicate implementation progress and deliver demos to customer stakeholders.
  • Collaborate with the product team to build and prioritize our roadmap by understanding customers’ pain points within and outside the Tetra Data Platform.
  • Rapidly learn new technologies (e.g., new AWS services or scientific analysis applications) to develop and troubleshoot use cases.

Requirements

  • PhD with 4+ years or Master's with 8+ years of industry experience in life sciences, with extensive domain knowledge in drug discovery (target ID through lead optimization), preclinical development, CMC (all drug modalities), or product quality testing.
  • Proven track record of defining, designing, prototyping, and implementing productized AI/ML-driven use cases in cloud environments.
  • Collaborated with cross-functional teams, including product managers, software engineers, and scientific stakeholders.
  • Performed extensive exploratory data analysis and workflow optimization to enable scientific outcomes not previously possible.
  • Engaged diverse audiences, from scientists to executive stakeholders, using excellent communication and storytelling abilities.
  • Advised scientists in a consulting capacity to further research, development, and quality testing outcomes.
  • Must be able to travel to client sites in the St. Louis, Indianapolis, and Chicago regions.

Nice-to-Haves / Benefits

  • Competitive salary and equity in a fast-growing company.
  • Supportive, team-oriented culture of continuous improvement.
  • Generous paid time off (PTO).
  • Flexible working arrangements: remote work when not at customer sites.

Skills

Python, Data Modeling, Streamlit, Plotly, AWS, ELN, LIMS, API Integrations, Machine Learning, Exploratory Data Analysis

Senior Data Engineer

Senior Data Engineer building ETL pipelines, data processing systems, and Lakehouse architecture on AWS to map decision-maker networks. Requires 6+ years of experience and expert skills in SQL, Spark, and data warehousing tools.

160k – 200k | New York, NY | Data Engineering | Hybrid | 6+ YOE | ETL | SQL

Senior Software Engineer, Data Products

Senior engineer building performant user-facing data products from internal datasets using Python, Databricks, and Postgres while collaborating with platform teams.

165k – 235k | United States | Data Engineering | Remote | 5+ YOE | SQL | Python

Sr. Software Engineer

Design and build scalable big data systems and ETL pipelines using Spark, Kafka, Hive, and related technologies. Requires strong data modeling and SQL skills, plus experience with AI coding assistants.

179k – 263k | San Francisco, CA | Data Engineering | Remote | 5+ YOE | ETL | SQL

Senior Software Engineer - Data Platform

Senior engineer designing and owning scalable data platform infrastructure, ETL/ELT pipelines, and data products that power analytics and operations at MNTN. Requires 5+ years building distributed data systems with Python/Java/Go, Spark, Airflow, and cloud platforms.

United States | Data Engineering | Remote | 5+ YOE | Go | SQL

Senior Data Engineer, People Analytics

Build and maintain data pipelines, tables, and AI-ready data foundations from HR systems (Workday, Greenhouse) to power People Analytics reporting, dashboards, and LLM tools. Requires 5+ years of data engineering experience with strong SQL, Python, Airflow, and data governance skills.

179k – 210k | United States | Data Engineering | Remote | 5+ YOE | SQL | AWS