Skip to content

Software Engineer, Data Platform

Designs and implements scalable data models, pipelines, and lakehouse infrastructure using Snowflake and Clickhouse to support analytics, ML, and products. Requires 5+ years data engineering experience, SQL/Python expertise, and leadership in data governance.

200k – 236kSan Francisco, CAData EngineeringHybrid5+ YOE

About the role

What You’ll Do

  • Design and implement core data models and pipelines that power analytics, ML, and product experiences
  • Implement modern data lake orchestration patterns, including medallion architectures
  • Architect and evolve a scalable, cost-efficient, and reliable lakehouse foundation using Snowflake, Clickhouse, and orchestration tools
  • Define best practices and technical standards that improve data quality, governance, and performance across teams
  • Mentor engineers and foster a culture of ownership, operational excellence, and continuous learning
  • Shape the long-term technical vision and roadmap for GlossGenius' data platform

What We’re Looking For

  • 5+ years of experience in data engineering, with a strong background in data architecture, data modeling, and distributed data systems
  • Deep expertise in modern lakehouse technologies such as Snowflake and Clickhouse
  • Experience implementing modern data orchestration patterns for big data use-cases, including batch and streaming workloads
  • Advanced proficiency in SQL and Python or Scala, including performance optimization and large-scale ETL design
  • Familiarity with AI/ML workflows and the data infrastructure that supports them
  • Demonstrated ability to lead technical initiatives, set standards, and influence decisions across teams
  • Comfort owning systems end-to-end, including monitoring, reliability, and cost management
  • Excellent communication skills with the ability to translate technical trade-offs to both engineers and non-technical stakeholders

Benefits & Perks

  • Flexible PTO
  • Competitive health & dental insurance options, with premiums partially or fully covered by GG
  • In-person opportunities that are designed to help team members foster collaboration and build community
  • Fertility and adoption benefits via Carrot
  • Generous, fully-paid parental leave policy
  • 401k benefit - employees are eligible to contribute starting day 1 of employment
  • Professional Development - employees receive a yearly stipend for approved learning and educational-related expenses
  • Pre-tax commuter benefits
  • Dependent Care FSA
  • Home office support

Starting base salary in California: $200,000 - $236,000 + target equity + benefits.

Skills

SnowflakeClickHouseSQLPythonScalaETLData PipelinesData OrchestrationMedallion ArchitectureAi/Ml Workflows

Lead Engineer (Data/Integrations)

Leads architecture and implementation of healthcare payer data integrations (EDI X12, HL7/FHIR) and scalable pipelines for AI/ML systems. Requires 5+ years data engineering experience with payer data and team leadership.

200k – 250kSan Francisco, CAData EngineeringHybrid5+ YOES3AWS

Software Engineer, Data Platform

Builds and maintains data governance frameworks, manages RBAC in Snowflake, optimizes data models with dbt, and ensures data quality, security, and audit readiness. Requires 3-5 years in data engineering with SQL and modern data tools.

194k – 215kBoston, MA +1Data EngineeringOn-site3+ YOESQLdbt

Data Engineer

Seeking the first Data Engineer to architect ETL pipelines, manage a central data lake, and drive data governance for an AI platform serving top financial institutions. Requires 5+ years of data engineering experience with Python, SQL, and big data tools.

190k – 250kNew York, NY +1Data EngineeringOn-site5+ YOESQLETL

Software Engineer, Data Infrastructure

Architect and build foundational data infrastructure for massive simulation outputs. Design novel data models and high-throughput pipelines to feed LLMs with structured context from complex, state-based environments.

186k – 233kNew York, NY +1Data EngineeringOn-site5+ YOEGoC++

Data Engineer

Builds and optimizes scalable data pipelines, storage, and OLAP databases for ML training, analytics, and product features. Requires 5+ years in data engineering, proficiency in Python/SQL/cloud platforms, and distributed systems experience.

185k – 217kSan Francisco, CAData EngineeringHybrid5+ YOESQLGCP