Senior Data Engineer
190k – 240k · United States · Data Engineering · Remote
Summary
Jellyfish is seeking a Senior Data Engineer to build, automate, and execute the next generation of its data platform. The role involves maintaining end-to-end data pipelines, modernizing orchestration, and automating data infrastructure.
About the role
What you’ll actually be doing:
- Pipeline Execution & Modeling – You’ll maintain our end-to-end data pipelines, writing clean, modular Python and SQL. You will help translate the architectural blueprint into reality, structuring data across our Medallion layers (Bronze > Silver > Gold) for maximum performance and reliability.
- Orchestration Modernization – You’ll take the lead on migrating, optimizing, and maintaining our workflow orchestration engines. You’ll eliminate pipeline bottlenecks, leverage modern fast-paths (like Pydantic v2 and async database clients), and ensure distributed tasks scale seamlessly without hitting API limits.
- Data CI/CD & Infrastructure Automation – You’ll build the "paved road" for data deployments. You’ll use Terraform to provision data resources and write automated tests to validate schemas and data quality before code ever hits our isolated staging or production catalogs.
- API & Caching Integration – You’ll collaborate with product developers to expose data safely. You’ll help design and optimize the application backend tiers, backend-for-frontend (BFF) layers, and Redis caching structures that protect our core data warehouse from frontend concurrency spikes.
- On-Call & Observability Triage – You’ll participate in the data platform's incident response rotation. You won't just patch a failing pipeline; you’ll build deep observability, refine alerts to reduce noise, and write programmatic fixes to ensure the issue never happens again.
You’re a great fit if:
- Data Engineering Fluency – You have solid, production-level experience with Python, advanced SQL, and data transformation frameworks (like dbt or PySpark). You are highly comfortable working with programmatic orchestrators (such as Prefect, Dagster, or Airflow).
- Warehouse & Catalog Practitioner – You know your way around enterprise data platforms (e.g., Snowflake, Databricks, BigQuery). You understand how to safely navigate environment boundaries, manage access keys securely, and write performant queries that don't balloon the cloud bill.
- Automation Mindset – You look at a repeated data backfill, a manual schema fix, or an untracked data quality bug and immediately think about how to script a permanent, automated solution.
- Collaborative Builder – You love working in a team. You write readable code, value thorough documentation and clear data lineage, and enjoy collaborating with application engineers to solve complex data delivery problems.
- Pragmatic Problem Solver – You know when to write a perfectly optimized distributed processing job and when a simple, well-indexed database table or cached view is the smartest move to keep the business moving.
Bonus Points:
- You’ve survived (and thrived in) a rapidly scaling startup handling complex, multi-tenant B2B SaaS data.
- You have strong opinions on data quality testing frameworks (like Great Expectations or Soda) and data-observability patterns.
- You’ve worked extensively with cloud cost allocation or tracked token-level spend for LLM/AI model integrations.
Skills
Python, SQL, dbt, PySpark, Prefect, Dagster, Airflow, Snowflake, Databricks, BigQuery, Terraform, Redis