Skip to content

Senior Software Engineer, Strategy Research Analytics

Leads design and evolution of analytics infrastructure for research reporting, owning pipelines, datasets, and platform standardization. Collaborates with data scientists using Python, SQL, distributed engines like Presto/Spark. Requires 6+ years in data infrastructure.

200k – 255kBerkeley, CANew York, NYData EngineeringRemote6+ YOE

About the role

Responsibilities

  • Own implementation and on-going operation of recurring analytics pipelines (e.g., Airflow DAGs) including monitoring, alerting, and reliability improvements
  • Lead architectural evolution of the analytics platform, including schema standardization, DAG consolidation, and modernization of legacy workflows
  • Drive cross-team technical alignment when consolidating duplicated or inconsistent analytics outputs
  • Build and maintain base analytics tables and metrics with strong schema discipline and reproducible computation
  • Define and implement reliability standards (SLOs, observability patterns, runbooks) adopted across analytics pipelines
  • Improve transparency and usability through documentation, discoverability, and clear data contracts
  • Optimize distributed compute and SQL query performance; design data layouts (partitioning, file sizing) for columnar storage

Requirements

  • Bachelor's degree in Computer Science or equivalent professional experience
  • 6+ years of experience building and operating analytics or data infrastructure systems
  • Strong proficiency in Python and SQL
  • Deep experience with distributed query engines and large-scale compute systems
  • Demonstrated ownership of large-scale or mission-critical data infrastructure
  • Strong data modeling expertise, including schema design, partitioning strategy, and reproducibility considerations
  • Expertise in metadata management, data lineage, and applying robust data governance principles

Preferred Qualifications

  • Experience leading architectural migrations or major refactors of data platforms
  • Familiarity with AWS cloud technologies and on-prem compute clusters (e.g., Slurm, SSH, Unix)
  • Exposure to quantitative research or machine learning environments

Skills

PythonSQLAirflowPrestoSparkParquetOrcAWSData ModelingData LineageMetadata ManagementSlurm

Senior Software Engineer, Data Infrastructure

Decagon is seeking a Senior Data Infrastructure Engineer to design, build, and operate data systems for their AI products. This role involves owning critical data pipelines and storage layers, improving reliability and performance, and creating paved paths for engineers to work with data at scale.

200k – 400kNew York, NYData EngineeringOn-site5+ YOEdbtKafka

Senior Software Engineer, Data Infrastructure

As a Senior Data Infrastructure Engineer, you will design, build, and operate data systems for Decagon's AI products, owning critical data pipelines and storage layers. You will improve reliability and performance, and enable engineers to work with data at scale.

200k – 400kSan Francisco, CAData EngineeringOn-site5+ YOEdbtKafka

Senior Software Engineer I

Builds and maintains a high-quality data platform delivering Forge data to clients, implementing features with agile methodologies. Requires 3-5+ years in TypeScript/C#/Java/Python, React, modern data architectures, CI/CD, AWS, and SQL databases; hybrid in Soho, NY.

200k – 230kNew York, NYData EngineeringHybrid3+ YOEC#SQL

Senior Software Engineer I

Builds and maintains a high-quality data platform for delivering Forge's financial data to clients, implementing features with agile methods. Requires 3-5+ years in TypeScript/C#/Java/Python, React, modern data architectures, CI/CD, AWS, and SQL/PostgreSQL.

200k – 230kSan Francisco, CAData EngineeringHybrid3+ YOEC#SQL

Senior Software Engineer - Observe Data Management

Builds and scales petabyte-scale data ingestion pipelines for observability platform using Go/C++ on AWS/Azure. Requires 5+ years in distributed systems, strong systems programming, and cloud experience.

200k – 288kMenlo Park, CAData EngineeringHybrid5+ YOEGoC++