Skip to content

Technical Data Delivery Lead

Leads architecture, execution, and improvement of data collection/evaluation pipelines for AI labs, including agentic automation for quality and delivery. Requires Python/SQL proficiency, LLM internals knowledge, and hands-on agent framework experience.

140k – 180kUnited StatesData EngineeringRemote

About the role

Responsibilities

Pipeline architecture

  • Design end-to-end data collection and evaluation pipelines for RLVR, RLHF, SFT, red-teaming, and model evaluation workflows.
  • Prototype novel workflows quickly, identify architectural risks, and make tradeoff decisions.
  • Understand agent-tool interactions and communicate engineering needs.

Agentic system deployment

  • Build, test, and iterate on AI agents for automating pipeline tasks like quality gate review, expert matching, output flagging, and anomaly detection.
  • Scope agent capabilities, write prompts and evaluation logic, monitor production performance.

Quality systems

  • Define data quality standards for annotation, evaluation, and expert output review.
  • Design and run audits using inter-rater reliability metrics, calibration sets, and statistical sampling.
  • Build preventive systems: automated checks, structured output validation, model-assisted review.
  • Spot-check tasks and translate findings into expert guidelines.

Client interface

  • Engage with AI researchers, TPMs, and PMs to translate requirements into workflows.
  • Communicate pipeline performance, escalate risks, contribute to scoping and pricing.

Research integration

  • Stay current with LLM post-training, evaluation methodology, and data tooling.
  • Evaluate and integrate new approaches like model-assisted annotation.

Requirements

  • Proficiency in Python and SQL for data manipulation, pipeline monitoring, and quality analysis.
  • Working knowledge of LLM internals: RLHF/SFT training loops, prompt structure effects, RL environment setup for agentic data collection/eval.
  • Hands-on experience with at least one agentic or LLM workflow framework (LangChain, DSPy, AutoGen, direct tool-use via API, or equivalent).
  • Demonstrated ownership of a data or ML pipeline from scoping through delivery, including quality design.
  • Strong written communication for technical guidelines, rubrics, and researcher briefings.
  • Comfort operating with ambiguity in fast-moving environments.

Nice-to-haves

  • Direct experience with RL environment data pipelines, evaluation framework design, and red-teaming workflows.
  • Background in data engineering, ML research support or equivalent.
  • Experience designing or operating agentic systems in production/near-production.
  • Familiarity with inter-rater reliability methods, calibration set design, and annotation quality frameworks.
  • Prior client-facing or technical program management experience in AI/ML-adjacent context.
  • Experience scoping/driving projects with fuzzy specs.

Skills

PythonSQLLangChainDspyAutogenRLHFSftRlvrRed-TeamingLLMsAgentic Workflows

Healthcare Data Analyst

Create advanced SQL/Spark SQL queries and prompt-engineered LLM workflows to transform healthcare claims data into clinical insights and automated policy tools. Requires 3-5 years SQL plus 2-3 years healthcare experience.

140k – 170kUnited StatesData EngineeringRemote3+ YOESQLClaude

Data Engineer

Build core data infrastructure as the first Data Engineer, designing scalable warehouse/lakehouse, data pipelines, and models for KPIs and AI systems. Requires 3-5+ years experience with Python, SQL, and modern cloud data stack in startups.

140k – 195kNew York, NYData EngineeringOn-site3+ YOESQLdbt

Software Engineer, Data Foundations

Build and scale data ingestion pipelines and connectors for enterprise SaaS apps, transform unstructured data for AI search and agents, ensure reliability and security at petabyte scale. Requires 3+ years backend/data infrastructure experience with distributed systems.

140k – 265kUnited StatesData EngineeringHybrid3+ YOEGoC++

Software Engineer L3 Data Substrate

As a Software Engineer on the Data & Analytics Platform team, you will design, build, and optimize the data platform to support various data-driven initiatives. You will work with cross-functional teams to architect scalable solutions and implement data infrastructure using modern data technologies.

139k – 204kUnited StatesData EngineeringRemote5+ YOEHiveHudi

Data Engineer

Senior Data Engineer building scalable data pipelines and infrastructure on AWS using Spark, Metaflow, and container orchestration. Requires 5+ years of experience designing distributed data systems.

145k – 190kUnited StatesData EngineeringRemote5+ YOEAWSSQL