Staff Software Engineer, AI Data Platform

Staff-level engineer building AI data platform infrastructure, eval systems, and agent-first tooling for frontier labs. Requires 4+ years shipping production systems, full-stack experience, and deep TypeScript/Python proficiency.

250k – 280kSan Francisco, CAML EngineeringHybrid4+ YOE

Apply

About the role

What you may work on

Eval systems that run millions of agent trajectories to measure model and product quality.
Fine-tuning pipelines that turn evaluation signals into measurable agent improvements.
Agent-first product surfaces: UX and infrastructure for workflows where the user is a model or an agent operator.
The systems behind hundreds of thousands of AI interviews used to source and match freelance workers to projects.
Infrastructure that scales to the throughput frontier labs actually need.
Integration of the latest models and capabilities into production within days of release.

What we're looking for

4+ year track record of shipping systems customers and other engineers rely on
You build full stack prototypes fast and they hold up. The v1 you ship becomes the foundation the rest of the team builds on.
Strong system and API design judgement
Hard architecture and product calls land with you. You make them, defend them under pressure, and update fast when someone else is right.
You ship production code with coding agents daily. You know where they break and what it takes to make them reliable to further accelerate the team's velocity.
You set direction by being the example. Other engineers reach for your designs and your code as the reference.
You move fast in ambiguous, startup-pace environments with influence over authority.
You have worked in all parts of the stack
Deep proficiency in TypeScript and/or Python.

Nice to have

Production experience building LLM- or agent-driven products.
Designing evaluations for LLMs and agents, or producing high-quality data for ML systems.
Background in production distributed systems, ML infrastructure, or data systems at scale.

Our Technology Stack

Frontend: React.js with Redux, TypeScript
Backend: Node.js, TypeScript, Python, some Java & Kotlin
APIs: GraphQL
Cloud & Infrastructure: Google Cloud Platform (GCP), Kubernetes
Databases: MySQL, Spanner, PostgreSQL
Queueing / Streaming: Kafka, PubSub

Skills

TypeScriptPythonReactNode.jsGraphQLGCPKubernetesMySQLPostgresKafka

Similar roles

ML Engineering jobs

Axion

Staff Software Engineer, Agentic Platform

Senior individual contributor architecting and scaling agentic LLM systems that turn messy manufacturing data into reliable root-cause insights. Owns orchestration, retrieval, evaluation, and guardrails for non-deterministic production systems.

250k – 270kSan Francisco, CA +1ML EngineeringHybrid7+ YOEMcpObservability

The Voleon Group

Member of Research Staff, Optimization

Conduct optimization research and implement large-scale constrained optimization models that drive real-time trading decisions, working across the full research lifecycle from theory to production. Requires PhD-level coursework and strong applied research background in optimization.

250k – 275kBerkeley, CA +1ML EngineeringHybrid7+ YOEC++Python

Nuance Labs

Member of Technical Staff — RL Research

New/recent PhD to own RL and post-training for large-scale omni models. Build and scale the full RL/post-training stack including rollout, optimization, reward modeling, and evaluation for real-time audiovisual AI.

250k – 350kSeattle, WAML EngineeringOn-siteEntry levelPpoDpo

Nuance Labs

Member of Technical Staff — Model Optimization and Inference

Optimize inference for real-time multimodal AI avatars. Specialize in LLM and diffusion model serving, KV cache strategies, quantization, and low-latency frameworks like vLLM and TensorRT-LLM.

250k – 350kSeattle, WAML EngineeringOn-site7+ YOEAwqvLLM

ClickUp

Staff AI Engineer - AI Product

Leads development of user-facing AI features using LLMs and AI models, integrating them into production for scalable, personalized experiences. Requires 5+ years engineering experience with Python/JS, databases, and AI orchestration expertise.

250k – 300kUnited StatesML EngineeringRemote5+ YOELLMsMySQL