AI Staff Software Engineer

Staff AI Software Engineer builds and scales AI systems including PromptQL AI assistant, data engine, and runtime infrastructure for enterprise reliability. Requires 6+ years experience with AI/ML, distributed systems, LLMs, Python fluency, and technical leadership.

250k – 500kSan Francisco, CAML EngineeringHybrid6+ YOE

Apply

About the role

Role Requirements

6+ years of experience as a software engineer, with at least 1+ years building AI or ML-powered systems
Designed and shipped complex distributed systems at scale, and know how to optimize for performance, cost, and reliability
Worked directly with large language models (LLMs) in production
Fluent in Python and/or a systems language, and can move comfortably between infrastructure, APIs, and core product logic
Comfortable leading technically: writing high-leverage design docs, setting engineering direction, and mentoring other senior engineers
Biased toward action and can operate with high autonomy; know how to unblock yourself and others
Care about quality, safety, and performance; bring a systems-level perspective to how engineering decisions impact the product and user

Responsibilities

Write production code, design architecture, optimize performance, shape infrastructure, and contribute to safety mechanisms
Build core stack, rapidly design and implement features across the PromptQL AI assistant, the PromptQL data engine, and the PromptQL runtime infrastructure

Compensation

Salary $250,000 - $500,000
Annual bonus
Large equity package

Skills

PythonLLMsDistributed SystemsMachine LearningCloud InfrastructureAPIsSystems Programming

Similar roles

ML Engineering jobs

Axion

Staff Software Engineer, Agentic Platform

Senior individual contributor architecting and scaling agentic LLM systems that turn messy manufacturing data into reliable root-cause insights. Owns orchestration, retrieval, evaluation, and guardrails for non-deterministic production systems.

250k – 270kSan Francisco, CA +1ML EngineeringHybrid7+ YOEMcpObservability

The Voleon Group

Member of Research Staff, Optimization

Conduct optimization research and implement large-scale constrained optimization models that drive real-time trading decisions, working across the full research lifecycle from theory to production. Requires PhD-level coursework and strong applied research background in optimization.

250k – 275kBerkeley, CA +1ML EngineeringHybrid7+ YOEC++Python

Nuance Labs

Member of Technical Staff — RL Research

New/recent PhD to own RL and post-training for large-scale omni models. Build and scale the full RL/post-training stack including rollout, optimization, reward modeling, and evaluation for real-time audiovisual AI.

250k – 350kSeattle, WAML EngineeringOn-siteEntry levelPpoDpo

Labelbox

Staff Software Engineer, AI Data Platform

Staff-level engineer building AI data platform infrastructure, eval systems, and agent-first tooling for frontier labs. Requires 4+ years shipping production systems, full-stack experience, and deep TypeScript/Python proficiency.

250k – 280kSan Francisco, CAML EngineeringHybrid4+ YOEGCPReact

Nuance Labs

Member of Technical Staff — Model Optimization and Inference

Optimize inference for real-time multimodal AI avatars. Specialize in LLM and diffusion model serving, KV cache strategies, quantization, and low-latency frameworks like vLLM and TensorRT-LLM.

250k – 350kSeattle, WAML EngineeringOn-site7+ YOEAwqvLLM