Sr AI Engineer - Agentic Systems

Technical leader building and scaling production-grade multi-agent AI systems for real-time voice, workflow automation, and enterprise tool execution. Requires 8+ years experience and deep expertise in LLM platforms, agent frameworks, and distributed systems.

United StatesML EngineeringRemote8+ YOE

Apply

About the role

What You’ll Do

Drive Technical Strategy: Own the architectural roadmap and delivery of Dialpad’s Agentic infrastructure, core orchestration layers, memory architectures, and evaluation/observability systems.
Build & Scale: Design and deploy scalable, multi-modal AI agents capable of autonomous support, real-time voice reasoning, and secure API tool execution across complex enterprise workflows.
Mentor & Influence: Act as a technical anchor for the organization, raising the engineering bar, mentoring senior peers, and defining technical standards for an AI-native SDLC.
Partner Cross-Functionally: Collaborate with leadership across Product, Engineering, and Applied Research to align technical execution with Dialpad’s long-term business strategy.
Push the Frontier: Research and implement emerging agent frameworks, LLM inference optimization, advanced retrieval systems, and cutting-edge safety/policy guardrails to keep Dialpad at the absolute forefront of the "era of the agent."

Skills You’ll Bring

Experience:

8+ years of relevant software engineering experience, with a proven track record of technical leadership (as a Senior, Staff, or Principal Engineer) shipping complex, large-scale systems.

Systems Background:

Strong foundations in scaling distributed systems and production-grade infrastructure before evolving into applied AI, LLM platforms, and agentic architectures.

Core Technical Expertise:

Shipped production systems where AI agents reason, act, coordinate, and safely execute workflows.
LLM Platforms: Inference optimization and fine-tuning strategies.
Data & Retrieval: Advanced retrieval systems and memory architectures.
Agent Frameworks: Hands-on experience with frameworks like LangChain/LangGraph, CrewAI, or AWS/Google Agent ecosystems.
AI Ops: Evaluation, observability, and safety frameworks for production AI systems.
Real-Time Infrastructure: Streaming infrastructure and voice/conversational AI.
Tool Integration: Tool use, API execution frameworks, and human-in-the-loop validation systems.

Leadership & Mindset:

Operational Excellence: Experience setting clear technical goals, identifying architectural risks, and systematically clearing tech-debt gaps.
The 0→1 Archetype: Ability to thrive in ambiguity, build cutting-edge AI products from the ground up, and scale them into robust, self-sustaining enterprise systems.

Skills

Llm Inference OptimizationFine-TuningAdvanced Retrieval SystemsMemory ArchitecturesLangChainLangGraphCrewaiAws Agent EcosystemsGoogle Agent EcosystemsEvaluation FrameworksObservabilitySafety FrameworksStreaming InfrastructureVoice AiConversational AI

Similar roles

ML Engineering jobs

Traba

Senior Software Engineer

Build and own production AI agent systems (harnesses, evals, orchestration) on frontier LLMs for industrial supply chain workflows. Requires 5+ years software engineering with 1+ year shipping LLM/agent features, strong Python/TS, and high-agency customer immersion.

200k – 240kNew York, NY +1ML EngineeringOn-site5+ YOEWmsTms

Otter

Senior Machine Learning Engineer

Lead projects building and deploying large-scale ASR, NLP, and LLM systems for meeting intelligence. Requires 5+ years building production ML systems with PyTorch/JAX and experience with speech/language models.

230k – 265kMountain View, CAML EngineeringHybrid5+ YOEJAXAsr

OpenAI

Agent Post-Training, Artifacts Research

Train frontier models to generate polished artifacts (docs, spreadsheets, slides) by owning post-training improvements across RL, data, evals, and alignment. Requires strong ML fundamentals and hands-on LLM/RL experience.

295k – 445kSan Francisco, CAML EngineeringOn-site7+ YOELLMsRLHF

OpenAI

Agent Post-Training, Computer Use Research

Train frontier models to operate computers, browsers, and desktops. Design experiments, build evals, own post-training pipelines (RL, data, graders), and ship improvements into OpenAI agents.

295k – 445kSan Francisco, CAML EngineeringOn-site7+ YOERLHFLLMs

OpenAI

Agent Post-Training, Connectors Research

Train frontier agents to interface with professional software via code, APIs, and structured integrations. Design experiments, own post-training improvements (RL, evals, data), and ship capabilities into major model runs.

295k – 445kSan Francisco, CAML EngineeringOn-site7+ YOERLHFLLMs