Technical leader building and scaling production-grade multi-agent AI systems for real-time voice, workflow automation, and enterprise tool execution. Requires 8+ years experience and deep expertise in LLM platforms, agent frameworks, and distributed systems.
About the role
What You’ll Do
- Drive Technical Strategy: Own the architectural roadmap and delivery of Dialpad’s Agentic infrastructure, core orchestration layers, memory architectures, and evaluation/observability systems.
- Build & Scale: Design and deploy scalable, multi-modal AI agents capable of autonomous support, real-time voice reasoning, and secure API tool execution across complex enterprise workflows.
- Mentor & Influence: Act as a technical anchor for the organization, raising the engineering bar, mentoring senior peers, and defining technical standards for an AI-native SDLC.
- Partner Cross-Functionally: Collaborate with leadership across Product, Engineering, and Applied Research to align technical execution with Dialpad’s long-term business strategy.
- Push the Frontier: Research and implement emerging agent frameworks, LLM inference optimization, advanced retrieval systems, and cutting-edge safety/policy guardrails to keep Dialpad at the absolute forefront of the "era of the agent."
Skills You’ll Bring
Experience:
- 8+ years of relevant software engineering experience, with a proven track record of technical leadership (as a Senior, Staff, or Principal Engineer) shipping complex, large-scale systems.
Systems Background:
- Strong foundations in scaling distributed systems and production-grade infrastructure before evolving into applied AI, LLM platforms, and agentic architectures.
Core Technical Expertise:
- Shipped production systems where AI agents reason, act, coordinate, and safely execute workflows.
- LLM Platforms: Inference optimization and fine-tuning strategies.
- Data & Retrieval: Advanced retrieval systems and memory architectures.
- Agent Frameworks: Hands-on experience with frameworks like LangChain/LangGraph, CrewAI, or AWS/Google Agent ecosystems.
- AI Ops: Evaluation, observability, and safety frameworks for production AI systems.
- Real-Time Infrastructure: Streaming infrastructure and voice/conversational AI.
- Tool Integration: Tool use, API execution frameworks, and human-in-the-loop validation systems.
Leadership & Mindset:
- Operational Excellence: Experience setting clear technical goals, identifying architectural risks, and systematically clearing tech-debt gaps.
- The 0→1 Archetype: Ability to thrive in ambiguity, build cutting-edge AI products from the ground up, and scale them into robust, self-sustaining enterprise systems.
Skills
Llm Inference OptimizationFine-TuningAdvanced Retrieval SystemsMemory ArchitecturesLangChainLangGraphCrewaiAws Agent EcosystemsGoogle Agent EcosystemsEvaluation FrameworksObservabilitySafety FrameworksStreaming InfrastructureVoice AiConversational AI