Skip to content

Senior Software Engineer

Build and own production AI agent systems (harnesses, evals, orchestration) on frontier LLMs for industrial supply chain workflows. Requires 5+ years software engineering with 1+ year shipping LLM/agent features, strong Python/TS, and high-agency customer immersion.

200k – 240kNew York, NYSan Francisco, CAML EngineeringOnsite5+ YOE

About the role

Responsibilities

  • Embed with customers and operators to understand how supply chains run today—then design and ship agents that take meaningful work off their plate.
  • Build production agent systems on frontier LLMs: tool use, sub-agents, retrieval, structured outputs, MCP servers, and the orchestration that ties them together.
  • Own evaluation as a first-class discipline—datasets from real traces, rubrics and graders, experiments, and improvements you can prove move the needle.
  • Architect the data, services, and APIs the agent layer depends on—integrating our internal systems with customers' WMS, TMS, and ERP environments.
  • Codify repeatable deployment patterns so each new customer rollout is faster than the last.

Requirements

  • 5+ years of software engineering, with 1+ year shipping LLM- or agent-based features into production.
  • Strong in Python and/or TypeScript/Node.js, and comfortable designing APIs, distributed systems, and data models in PostgreSQL.
  • Hands-on with the modern agent stack: production-scale prompt engineering, evaluation frameworks, orchestration patterns, and frontier model APIs.
  • A track record of building in fast, ambiguous environments—ideally at a vertical AI, AI-agent, forward-deployed, or data-product company.
  • Excellent written and verbal communication—you can run a customer workshop in the morning and write a clean design doc in the afternoon.

Nice-to-Haves

  • Builder with an AI operator's instinct. You've shipped real product on top of LLMs—not just chat wrappers. You've designed agent harnesses, structured tools, written evals, and tuned prompts against production traces.
  • Domain-immersed. You enjoy time with the people who do the work—learning an industry's vocabulary, edge cases, and operational tempo—and let that shape what you build.
  • High-agency in ambiguity. Dropped into a fuzzy customer problem with a half-formed hypothesis and a deadline, you scope, build, evaluate, and ship without waiting for a spec.
  • Sweat both ends of the stack. You move between prompt iteration, eval design, backend services, and customer-facing UI in the same week, and you care about evals that catch regressions and traces that are easy to debug.

Compensation

  • Competitive salary in the range of $200,000 - $240,000.
  • Start-up equity.
  • 100% Paid health, dental & vision coverage.
  • Additional benefits including commuter benefit, gym membership, HSA, mental health support, and more.

Skills

PythonTypeScriptNode.jsPostgresLLMsPrompt EngineeringEvaluation FrameworksAgent OrchestrationDistributed SystemsAPIsWmsTmsERP

Similar roles

ML Engineering jobs

AI Engineer

Build and productionize LLM systems for clinical documentation at a healthcare AI startup. Requires 7+ years experience training, fine-tuning, and evaluating models with strong focus on evals and reliability.

200k – 250kSan Francisco, CAML EngineeringOn-site7+ YOELLMsFine-Tuning

Senior Software Engineer — LLM Post-Training Platform

Build and scale Snowflake's Cortex Training LLM post-training platform, handling distributed GPU scheduling, orchestration, and productionizing research for enterprise-scale model adaptation.

200k – 288kBellevue, WAML EngineeringOn-site5+ YOERayFsdp

Senior Software Engineer, Applied ML

Senior Software Engineer applying ML/AI techniques to fintech problems like fraud detection and user conversion. Requires 5+ years backend engineering (Java/JVM) and 2+ years deploying production ML models.

200k – 250kNew York, NYML EngineeringOn-site5+ YOEGCPAWS

Senior AI Engineer

Build and deploy production-grade agentic AI systems and automation workflows that drive efficiency across sales, marketing, finance, and other business functions. Partner with stakeholders to identify high-impact use cases and deliver reliable, observable LLM-powered solutions.

200k – 230kCarson City, NV +1ML EngineeringHybrid8+ YOERAGPython

Senior Applied Scientist

Build and own algorithmic systems that evaluate providers, make recommendations, and optimize healthcare outcomes for cost, quality, and access. Requires 2+ years shipping production algorithms and expertise in ML, optimization, and heuristics.

200k – 245kNew York, NYML EngineeringHybrid2+ YOESQLAWS