# Principle Software Engineer, AI Observability & Evals Platform
**Company:** [LangChain](https://hotfix.jobs/companies/langchain)
**Location:** Cambridge, MA, Boston, MA, San Francisco, CA, New York, NY
**Salary:** $230K-$270K
**Experience:** 10+ years
**Skills:** Python, Go, TypeScript, React, Postgres, Redis, ClickHouse, AWS, GCP, Azure
**Posted:** 2026-05-13
> Leads technical direction for AI observability platform, building full-stack features in Go/Python/TypeScript, driving architecture, mentoring engineers, and ensuring production reliability at scale. Requires 10+ years backend/fullstack experience with high-throughput systems.
## Job Description
## What You'll Do

### Drive Technical Direction
- Lead architectural decisions across our Go, Python, and TypeScript stack, ensuring systems are performant, maintainable, and built to scale
- Work across the full stack, owning features end-to-end from backend services and APIs through to frontend product experiences
- Drive tracing, monitoring, and evaluation workflows at scale, with a focus on reliability and query performance across high-volume data
- Help shape the product roadmap by partnering closely with product and design — not just executing on it

### Raise the Bar for the Team
- Set engineering standards for the team: define patterns, lead code reviews, and establish the foundations others build on
- Mentor and grow engineers at all levels through code review, design feedback, pairing, and ongoing technical guidance
- Drive projects from ambiguity to delivery while maintaining high engineering standards and aggressive timelines

### Own Reliability and Quality
- Troubleshoot and resolve production issues with a root-cause mindset, and implement durable fixes
- Ensure system reliability through strong testing, monitoring, and alerting practices
- Create and maintain technical documentation, including system design docs and API references

## What You'll Bring
- 10+ years of professional experience in backend or fullstack engineering on highly complex, production systems
- Strong programming skills across multiple parts of the stack: backend (**Python** and/or **Go**) and frontend (**TypeScript**, **React**, or similar)
- Demonstrated experience making and owning architectural decisions, including tradeoffs around data systems, APIs, and service reliability
- Experience with high-throughput or mission-critical systems, and a proven ability to optimize for performance and reliability
- Depth in operationalizing technical work — you've taken systems from prototype to production and kept them running well at scale
- Demonstrated track record of mentoring engineers and raising the technical quality of a team, not just the codebase
- Strong communication skills and comfort operating cross-functionally with product, design, and engineering leadership
- Customer centricity and an ownership mentality — you care how the product lands, not just how the code reads
- You exemplify our operating principles

## Nice to Have
- Experience with database systems (**Postgres**, **Redis**, **ClickHouse**) and cloud platforms (**AWS**, **GCP**, or **Azure**)
- Familiarity with observability tooling, evaluation frameworks, or AI/LLM infrastructure

## Salary Range
$230,000 - $270,000
**Apply:** https://hotfix.jobs/jobs/principle-software-engineer-ai-observability-evals-platform-at-langchain-f95fbb92-9993-418e-ad62-f735edeb434c
**Canonical:** https://hotfix.jobs/jobs/principle-software-engineer-ai-observability-evals-platform-at-langchain-f95fbb92-9993-418e-ad62-f735edeb434c