Backend Software Engineer (Evals)

230k – 385k · San Francisco, CA · Onsite · 4+ YOE
Summary

Build evals infrastructure and backend services for OpenAI's support automation, focusing on reliable pipelines, monitoring, and AI model integration. Requires 4+ years of backend experience with Python, FastAPI, and Postgres, plus experience with ML/LLM evals.

About the role

In this role, you will:

  • Design eval pipelines that are reliable, reproducible, and extensible
  • Build infrastructure for continuous eval monitoring frameworks (regression/drift monitoring, robust golden datasets) with feedback loops
  • Design, build, and maintain backend services and APIs for intelligent automation and knowledge systems
  • Integrate and structure data across internal platforms for downstream systems and AI workflows
  • Collaborate with data, research, and engineering teams to integrate OpenAI models into workflows
  • Own the full development lifecycle of new backend systems
  • Build with scale and maintainability in mind while iterating rapidly

You might be a great fit if you have:

  • 4+ years of backend engineering experience at product-driven companies (excluding internships)
  • Proficiency in backend technologies: Python, FastAPI, Postgres
  • Experience designing and scaling distributed systems, APIs, or data processing pipelines
  • Experience building AI agents or applications, including evals and performance improvement via prompting or scaffolding
  • Familiarity with LLM evaluation methods and patterns like multi-agent workflows, tool use, or long context
  • Experience creating production evals and/or measuring ML/LLM model performance at scale
  • A pragmatic mindset for shipping iteratively toward a long-term vision
Skills
Python, FastAPI, Postgres, Distributed Systems, APIs, Data Processing Pipelines, LLM Evals, AI Agents, Multi-agent Workflows, ML Models