Staff Softare Engineer, Cortex AI Infrastructure

236k – 339kMenlo Park, CAOnsite5+ YOEApr 27

Summary

Builds scalable backend infrastructure for Snowflake's Cortex AI products, including agentic runtimes, RAG systems, evals engines, and productionized AI workflows. Requires 5+ years in distributed systems and proficiency in Go/Java/Python.

About the role

Responsibilities

Architect Agentic Runtimes: Build and scale the orchestration engines that execute complex agentic workflows, ensuring low-latency tool execution and robust state management.
Scale Context Engineering Infra: Design high-performance systems for RAG (Retrieval-Augmented Generation), including vector database integration, scalable and efficient search indexing, query processing, and result ranking, semantic caching, and automated metadata extraction.
Build the "Evals Engine": Develop the automated infrastructure required to run massive-scale golden set simulations, error analysis pipelines, and "hillclimbing" experiments.
Productionize AI Workflows: Collaborate with the modeling team to take raw LLM capabilities and turn them into hardened, multi-tenant microservices with strict guardrails and observability.
Optimize Performance & Cost: Direct the infra strategy for model routing, prompt caching, and token optimization to ensure Snowflake’s AI features are the most efficient in the industry.

Requirements

Education: Bachelor’s degree in Computer Science or a related technical field.
Experience: 5+ years of experience building distributed systems, high-throughput APIs, or backend infrastructure for AI/ML products.
Technical Stack: Deep proficiency in Go or Java (for systems) and Python (for AI orchestration).
Systems Thinking: Strong understanding of database internals, distributed state management, and cloud-native architecture (Kubernetes, FoundationDB, etc.).
Domain Expertise: Familiarity with the "plumbing" of AI: vector indices, agent platforms, and building scalable data pipelines.

Bonus

Experience with query optimization and SQL engine internals.
Designing multi-tenant systems that handle sensitive enterprise data at scale.
Developing search infrastructure for large-scale applications.
Direct experience with any of the subsystems outlined above.

Skills

GoJavaPythonKubernetesFoundationDBRAGVector DatabasesDistributed SystemsLLMMulti-tenant Systems

Similar roles at this salary range

All Backend Engineering jobs →

Jun 8

Staff Software Engineer, Growth AI

Staff Software Engineer anchoring AI-powered growth products across SEO and exploratory teams. Architect production ML systems, partner with ML orgs, and set technical direction as a senior IC.

208k – 365kSan Francisco, CA +3Backend EngineeringHybridJavaLLMs

Traba

Jun 8

Staff Software Engineer

Lead development of core backend systems and platform architecture for an AI-powered industrial supply chain startup. Own architectural decisions, CI/CD, and performance optimization in an early-stage team.

240k – 300kNew York, NY +1Backend EngineeringOn-siteKafkaPython

ClickUp

Jun 8

Staff Backend Engineer, Search

Staff-level search engineer responsible for designing, scaling, and optimizing ClickUp's search infrastructure using OpenSearch/ElasticSearch, including real-time indexing, vector search, and relevance tuning.

250k – 300kUnited StatesBackend EngineeringRemoteNLPIndexing

ClickUp

Jun 8

Senior Backend Engineer, Search

Senior Search Engineer responsible for designing, optimizing, and scaling search infrastructure using OpenSearch/ElasticSearch, improving relevance and speed, and building vector search capabilities.

200k – 250kUnited StatesBackend EngineeringRemoteNLPIndexing

GlossGenius

Jun 5

Staff Software Engineer, Backend

Staff Backend Engineer leading architecture and technical direction for AI-powered products. Owns system design, mentors engineers, and builds proof-of-concepts in Kotlin on AWS/Kubernetes.

241k – 284kNew York, NYBackend EngineeringHybridAWSLLMs

Apply