Skip to content

Senior Infra Engineer

180k – 250kSan Francisco, CADevOps / SREHybrid5+ YOE
Summary

Builds and scales core infrastructure including Kubernetes clusters and cloud resources to power AI data platform. Collaborates with teams on foundational systems, optimizes for cost/performance, and ensures security/compliance. Requires 5+ years infra experience.

About the role

Responsibilities

  • Collaborate with other engineering teams to build and maintain foundational systems that empower developers and support the company's rapid growth.
  • Design and implement scalable infrastructure solutions for various deployment models, including SaaS, single-tenant, and private deployments.
  • Manage and optimize cloud resources and Kubernetes clusters for cost-effectiveness and performance.
  • Enable external customer deployment success through maintaining clear infrastructure boundaries and principles.
  • Optimize and improve the release and deployment processes to enhance efficiency and reliability.
  • Ensure compliance with relevant regulations and implement robust security measures across different deployment environments.

Qualifications

  • 5+ years of engineering experience.
  • Worked on Platform or Infrastructure teams on significant projects involving infrastructure components (Terraform/CDKTF, Kubernetes, Helm, test infrastructure, release management, observability, etc.).
  • Experience in optimizing cloud resource utilization. Proficient in tuning Kubernetes clusters and cloud resources for cost and performance efficiency.
  • Willing to build LlamaIndex's engineering culture as we grow.
  • You can balance speed and pragmatism and build the appropriate solutions for each stage of the company's growth.

Preferred Qualifications

  • Experience building out infrastructure from the ground up at a fast-growing startup.
  • Experience with observability tools like Prometheus, Grafana, New Relic.
  • Experience with GitOps tools like ArgoCD and Flux for continuous deployment.
  • Experience with security compliance and audits in cloud environments such as SOC2.
  • Familiar with Python, Postgres, multi-cloud deployments.
Skills
KubernetesTerraformHelmPrometheusGrafanaArgoCDFluxPythonPostgresCDKTF
Similar roles at this salary range
All DevOps / SRE jobs →
Tines

Senior Site Reliability Engineer - Government Cloud

Build and operate AWS GovCloud infrastructure for federal customers, owning IaC, container pipelines, compliance documentation, and operational tooling. Requires 5+ years AWS experience and FedRAMP familiarity.

210k – 220kUnited StatesDevOps / SRERemote5+ YOEAWSCDK
Idme

AI Enablement Engineer

Design and build AI automations, integrations, and agents that connect company systems and eliminate manual work. Requires 4+ years building production automations and LLMs, strong API skills, and experience with RAG, agents, and observability.

154k – 171kMcLean, VA +1DevOps / SREOn-site4+ YOEGoRAG
Idme

AI Enablement Engineer

Design and build AI automations, integrations, and agents that connect company systems and eliminate manual work. Requires 4+ years building production automations and LLMs, strong API skills, and experience with RAG, agents, and observability.

154k – 171kMcLean, VA +1DevOps / SREOn-site4+ YOEGoRAG
Idme

AI Enablement Engineer

Design and build AI automations, integrations, and agents that connect company systems and eliminate manual work. Requires 4+ years building production automations and LLMs, strong API skills, and experience with RAG, agents, and observability.

154k – 171kMcLean, VA +1DevOps / SREOn-site4+ YOEGoRAG
Idme

AI Enablement Engineer

Design and build AI automations, integrations, and agents that connect company systems and eliminate manual work. Requires 4+ years building production automations and LLMs, strong API skills, and experience with RAG, agents, and observability.

154k – 171kMcLean, VA +1DevOps / SREOn-site4+ YOEGoRAG