Skip to content

Latest DevOps / SRE jobs at Blaxel

Site Reliability Engineer

Builds and operates reliable, scalable AI infrastructure including observability, SLOs, incident response, automation, and performance tuning for ultra-low-latency serverless compute. Requires 3+ years SRE/DevOps experience with cloud, Kubernetes, programming (Go/Rust/Python), and observability tools.

175k – 250kSan Francisco, CADevOps / SREOn-site3+ YOEGoAWS