Senior Site Reliability Engineer, Platform Infrastructure
Senior SRE building and scaling control-plane and data-plane infrastructure for distributed AI/ML workloads on Ray. Requires 3+ years of production experience, a strong distributed systems background, and expertise in Kubernetes, cloud platforms, Go/Python, and observability.