Skip to content

Senior Platform Engineer, Operator

Designs, builds, and maintains enterprise-grade Kubernetes operator in Rust for SMBP, focusing on API architecture, networking, security, observability, multi-cloud deployment, and operational tooling. Collaborates with customers to meet production Kubernetes needs.

223k – 259kAtlanta, GAAustin, TXSan Francisco, CA+1 moreDevOps / SRERemote

About the role

Responsibilities

  • Own the architecture and evolution of the SMBP Operator's CRD API surface, designing extensions for enterprise needs including status conditions, security contexts, network configuration, and admission validation.
  • Design and implement flexible network configuration patterns supporting various ingress controllers, load balancers, and traffic management without prescribing specific Kubernetes components.
  • Build observability integration: Prometheus metrics, ServiceMonitor CRDs, Kubernetes Events, and pre-built dashboards.
  • Harden security posture: pod security contexts, network policies, supply chain integrity (image signing, SBOM), and auditable configurations.
  • Make SMBP deployable across EKS, GKE, AKS, OpenShift, on-prem with OLM support, OperatorHub, and multi-distribution testing.
  • Build operational tooling: must-gather diagnostics, kubectl plugins, compatibility matrices, documented upgrade paths.
  • Partner with customer-facing teams and engage enterprise customers for deployment requirements and technical escalations.

Requirements

  • Production Rust experience with large multi-crate workspaces, async Rust, lifetimes, and ownership in reconciliation loops.
  • Deep Kubernetes internals: controllers, reconciliation loops, watches, owner references, finalizers, admission webhooks, status subresources.
  • Kubernetes operator development using controller-runtime, kube-runtime, kubebuilder or equivalent.
  • Kubernetes networking: Service types, Ingress, IngressClass, ingress controllers, trade-offs with LoadBalancer, Gateway API.
  • StatefulSet management: rolling updates, PVC lifecycle, pod disruption budgets, topology changes.
  • Observability: Prometheus metrics, ServiceMonitor CRDs, Prometheus Operator.
  • Enterprise customer orientation: security reviews, compliance, operational expectations.

Nice-to-Haves

  • kube-runtime experience.
  • Strimzi (Kafka operator) familiarity.
  • Cert-manager integration.
  • Property-based testing (proptest).
  • OLM and OpenShift operator certification.
  • Kubernetes Gateway API.
  • Supply chain security: Cosign, SBOM (Syft, SPDX).
  • Distributed systems: CRDT, eventually consistent stores, P2P replication.
  • Open source contributions to cloud-native, Kubernetes, or Rust tooling.

Skills

RustKubernetesController-RuntimeKubebuilderKube-RuntimePrometheusIngressServicemonitorStatefulsetGateway ApiOlmOperatorhubStrimziCert-ManagerCosign

Similar roles

DevOps / SRE jobs

Senior Software Engineer, FDE

Forward Deployed Engineer deploys and optimizes peer-to-peer sync systems in secure, high-stakes edge environments, troubleshoots real-time issues, and relays insights to product teams. Requires TS/SCI clearance, 5+ years engineering experience, and expertise in distributed systems and container orchestration.

223k – 259kSan Diego, CADevOps / SREOn-site5+ YOEGoK3S

Senior Software Engineer, Cloud

Builds and scales Rust-based services for edge-to-cloud distributed systems integrated with Kubernetes and major clouds (AWS, Azure, GCP). Requires 6+ years experience, preferably from FAANG/cloud providers, with strong distributed systems expertise.

223k – 259kAtlanta, GA +3DevOps / SRERemote6+ YOEGoAWS

Sr. Manager - Production Engineering

Lead engineering team managing cloud IAM operations, CSP provisioning, compliance, and security data pipelines across AWS, Azure, GCP. Requires 8+ years in security/cloud engineering, 5+ years management, and BS in technical field.

222k – 300kBellevue, WA +2DevOps / SRERemote8+ YOEAWSGCP

Senior Software Integration Engineer

Leads end-to-end integration of agentic platform tools with clients, backends, and cloud ops on Kubernetes. Requires 7+ years experience, strong Python, HTTP/auth expertise, and cross-functional collaboration for reliable tool contracts and releases.

225k – 249kFoster City, CADevOps / SREHybrid7+ YOEJwtTls

Senior Software Engineer, Platform Team

Designs and builds distributed systems for scheduling, workflows, and storage abstractions powering research and trading in hybrid environments. Requires 5+ years experience in scalable services, modern languages like Python/Go, and Linux with strong system design skills.

225k – 255kBerkeley, CA +1DevOps / SRERemote5+ YOEGoC++