Software Engineer, Infrastructure

170k – 290k | United States | Remote | 5+ YOE
Summary

Builds and scales cloud infrastructure for Render's developer platform, focusing on container orchestration, networking, storage, and AI workloads. Requires 5+ years of experience building and scaling cloud infrastructure, along with experience in Kubernetes, infrastructure-as-code tools such as Terraform, Pulumi, and Ansible, and production systems at scale.

About the role

What You'll Do

  • Own Render's core infrastructure across multiple data centers and regions.
  • Help offer unique capabilities to Render customers through infrastructure innovation.
  • Plan and architect for rapidly increasing scale.
  • Debug issues at all levels in our infrastructure stack.
  • Improve the performance and reliability of our infrastructure through increased observability, load testing, and chaos engineering.
  • Collaborate with other engineers to help keep our platform stable, predictable, and secure.
  • Participate in our on-call rotation with the rest of the engineering team.

What We're Looking For

  • At least 5 years of experience building and scaling cloud infrastructure.
  • Experience developing, maintaining, and debugging production systems at scale.
  • Experience building, operating, and scaling Kubernetes clusters or a similar resource/container orchestration system.
  • Experience with infrastructure-as-code tools like Terraform, Pulumi, and Ansible.

Nice-to-haves

  • Experience with Linux kernel and/or container optimization.
  • Familiarity with observability tools like Datadog, Grafana, and OpenTelemetry.
  • Experience hosting PostgreSQL (or similar data stores) at scale.
  • Security hardening skills, especially in the context of untrusted workloads.

Skills

Kubernetes, Terraform, Pulumi, Ansible, Linux kernel, Datadog, Grafana, OpenTelemetry, PostgreSQL, Chaos Engineering