Skip to content

Software Engineer - Infrastructure

100k – 180kSunnyvale, CADevOps / SREOnsite3+ YOE
Summary

Builds and maintains edge and cloud infrastructure for IoT devices and AI video security platform, including AWS provisioning, Kubernetes orchestration, CI/CD pipelines, and observability. Requires 3+ years in AWS IaC, Docker/K8s, and Python/Go.

About the role

Responsibilities

  • Writing and maintaining production-grade software code for our custom edge infrastructure stack
  • Provisioning and maintaining resources running on AWS
  • Building provisioning and management across hundreds of thousands of connected IoT devices deployed in the field
  • Building CI and CD and automation pipelines for various parts of the stack
  • Building observability and telemetry, both across our cloud applications and our edge devices
  • Helping maintain compliance with various security standards (SOC2, HIPAA …)
  • Maximising developer productivity by streamlining development workflows

Requirements

  • 3+ years of experience writing production infrastructure running on AWS using infrastructure as code tools, such as Pulumi or Terraform
  • Experience with Docker and Kubernetes (particularly EKS)
  • 3+ years of experience with Python, Go or any other modern programming language
  • Building CI / CD pipelines and automation of various parts of the stack
  • Self-hosting and maintaining observability tools such as Grafana/Prometheus

Nice-to-Haves

  • Edge/IoT infrastructure (Yocto, IoT devices provisioning, over-the-air updates..)
  • Remote management of on-prem infrastructure, or hybrid/multi-cloud
  • Maintaining SOC 2 compliance
  • Building high-volume data-processing pipelines
  • High intrinsic motivation to succeed and ability to work hard
Skills
AWSKubernetesEKSDockerTerraformPulumiPythonGoCI/CDGrafanaPrometheusIoTYocto
Similar roles at this salary range
All DevOps / SRE jobs →
Openly

Site Reliability Engineer II (Remote, US)

DevOps/SRE II building and maintaining infrastructure for an insurance platform using GCP, Kubernetes, and Terraform. Focus on automation, monitoring, incident response, and security best practices.

115k – 173kUnited StatesDevOps / SRERemote2+ YOEGoCI/CD
Pinterest

Site Reliability Engineer II

Operate and scale a cloud-native CTV advertising platform on AWS and Kubernetes. Focus on reliability, GitOps workflows, infrastructure automation, observability, and incident response.

114k – 235kSan Francisco, CADevOps / SRERemote4+ YOEAWSEKS
CommandLink

Senior Network Engineer

Senior Network Engineer building and supporting carrier interconnects, private circuits, NNIs, and cloud connectivity for a managed network services provider. Requires hands-on service provider experience with Layer 2/3 protocols and direct carrier coordination.

120k – 160kUnited StatesDevOps / SRERemote5+ YOEBGPVRF
Nuro

Software Reliability Engineer

Build and operate resilient systems for Nuro's autonomous vehicle fleet. Design pipelines, automation, and tools to improve reliability and reduce operational toil. Join on-call rotation and lead investigations.

109k – 163kMountain View, CADevOps / SREOn-siteGoC++
Kraken

Site Reliability Engineer - AI Agents

Design, build, and operate reliable infrastructure for AI agent workflows and model serving on AWS and Kubernetes. Build platform APIs, SDKs, and self-service tooling while ensuring observability and incident response for production AI systems.

96k – 192kUnited StatesDevOps / SRERemote5+ YOEAWSBash