Software Engineer - Infrastructure
Builds and maintains edge and cloud infrastructure for IoT devices and AI video security platform, including AWS provisioning, Kubernetes orchestration, CI/CD pipelines, and observability. Requires 3+ years in AWS IaC, Docker/K8s, and Python/Go.
Responsibilities
- Writing and maintaining production-grade software code for our custom edge infrastructure stack
- Provisioning and maintaining resources running on AWS
- Building provisioning and management across hundreds of thousands of connected IoT devices deployed in the field
- Building CI and CD and automation pipelines for various parts of the stack
- Building observability and telemetry, both across our cloud applications and our edge devices
- Helping maintain compliance with various security standards (SOC2, HIPAA …)
- Maximising developer productivity by streamlining development workflows
Requirements
- 3+ years of experience writing production infrastructure running on AWS using infrastructure as code tools, such as Pulumi or Terraform
- Experience with Docker and Kubernetes (particularly EKS)
- 3+ years of experience with Python, Go or any other modern programming language
- Building CI / CD pipelines and automation of various parts of the stack
- Self-hosting and maintaining observability tools such as Grafana/Prometheus
Nice-to-Haves
- Edge/IoT infrastructure (Yocto, IoT devices provisioning, over-the-air updates..)
- Remote management of on-prem infrastructure, or hybrid/multi-cloud
- Maintaining SOC 2 compliance
- Building high-volume data-processing pipelines
- High intrinsic motivation to succeed and ability to work hard
Site Reliability Engineer II (Remote, US)
DevOps/SRE II building and maintaining infrastructure for an insurance platform using GCP, Kubernetes, and Terraform. Focus on automation, monitoring, incident response, and security best practices.
Senior Network Engineer
Senior Network Engineer building and supporting carrier interconnects, private circuits, NNIs, and cloud connectivity for a managed network services provider. Requires hands-on service provider experience with Layer 2/3 protocols and direct carrier coordination.
Site Reliability Engineer - AI Agents
Design, build, and operate reliable infrastructure for AI agent workflows and model serving on AWS and Kubernetes. Build platform APIs, SDKs, and self-service tooling while ensuring observability and incident response for production AI systems.