Infrastructure Engineer

The Infrastructure Engineer owns system stability, observability, and debugging at scale. The role requires 3+ years of experience with Go, Kubernetes, and tools such as Datadog and Prometheus for production incident response.

200k – 240k · San Francisco, CA · DevOps / SRE · Onsite · 3+ YOE

About the role

Must-Have

  • 3+ years of hands-on experience debugging production systems (logs, traces, incidents, etc.)
  • Strong problem-solving skills and ability to dive into unfamiliar backend codebases
  • Strong Go and Kubernetes experience
  • Familiarity with observability and monitoring tools (e.g., Datadog, Prometheus, Sentry)
  • Clear, calm communication under pressure — especially during live incidents

Nice-to-Have

  • Experience working with distributed systems or services at scale
  • Experience building or maintaining internal tooling for on-call teams or reliability workflows
  • Familiarity with deployment pipelines, CI/CD, or infra-as-code
  • Experience improving system observability (e.g., custom metrics, traces, log pipelines)

Skills

Kubernetes, Go, Datadog, Prometheus, Sentry, Distributed Systems, CI/CD, Observability, Infrastructure As Code, Debugging

Similar roles

DevOps / SRE jobs

Platform Engineer, Model Shaping

Build and operate backend services and infrastructure for model customization and evaluation at Together AI. Requires 3+ years building production infrastructure, strong Python/Go skills, and deep experience with Kubernetes, Linux, and cloud platforms.

200k – 290k · San Francisco, CA · DevOps / SRE · Hybrid · 3+ YOE · Go · AWS

Platform Engineer

Own AWS infrastructure, Pulumi IaC, deployment pipelines, and security baseline for an AI research platform serving financial institutions. First dedicated platform hire defining enterprise deployment, SOC 2 controls, and developer experience.

200k – 280k · New York, NY · DevOps / SRE · On-site · 5+ YOE · AWS · CDK

SRE/Infrastructure Engineer

Own Terraform, Kubernetes, and cloud infrastructure for a fast-growing AI infrastructure startup. Manage multi-cloud deployments, build reusable infrastructure components, and support enterprise BYOC offerings.

200k – 350k · San Francisco, CA · DevOps / SRE · On-site · 5+ YOE · GCP · AWS

Software Engineer - Infrastructure

Builds and scales reliable cloud infrastructure, deployment systems, observability, and developer tooling to support mortgage market operations. Requires experience with strongly typed languages, PostgreSQL, Kubernetes, and major cloud providers.

200k – 250k · United States · DevOps / SRE · Remote · Go · C#

AI Automation Engineer

Builds AI-powered CI/CD pipelines and automation infrastructure to enable autonomous code generation, testing, and deployment. Collaborates across teams to identify AI opportunities and develops productivity tools, ensuring production reliability.

200k – 230k · New York, NY · DevOps / SRE · Hybrid · AI · CI/CD