Skip to content

Senior Cloud Engineer

215k – 240kBellevue, WAFloridaHybrid5+ YOE
Summary

Senior Cloud Engineer owning AWS/GCP infrastructure, Kubernetes/GitOps platforms, and CI/CD systems. Designs and operates scalable, secure cloud infrastructure while mentoring engineers and enabling AI/ML tooling.

About the role

Responsibilities

  • Design, build, and operate cloud infrastructure, platform tooling, and CI/CD systems for OfferUp's core applications
  • Build cloud infrastructure as code using Terraform on AWS and GCP, optimizing for reliability, security, and cost
  • Own and evolve the Kubernetes & GitOps platform (EKS/GKE, Envoy/Gloo networking, Helm, ArgoCD)
  • Build and maintain CI/CD pipelines and developer workflows (GitHub Actions, monorepo tooling, JFrog Artifactory)
  • Maintain backend "golden templates" and shared libraries used across the organization
  • Lead platform migrations end-to-end including observability, data-store, Kubernetes, and multi-account migrations
  • Drive operational excellence and security: own observability in Datadog, participate in on-call, remediate CVEs, manage secrets and access controls (Cloudflare/ZTNA, Okta, SSO, Workload Identity Federation), optimize cloud cost
  • Enable AI/ML developer tooling (Amazon Bedrock, Claude, AI-assisted workflows)
  • Lead design and code reviews, set technical standards, mentor engineers, and partner with other teams

Requirements

  • 5–8 years in cloud engineering, infrastructure, platform, DevOps, or SRE roles
  • Bachelor's in Computer Science or a related field, or equivalent practical experience
  • Deep, hands-on experience with AWS (ideally GCP): EKS/GKE, Lambda, networking, IAM, S3, Kinesis, OpenSearch, Route53 using Terraform at scale
  • Solid experience with Kubernetes (EKS/GKE), Helm, GitOps (ArgoCD), and CI/CD pipelines (GitHub Actions or similar) in a monorepo or multi-service environment
  • Experience with modern observability tooling (Datadog or similar) for metrics, logs, traces, and alerting; supporting production systems, on-call, and debugging complex distributed systems
  • Proficiency in at least one of Java, TypeScript, Python, or Go for automation, tooling, and platform development; comfort with scripting and SQL
  • Ability to communicate technical concepts to technical and non-technical audiences, lead through influence, and mentor other engineers

Nice-to-Haves

  • Experience with edge and zero-trust networking (Cloudflare, Envoy/Gloo)
  • Experience operating data and streaming infrastructure (Kinesis, Flink, Redis/Valkey, Confluent/Kafka)
  • Experience enabling AI/ML infrastructure or developer tooling (Amazon Bedrock, SageMaker, or LLM-based developer workflows)
  • Experience with security and compliance practices, change management, and incident management at scale
  • Experience with GCP Workload Identity Federation and cross-cloud (AWS↔GCP) integrations

Compensation & Benefits

  • Compensation Range: $215,000 - $240,000
  • Equity in OfferUp
  • Health insurance, healthcare savings and spending accounts
  • 401(k) plan with match
  • Basic and voluntary life insurance, disability benefits
  • Paid time off: sick leave, family/medical leave, vacation (flexible 3-5 weeks), 12 company holidays
Skills
AWSGCPTerraformKubernetesEKSGKEHelmArgoCDGitHub ActionsDatadogJavaTypeScriptPythonGoCI/CD
Similar roles at this salary range
All DevOps / SRE jobs →
Alembic

Senior Network & Site Reliability Engineer

Design, operate, and automate the global network and reliability layer for a high-performance NVIDIA DGX SuperPOD supporting ML workloads. Own architecture, observability, incident response, and security for mission-critical infrastructure.

210k – 240kSan Francisco, CADevOps / SREOn-site8+ YOEBGPVPN
Datadog

Senior Software Engineer - Observability Visibility

Senior engineer building observability and resilience standards, tooling, and automation to make reliability the default across Datadog services. Requires 5+ years experience, Go/Python skills, and AI feature delivery experience.

175k – 240kNew York, NYDevOps / SREHybrid5+ YOEGoPython
Shield AI

Senior Manager, DevOps Engineering

Lead and mentor a team of DevOps and Infrastructure Engineers responsible for build pipelines, CI/CD systems, developer tooling, and release infrastructure across Hivemind Solutions. Drive modernization of C++/Python build ecosystems and ensure scalable, secure software delivery pipelines.

180k – 280kWashington, DCDevOps / SREOn-site7+ YOENixCMake
Coinbase

Staff Software Engineer

Staff Software Engineer owning technical strategy and systems for Coinbase's test infrastructure at scale. Focus on fast, reliable test signals through orchestration, smart selection, sharding, and flakiness remediation.

218k – 257kUnited StatesDevOps / SRERemote10+ YOEGoAWS
Hightouch

Staff Engineer, AI Productivity

Staff-level engineer building infrastructure, tooling, and documentation to make AI coding agents dramatically more productive across the codebase. Owns agentic dev environments, MCP integrations, and agent context.

180k – 400kUnited StatesDevOps / SRERemote7+ YOEGoDevin