
Staff Software Engineer

218k – 257k | United States | Remote | 10+ YOE
Summary

Staff Software Engineer owning technical strategy and systems for Coinbase's test infrastructure at scale. Focus on fast, reliable test signals through orchestration, smart selection, sharding, and flakiness remediation.

About the role

Responsibilities

  • Define and own the technical strategy for test infrastructure across Coinbase engineering, prioritizing speed of feedback.
  • Build and operate core Developer Infrastructure - Test services: test execution orchestration, smart test selection, parallel sharding, flaky-test detection and suppression, and test result storage and analysis.
  • Reduce the time between writing code and receiving test results; drive the flakiness rate down so that test signals are trustworthy.
  • Own systems end to end — SLOs, observability, on-call — with zero tolerance for correctness issues.
  • Partner with engineering teams to identify and fix test infrastructure bottlenecks.
  • Mentor engineers, set technical standards, and shape the organization's approach to test infrastructure.

Requirements

  • 10+ years building and operating production software, with strong fundamentals in distributed systems and a primary language such as Go or similar.
  • Demonstrated track record of defining and delivering technical strategy for foundational systems.
  • Deep, hands-on experience in test infrastructure: test execution at scale, flaky-test detection, test selection, sharding, or test result analysis.
  • Strong operational instincts, with a focus on reliability, security, and observability.
  • Track record of driving complex, cross-team technical projects to completion.
  • Customer-focused mindset that measures success by impact on the engineers who depend on these systems.
  • Ability to responsibly use generative AI tools and copilots in daily workflows.

Nice to Haves

  • Familiarity with test selection algorithms (change-based, dependency graph, ML-assisted) and coverage/speed tradeoffs.
  • Experience operating at scale with Kubernetes, AWS, GitHub Actions, Terraform, and containers (Docker/OCI).
  • Experience with observability and performance tooling (e.g., Datadog) for distributed test execution.
  • Curiosity about crypto/web3; experience in regulated or security-sensitive environments.
Skills
Go, Distributed Systems, Test Infrastructure, Test Execution, Flaky Test Detection, Test Selection, Sharding, Kubernetes, AWS, GitHub Actions, Terraform, Docker, Datadog, Observability
Similar roles at this salary range
Alembic

Senior Network & Site Reliability Engineer

Design, operate, and automate the global network and reliability layer for a high-performance NVIDIA DGX SuperPOD supporting ML workloads. Own architecture, observability, incident response, and security for mission-critical infrastructure.

210k – 240k | San Francisco, CA | DevOps / SRE | On-site | 8+ YOE | BGP | VPN
Datadog

Senior Software Engineer - Observability Visibility

Senior engineer building observability and resilience standards, tooling, and automation to make reliability the default across Datadog services. Requires 5+ years experience, Go/Python skills, and AI feature delivery experience.

175k – 240k | New York, NY | DevOps / SRE | Hybrid | 5+ YOE | Go | Python
Shield AI

Senior Manager, DevOps Engineering

Lead and mentor a team of DevOps and Infrastructure Engineers responsible for build pipelines, CI/CD systems, developer tooling, and release infrastructure across Hivemind Solutions. Drive modernization of C++/Python build ecosystems and ensure scalable, secure software delivery pipelines.

180k – 280k | Washington, DC | DevOps / SRE | On-site | 7+ YOE | Nix | CMake
Hightouch

Staff Engineer, AI Productivity

Staff-level engineer building infrastructure, tooling, and documentation to make AI coding agents dramatically more productive across the codebase. Owns agentic dev environments, MCP integrations, and agent context.

180k – 400k | United States | DevOps / SRE | Remote | 7+ YOE | Go | Devin
Skydio

Staff Software Engineer - Infrastructure

Staff Infrastructure Engineer responsible for re-architecting Kubernetes infrastructure, improving continuous delivery, and making code changes across the stack to support drone platform needs.

230k – 275k | San Mateo, CA | DevOps / SRE | Hybrid | 6+ YOE | Go | SaaS