Staff Software Engineer (Developer Platform)
Staff Software Engineer on the Developer Platform team building internal infrastructure, monorepo CI/CD, and Agentic AI tooling to accelerate secure software delivery. Requires 8+ years experience with Go, CI/CD, and cloud platform infrastructure.
What you'll do
- Take ownership of significant pieces of the Developer Platform, across monorepo CI/CD, the Agentic AI platform (DriftlessAF), shared build infrastructure, or the change-throughput substrate, and contribute to technical direction.
- Improve the speed, reliability, and developer experience of the monorepo CI/CD pipeline.
- Contribute to the next generation of change throughput — exploring substrates beyond PR-based workflows for agent-driven changes, trusted agents committing directly to main, and AI-assisted review and auto-merge.
- Help productionize the Agentic AI platform: agent observability, RAG-as-platform, and context-engineering patterns.
- Consolidate fragmented build systems and deliver standardized "paved road" blueprints for new products.
- Partner with engineers to identify toil, codify patterns, and automate manual work.
- Mentor teammates, raise the bar on design and code review, and serve as a technical voice for engineering velocity.
What we're looking for
- 8+ years building and operating production services and platform infrastructure in modern cloud environments.
- Proficiency with Go, or strong readiness to ramp quickly.
- Deep technical expertise in CI/CD systems, container-based orchestration, and scaling a large monorepo.
- Experience contributing to technical direction across multiple teams.
- Familiarity with Agentic AI and LLM-based automation in production, and interest in context engineering (RAG, memory, tool-calling).
- Excellent communication and collaboration skills, with a track record of automating repetitive manual work.
- Comfort with ambiguity and a focus on data-driven improvements.
Nice to have
- Experience scaling GitHub Actions, Argo Workflows, Buildkite, or comparable systems.
- Experience scaling change-throughput beyond traditional PR workflows.
- Background in developer experience metrics (DORA, SPACE).
- Experience operating a large Go monorepo.
- Familiarity with supply chain security, container ecosystems (OCI, Kubernetes), or open source communities.
- Experience publishing internal-platform OKRs and running stakeholder feedback loops.
Compensation & Benefits
Base Salary Range: $205,000–$231,000 USD Flexible & Remote-First Culture with team meetups and coworking stipend. Stock options with 10 years to exercise. 100% covered health, vision, and dental insurance. ∞ Flexible Time Off. 18 weeks paid parental leave for birthing parents, 12 weeks for non-birthing.
Senior Network & Site Reliability Engineer
Design, operate, and automate the global network and reliability layer for a high-performance NVIDIA DGX SuperPOD supporting ML workloads. Own architecture, observability, incident response, and security for mission-critical infrastructure.
Senior Software Engineer - Observability Visibility
Senior engineer building observability and resilience standards, tooling, and automation to make reliability the default across Datadog services. Requires 5+ years experience, Go/Python skills, and AI feature delivery experience.
Senior Manager, DevOps Engineering
Lead and mentor a team of DevOps and Infrastructure Engineers responsible for build pipelines, CI/CD systems, developer tooling, and release infrastructure across Hivemind Solutions. Drive modernization of C++/Python build ecosystems and ensure scalable, secure software delivery pipelines.
Staff Software Engineer
Staff Software Engineer owning technical strategy and systems for Coinbase's test infrastructure at scale. Focus on fast, reliable test signals through orchestration, smart selection, sharding, and flakiness remediation.
Staff Engineer, AI Productivity
Staff-level engineer building infrastructure, tooling, and documentation to make AI coding agents dramatically more productive across the codebase. Owns agentic dev environments, MCP integrations, and agent context.