Staff Software Engineer, Developer Experience

209k – 253k | San Francisco, CA | Sunnyvale, CA | Onsite | 7+ YOE
Summary

Staff-level engineer building developer tools, infrastructure, and automation to accelerate Crusoe engineering productivity. Requires Go, Kubernetes, CI/CD, and strong DevOps/SRE experience.

About the role

What You’ll Be Working On

Engineering Acceleration: Partner with the broader organization to build paved paths that engineers love to use. Our paved paths are fast, reliable and broadly adopted across all of engineering.

Toil Elimination: Ruthlessly identify and automate away the friction and repetitive tasks that slow down the development process, keeping Crusoe engineers in a state of high productivity.

Library & Environment Creation: Develop the libraries, tools, and pre-production environments necessary for vetting service APIs and complex microservice interactions.

Ecosystem Integration: Unify internal tooling and vendor services to automate workflows, build operational efficiency, and optimize security across the development stack.

Lifecycle Innovation: Innovate across every stage of the development lifecycle, including source code management, build systems, code review, CI/CD pipelines, platform runtimes, and telemetry.

Culture of Quality: Lead efforts to establish a culture of continuous quality delivery that scales seamlessly as our engineering headcount and infrastructure grow.

System Optimization: Work diligently to build efficient systems and processes that serve as a force multiplier for the impact of every engineer around you.

What You’ll Bring to the Team

  • Previous experience building developer tools and/or infrastructure for engineering teams
  • Expert ability to evaluate technical tradeoffs and understand how infrastructure decisions impact the daily productivity of the end-user (the developer)
  • Fluent knowledge of industry-standard AI tooling, build tooling, containerization, and open-source development frameworks
  • A demonstrated passion for building empathetic developer and operator workflows that prioritize human productivity
  • Professional experience managing or developing within Kubernetes clusters and a deep understanding of container orchestration
  • Proven experience in DevOps, Site Reliability Engineering (SRE), Release Engineering, or a similar productivity-focused discipline
  • Deep understanding of automated testing infrastructure and how to integrate it into a seamless CI/CD pipeline
  • Expertise in modern programming languages (specifically Go) and advanced proficiency in Git-based workflows (GitLab/GitHub)
  • A Bachelor’s or Master’s degree in Computer Science, Engineering, Mathematics, or a related analytical field (or equivalent professional experience)

Bonus Points

  • Experience informing long-term company objectives through technical insight and developer-centric advocacy
  • Experience building AI agent platforms for engineering teams
  • Hands-on experience with Linux image construction, package building, and kernel-level optimizations
  • Active involvement in the open-source community or a track record of staying current with recent industry advancements in developer productivity
  • A background in solving complex, multi-layered technical problems and then successfully automating the resulting solutions

Benefits

  • Competitive compensation
  • Restricted Stock Units
  • Paid time off & paid holidays
  • Comprehensive health, dental & vision insurance
  • Employer HSA contributions
  • Paid parental leave
  • Paid life insurance, short-term and long-term disability
  • Professional development & tuition reimbursement
  • Mental health & wellness support
  • Commuter benefits (parking & transit)
  • Cell phone stipend
  • 401(k) retirement plan with company match up to 4% of salary
  • Volunteer time off
Skills
Go, Kubernetes, Git, GitLab, GitHub, CI/CD, Docker, Linux, SRE, DevOps