Staff Infrastructure Engineer
Build infrastructure, observability, and developer tooling for a realtime AI platform serving 911 centers. Requires 6+ years infrastructure/platform/backend experience and comfort across the full stack.
What You’ll Be Doing
- Build out analytics infrastructure using ClickHouse to handle massive increases in traffic and increasingly complex queries
- Create instrumentation, logging, and monitoring tools for visibility into production issues
- Develop testing frameworks, CLIs, automations, and developer tooling to help the team move quickly with AI coding tools without sacrificing reliability
Who We’re Looking For
- 6+ years of experience in infrastructure, platform, or backend engineering roles, ideally at companies where reliability and scale are serious challenges
- Comfortable across areas of the stack from backend application code to cloud infrastructure
- Excited by the challenges of building AI systems at scale
- Thrive on autonomy and constantly look for new areas to improve
Benefits
- Comprehensive Medical, Dental, Vision & Life insurance
- 401(k)
- Unlimited PTO
- Company-wide offsites
- Equipment stipend
- Relocation assistance
- Daily delivered lunches
- Start-up Equity
Senior Infrastructure Engineer
Build analytics infrastructure, observability tooling, and developer platforms to support real-time AI agents for 911 centers. Requires 4+ years infrastructure/platform/backend experience and comfort across the full stack.
Lead Site Reliability Engineer
Lead SRE driving reliability strategy, infrastructure architecture, observability, and incident response for a B2B fintech platform on AWS and Kubernetes. Requires 7+ years building production-grade distributed systems.
Senior Developer Experience Engineer
Senior Platform Engineer focused on Developer Experience building tools, automation, CI/CD systems, and AI tooling to improve developer productivity and workflows. Requires 7+ years cloud experience, containerization, and proficiency in Ruby, Go, or Python.
Staff Network Engineer, Operations
Staff-level network operations engineer responsible for production reliability, incident response, and operational excellence across Crusoe's global edge, backbone, data center, and GPU cluster networks supporting AI workloads.