Staff Network Engineer, Deployment
Leads physical and logical deployment of network infrastructure in data centers for AI/HPC, including rack/stack, testing, automation with Python/Ansible, and partner coordination. Requires 8+ years of experience, including Arista, Juniper, and NVIDIA hardware, BGP/EVPN, and physical layer expertise.
What You’ll Be Working On
- Execute Global Build-outs: Lead the end-to-end deployment of network infrastructure in new and existing data centers, from initial rack/stack oversight to final hand-off.
- Bridge Design and Reality: Take high-level designs from the Network Development team and translate them into site-specific implementation plans, cable maps, and configuration templates.
- Validate and Commission: Perform rigorous "Burn-in" testing and site acceptance testing (SAT) for new network clusters, ensuring zero-defect handovers to the Operations team.
- Optimize Deployment Automation: Use Python, Ansible, and ZTP (Zero Touch Provisioning) to automate the staging and configuration of hundreds of network devices simultaneously.
- Manage On-site Partners: Coordinate with remote hands, structured cabling vendors, and data center providers to ensure physical layer standards (fiber paths, power requirements, and cooling) meet Crusoe’s stringent HPC requirements.
- Inventory and Capacity Management: Track global hardware assets and lead the "Turn-up" of new backbone capacity and edge interconnects.
What You’ll Bring to the Team
- 8+ years of experience in network engineering with a heavy focus on large-scale data center deployments and infrastructure projects.
- Mastery of Physical Layer Standards: Expert knowledge of structured cabling (SMF/MMF, MPO/MTP), optical transceivers (400G/800G), and data center power/cooling requirements.
- Strong Routing and Switching Knowledge: Hands-on experience configuring Arista (EOS), Juniper (Junos), and NVIDIA/Mellanox platforms in a leaf-spine architecture.
- Protocol Proficiency: Solid understanding of BGP, EVPN-VXLAN, and LLDP as they relate to large-scale fabric provisioning.
- Automation-First Mindset: Proficiency in Python and Ansible for automating repetitive deployment tasks and validating configuration state.
- Logistical Excellence: Proven ability to manage multiple complex projects simultaneously across different time zones and physical locations.
- Troubleshooting Expertise: Ability to diagnose complex physical layer and link-layer issues using OTDRs, light meters, and packet captures.
Education: Bachelor’s degree in a technical field or equivalent practical experience in hyperscale or ISP environments.
Compensation
Compensation will be paid in the range of $174,000 to $211,000 + Bonus. Restricted Stock Units are included in all offers. Compensation will be determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.