Software Engineer - Networking Software and Services
Build software, services, and frameworks for network management, automation, and monitoring of large-scale GPU supercomputing fabrics. Requires deep network protocol knowledge and experience orchestrating tens of thousands of devices.
Responsibilities
- Build cutting-edge software, services, and frameworks to empower Network Development Engineers
- Tackle all facets of network management: metric collection, configuration, zero-touch provisioning, monitoring, and auto-remediation
- Develop extensible tools and streamline complex processes for production and ancillary networks
- Build software and tools with extensive metrics coverage for large GPU supercomputing network fabrics
- Implement IaC best practices, enhance deployment pipelines, and ensure robust, secure service delivery
Requirements
- Deep experience collaborating with network engineers using knowledge of network topologies (physical and logical) and network protocols
- Expert knowledge designing scalable and reliable software from the ground up that can build and orchestrate tens of thousands of network devices
- Ability to thrive in ambiguity and create metrics to help prioritize team focus
Tech Stack
- Python
- Go
- TCP/IP
- BGP
- RDMA
Benefits
- Equity
- Comprehensive medical, vision, and dental coverage
- 401(k) retirement plan
- Short & long-term disability insurance
- Life insurance
- Various discounts and perks
Senior Manager, DevOps
Lead DevOps strategy and team to improve engineering velocity, platform reliability, and operational efficiency across multi-cloud (AWS/GCP) environments. Drive IaC, Kubernetes delivery, observability, AI-powered tooling adoption, and cross-functional collaboration.
Software Engineer, Dev Velocity
Build internal developer platform, tooling, and automation to accelerate engineering velocity. Focus on CI/CD pipelines, test infrastructure, build systems, and metrics to help engineers ship faster and more reliably.
Senior Software Engineer, Observability
Senior engineer on the Auth0 Platform Observability team responsible for designing, building, and maintaining scalable observability infrastructure (metrics, logs, traces) using Datadog, Terraform, and OpenTelemetry.
Software Engineer, Cloud Infrastructure
Build and operate AWS cloud and LLM infrastructure powering retrieval-augmented generation, vector search, and ML pipelines for aviation AI systems. Requires strong AWS depth, Python data pipelines, and production LLM experience.