Software Engineer, Network Performance & Reliability
United StatesDevOps / SREHybrid5+ YOE
Summary
Build and operate Argo's network performance and reliability platform powering Cloudflare products. Requires systems programming (Go/Rust/C/C++), deep networking knowledge (L3/L4, HTTP, TLS), and distributed systems experience.
About the role
What You’ll Do
The Argo team owns reliable network connectivity for Cloudflare’s products. Almost all products in Cloudflare’s portfolio are powered by Argo technology, including CDN, Spectrum, Magic Transit, Stream, Workers, Workers AI, R2, WARP, and more.
- Work closely with Engineering teams to translate requirements into new platform capabilities
- Partner with Network Engineering and SRE to optimize use of Cloudflare's edge network
- Participate in all stages of the software development lifecycle: designing, documenting, coding, testing, deploying, and monitoring production systems
- Work with Rust, Go, Linux networking, ClickHouse, PostgreSQL, Grafana, Kubernetes, and more
- Use AI-powered tools for code analysis, production system introspection, and problem-solving
- Operate under a "Run What You Build" model and participate in on-call rotation
Must-Have Skills
- Systems-level programming experience in Go, Rust, C, or C++
- Solid grasp of networking protocols in Layers 3 and 4 of the OSI Model
- Knowledge of HTTP, TLS, and CDN networks
- Experience implementing secure and highly-available distributed systems
- Strong debugging skills in complex systems
- Strong collaboration and communication skills
- Experience/interest in HTTP networking and VPN technologies
- Experience/interest in network performance monitoring and tuning
- Willingness to adopt and integrate AI tools into engineering workflow
Bonus Points
- Knowledge of TCP/IP and Internet routing
- Professional systems-level programming experience in Rust
- Working knowledge of statistical-analysis techniques and control theory
- Experience building tools and APIs
- Experience with monitoring, alerting, and debugging large-scale distributed systems
- Experience participating in an on-call rotation
- Experience using AI-assisted development tools in a professional setting
Skills
RustGoCC++Linux networkingClickHousePostgreSQLGrafanaKubernetesHTTPTLSCDNTCP/IPVPN