Network Reliability Engineer
United StatesDevOps / SREHybrid3+ YOE
Summary
Network Reliability Engineer responsible for operating and engineering Cloudflare's core data center network, building automation tools, and leveraging LLMs for deployment and troubleshooting. Requires 3+ years of network/SRE experience, strong Go/Python skills, and Linux networking expertise.
About the role
Responsibilities
- Technical operation and engineering of Cloudflare's core data center network
- Planning, installation, and management of network hardware and software
- Day-to-day operations of the network supporting internal needs (databases, high-volume logging, internal application clusters)
- Build tools to automate operational tasks and streamline deployment processes
- Provide a platform for other engineering teams to build upon
- Bring ideas from design through to production
- Leverage LLMs to build agentic deployment and troubleshooting tools, automate configurations (SaltStack + Temporal), parse complex log files, and streamline documentation
Requirements
- 3 years of relevant Network/Site Reliability Engineering experience
- BA/BS in Computer Science or equivalent experience
- Solid foundation on configuration management frameworks: Saltstack, Ansible, Chef
- Experience with NX-OS, JUNOS, EOS, Cumulus, or Sonic Network Operating Systems
- Solid Linux systems administration experience
- Linux networking experience (iproute2, Traffic Control, Devlink)
- Strong software development skills in Go and Python
Nice-to-Haves
- Deep knowledge of BGP and other routing protocols
- Workflow Management (AirFlow, Temporal)
- Open Source Routing Daemons (FRR, Bird, GoBGP)
- Experience with bare metal switching
- Experience with network programming in C, C++, or Rust
- Experience with the Linux kernel and Linux software packaging
- Strong tooling and automations development experience
- Time series databases (Prometheus, Grafana, Thanos, Clickhouse)
- Other Tools: Kubernetes, Docker, Prometheus, Consul
Skills
GoPythonSaltStackAnsibleChefNX-OSJUNOSEOSCumulusSonicLinuxBGPKubernetesDockerPrometheus