Skip to content

Network Reliability Engineer

United StatesDevOps / SREHybrid3+ YOE
Summary

Network Reliability Engineer responsible for operating and engineering Cloudflare's core data center network, building automation tools, and leveraging LLMs for deployment and troubleshooting. Requires 3+ years of network/SRE experience, strong Go/Python skills, and Linux networking expertise.

About the role

Responsibilities

  • Technical operation and engineering of Cloudflare's core data center network
  • Planning, installation, and management of network hardware and software
  • Day-to-day operations of the network supporting internal needs (databases, high-volume logging, internal application clusters)
  • Build tools to automate operational tasks and streamline deployment processes
  • Provide a platform for other engineering teams to build upon
  • Bring ideas from design through to production
  • Leverage LLMs to build agentic deployment and troubleshooting tools, automate configurations (SaltStack + Temporal), parse complex log files, and streamline documentation

Requirements

  • 3 years of relevant Network/Site Reliability Engineering experience
  • BA/BS in Computer Science or equivalent experience
  • Solid foundation on configuration management frameworks: Saltstack, Ansible, Chef
  • Experience with NX-OS, JUNOS, EOS, Cumulus, or Sonic Network Operating Systems
  • Solid Linux systems administration experience
  • Linux networking experience (iproute2, Traffic Control, Devlink)
  • Strong software development skills in Go and Python

Nice-to-Haves

  • Deep knowledge of BGP and other routing protocols
  • Workflow Management (AirFlow, Temporal)
  • Open Source Routing Daemons (FRR, Bird, GoBGP)
  • Experience with bare metal switching
  • Experience with network programming in C, C++, or Rust
  • Experience with the Linux kernel and Linux software packaging
  • Strong tooling and automations development experience
  • Time series databases (Prometheus, Grafana, Thanos, Clickhouse)
  • Other Tools: Kubernetes, Docker, Prometheus, Consul
Skills
GoPythonSaltStackAnsibleChefNX-OSJUNOSEOSCumulusSonicLinuxBGPKubernetesDockerPrometheus