Skip to content

Senior/Staff Virtualization Engineer

Builds custom compute environments including bare metal/virtual machines with GPU passthrough, dedicated Kubernetes clusters, and overlay networking for customer AI workloads. Requires 5+ years in Linux virtualization, strong networking, Kubernetes on bare metal, and NVIDIA GPU expertise.

180k – 250kSan Francisco, CADevOps / SREOnsite5+ YOE

About the role

Key Responsibilities

  • Build and deliver custom environments with excellent GPU performance for customer workloads
  • Leverage AI to an extreme level to automate provisioning, alerting and recovery
  • Provision and configure dedicated Kubernetes clusters tailored to customer requirements
  • Design and implement overlay networking (VLAN, VXLAN) and routing configurations (ECMP, BGP) and tunnels (strongSwan, IPSEC) for tenant isolation and performance
  • Build and maintain Linux images
  • Set up network monitoring and diagnostics for customer environments
  • Automate the end-to-end lifecycle of customer compute environments: creation, configuration, validation, and teardown

Requirements

  • 5+ years experience with Linux virtualization: KVM/QEMU, libvirt, VFIO device passthrough, hugepages, NUMA, CPU pinning
  • Strong networking fundamentals: VXLAN, VLAN, ECMP, BGP, ARP, and the ability to debug packet-level issues (tcpdump, Wireshark)
  • Production experience building and operating Kubernetes clusters on bare metal (MetalLB)
  • Proficiency with Linux image building and OS provisioning (kickstart, cloud-init, PXE/iPXE)
  • Proficiency in Python, Bash, Ansible and Terraform
  • Deep experience with NVIDIA GPUs: drivers, MIG, container runtimes (nvidia-container-toolkit), InfiniBand, RDMA/RoCEv2 and GPUDirect for high-performance AI networking
  • Excellent communication and ability to drive technical decisions across teams
  • Self-starter who executes quickly, takes ownership, and constantly seeks improvement

Nice to Have

  • Experience with SR-IOV, DPDK, or other high-performance networking technologies
  • Experience with shared network storage (Ceph, Lustre, Weka)
  • Experience with network automation tools (Netbox, Nautobot, Nornir)

Compensation

$180,000-250,000 plus equity + benefits

Skills

KubernetesKvm/QemuLibvirtVfioNvidia GpusVxlanVlanBGPEcmpPythonBashAnsibleTerraformTcpdumpWireshark

Similar roles

DevOps / SRE jobs

Staff Software Engineer, Cloud FinOps

Staff-level engineer driving company-wide cloud cost optimization and FinOps initiatives across engineering teams. Requires 5+ years infrastructure experience and 2+ years FinOps/cloud cost management.

180k – 240kUnited StatesDevOps / SRERemote5+ YOEAWSJava

Staff Engineer, AI Productivity

Staff-level engineer building infrastructure, tooling, and documentation to make AI coding agents dramatically more productive across the codebase. Owns agentic dev environments, MCP integrations, and agent context.

180k – 400kUnited StatesDevOps / SRERemote7+ YOEGoDevin

Staff Infrastructure Engineer

Build infrastructure, observability, and developer tooling for a realtime AI platform serving 911 centers. Requires 6+ years infrastructure/platform/backend experience and comfort across the full stack.

180k – 240kSeattle, WADevOps / SREOn-site6+ YOELoggingClickHouse

Staff Software Engineer, AI Developer Tools

Staff-level engineer architecting AI-native developer tools and infrastructure to accelerate engineering velocity across Gusto. Requires 8+ years experience building production AI systems with deep expertise in LLMs, RAG, and multi-agent workflows.

180k – 245kDenver, CO +3DevOps / SREHybrid8+ YOERAGLLMs

Staff Infrastructure Engineer

Staff Infrastructure Engineer building and operating secure cloud-native and edge platforms for military collaboration software. Requires 5+ years production infrastructure experience, deep Kubernetes expertise, and ability to obtain SECRET clearance.

180k – 235kUnited StatesDevOps / SRERemote5+ YOEGoAWS