Linux Systems Engineer (USA)

130k – 150k | Stamford, CT | New York, NY | DevOps / SRE | Onsite | 3+ YOE

Summary

Hands-on Linux Systems Engineer role: build and maintain bare-metal servers, manage storage such as ZFS, automate with Ansible and Bash, and keep production systems reliable. Requires 3+ years of Linux experience, experience managing physical servers, and participation in an on-call rotation with periodic data center travel.

About the role

Responsibilities

  • Build, provision, and maintain bare-metal Linux servers, including OS installation, configuration, and lifecycle management
  • Own server infrastructure from hardware through operating system and core services, ensuring stability and performance
  • Configure and manage storage systems, including ZFS and enterprise storage platforms (e.g., NetApp, Dell, or similar)
  • Monitor system health and performance; troubleshoot issues and implement durable, long-term fixes
  • Develop and maintain automation using Ansible and Bash to standardize provisioning, configuration, and operations
  • Perform system patching, upgrades, and capacity planning across a growing server fleet
  • Participate in incident response, root cause analysis, and continuous improvement of system reliability
  • Collaborate with engineering and infrastructure teams to support application performance on Linux systems
  • Contribute to documentation, runbooks, and operational best practices
  • Support data center operations as needed, including hardware troubleshooting, racking, cabling, and server replacements
  • Travel to data centers periodically for maintenance, expansions, and issue resolution

Requirements

  • Bachelor’s degree in a technical field or equivalent practical experience
  • 3+ years of experience managing Linux systems in a production environment
  • Strong expertise in Linux at the system level, including OS installation, configuration, and troubleshooting on bare metal
  • Proven experience provisioning and managing physical servers at scale (100+ servers preferred)
  • Hands-on experience with storage systems, including ZFS; familiarity with enterprise storage vendors (NetApp, Dell, or similar) is strongly preferred
  • Proficiency with Ansible for configuration management and automation (required)
  • Strong Bash scripting skills; Python is a plus but not required
  • Solid understanding of system performance, resource management, and reliability in production environments
  • Experience working in environments where uptime, precision, and operational discipline are critical
  • Willingness to perform occasional hands-on hardware work and travel to data centers as needed
  • Ability to participate in an on-call rotation

Compensation and Benefits

  • Base salary: $130,000 - $150,000, determined by education and experience
  • Performance-based bonus
  • PPO health, dental, and vision insurance fully covered for employees and dependents
  • Pre-tax commuter benefits
  • Weekly company-sponsored meals

Skills

Linux, Ansible, Bash, ZFS, NetApp, Dell, Python