Linux Systems Engineer (USA)

130k – 150kStamford, CTNew York, NYDevOps / SREOnsite3+ YOEApr 28

Summary

Hands-on Linux Systems Engineer builds and maintains bare-metal servers, manages storage like ZFS, automates with Ansible and Bash, and ensures production reliability. Requires 3+ years Linux experience, physical server management, and on-call rotation with data center travel.

About the role

Responsibilities

Build, provision, and maintain bare-metal Linux servers, including OS installation, configuration, and lifecycle management
Own server infrastructure from hardware through operating system and core services, ensuring stability and performance
Configure and manage storage systems, including ZFS and enterprise storage platforms (e.g., NetApp, Dell, or similar)
Monitor system health and performance; troubleshoot issues and implement durable, long-term fixes
Develop and maintain automation using Ansible and Bash to standardize provisioning, configuration, and operations
Perform system patching, upgrades, and capacity planning across a growing server fleet
Participate in incident response, root cause analysis, and continuous improvement of system reliability
Collaborate with engineering and infrastructure teams to support application performance on Linux systems
Contribute to documentation, runbooks, and operational best practices
Support data center operations as needed, including hardware troubleshooting, racking, cabling, and server replacements
Travel to data centers periodically for maintenance, expansions, and issue resolution

Requirements

Bachelor’s degree in a technical field or equivalent practical experience
3+ years of experience managing Linux systems in a production environment
Strong expertise in Linux at the system level, including OS installation, configuration, and troubleshooting on bare metal
Proven experience provisioning and managing physical servers at scale (100+ servers preferred)
Hands-on experience with storage systems, including ZFS; familiarity with enterprise storage vendors (NetApp, Dell, or similar) is strongly preferred
Proficiency with Ansible for configuration management and automation (required)
Strong Bash scripting skills; Python is a plus but not required
Solid understanding of system performance, resource management, and reliability in production environments
Experience working in environments where uptime, precision, and operational discipline are critical
Willingness to perform occasional hands-on hardware work and travel to data centers as needed
Ability to participate in an on-call rotation

Compensation and Benefits

Base salary: $130,000 - $150,000, determined by education and experience
Performance-based bonus
PPO health, dental, and vision insurance fully covered for employees and dependents
Pre-tax commuter benefits
Weekly company-sponsored meals

Skills

LinuxAnsibleBashZFSNetAppDellPython

Similar roles at this salary range

All DevOps / SRE jobs →

Northwood Space

Jun 19

Senior Network Engineer

Design, deploy, and operate enterprise network infrastructure for corporate facilities and hybrid cloud environments with zero-trust architecture and compliance requirements. Requires 5+ years enterprise networking experience and ability to obtain TS/SCI clearance.

133k – 215kLos Angeles, CA +1DevOps / SREOn-site5+ YOEAWSVLAN

Jun 18

Site Reliability Engineer II

Operate and scale a cloud-native CTV advertising platform on AWS and Kubernetes. Focus on reliability, GitOps workflows, infrastructure automation, observability, and incident response.

114k – 235kSan Francisco, CADevOps / SRERemote4+ YOEAWSEKS

Forterra

Jun 18

Senior Software Engineer-Internal Tools

Senior Software Engineer on the DevOps and Tooling team building internal tools. Requires 3-5+ years experience, Rust or strong systems background, TypeScript/React, Linux, Docker, and CI/CD.

125k – 140kArlington, VA +1DevOps / SREOn-site5+ YOEAWSRust

Beacon AI

Jun 17

Software Engineer, Cloud Infrastructure

Build and operate AWS cloud infrastructure and LLM platform services including RAG pipelines, vector search, model endpoints, and data ingestion for an aviation AI company.

135k – 260kSan Carlos, CADevOps / SREHybrid4+ YOEAWSGlue

MongoDB

Jun 17

Site Reliability Engineer

Senior or Staff Site Reliability Engineer focused on continuous delivery infrastructure using Argo Workflows, ArgoCD, and Kubernetes. Owns deployment tooling, onboarding flows, and participates in 24/7 on-call. Requires 6+ years building and operating distributed systems.

127k – 249kBoston, MA +6DevOps / SREHybrid6+ YOEGoAWS

Apply