DevSecOps - Site Reliability Engineer (SRE) / US Gov
Designs, automates, and operates reliable cloud systems and deployment pipelines for US Government mission-critical workloads in AWS GovCloud/C2E. Requires 3+ years SRE/DevOps experience, Kubernetes, IaC, security clearance, and US citizenship.
Responsibilities
- Design, automate, deploy, and operate highly reliable cloud systems supporting mission-critical workloads for U.S. Government customers.
- Ensure reliability and operability of platform in production, making systems observable, fault-tolerant, and requiring minimal manual intervention.
- Build and evolve automated deployment pipelines, hardened runtime environments, and repeatable infrastructure patterns for secure, scalable operations in regulated environments.
- Support and improve deployments to air-gapped networks.
- Define and implement best practices for availability, latency, incident response, and service-level objectives (SLOs).
- Participate in incident response and 24/7 on-call rotation, eliminating toil through automation.
Technical Skills
- Strong experience with Kubernetes and containerized workloads in production environments.
- Hands-on experience operating clusters in AWS EKS, Rancher, or similar platforms.
- Experience supporting GovCloud, IL-enclave, or C2E environments.
- Deep experience with CI/CD systems and deployment automation (GitLab preferred).
- Proficiency in Python and Infrastructure-as-Code tools (Terraform or similar).
- Experience with observability platforms (Grafana LGTM stack, Datadog, or equivalent).
- Strong understanding of distributed systems, APIs, databases, caching, and event-driven architectures.
- Solid networking fundamentals (VPCs, VPNs, load balancers, TLS, service connectivity).
- Experience with Linux/Unix systems.
- Familiarity with cloud security best practices, enclave boundaries, and secure system design.
- Experience with identity and access management (AWS IAM, Auth0, Keycloak, ICAM patterns).
- Strong Git fundamentals and experience supporting deployments across multiple classification levels.
Qualifications
- Bachelor’s degree in Computer Science or related field.
- 3+ years of professional experience as an SRE, DevOps, reliability, infrastructure, or platform engineer.
- Active U.S. Security Clearance (Secret or higher required; TS/SCI preferred); U.S. Citizenship required.
- Experience working toward ATO/authorization in federal, DoD, or IC environments preferred.
- Experience supporting deployments in GovCloud, C2S/C2E, or IL-enclave environments highly desirable.
AI Enablement Engineer
Design and build AI automations, integrations, and agents that connect company systems and eliminate manual work. Requires 4+ years building production automations and LLMs, strong API skills, and experience with RAG, agents, and observability.
AI Enablement Engineer
Design and build AI automations, integrations, and agents that connect company systems and eliminate manual work. Requires 4+ years building production automations and LLMs, strong API skills, and experience with RAG, agents, and observability.
AI Enablement Engineer
Design and build AI automations, integrations, and agents that connect company systems and eliminate manual work. Requires 4+ years building production automations and LLMs, strong API skills, and experience with RAG, agents, and observability.
AI Enablement Engineer
Design and build AI automations, integrations, and agents that connect company systems and eliminate manual work. Requires 4+ years building production automations and LLMs, strong API skills, and experience with RAG, agents, and observability.
Senior Manager, DevOps
Lead DevOps strategy and team to improve engineering velocity, platform reliability, and operational efficiency across multi-cloud (AWS/GCP) environments. Drive IaC, Kubernetes delivery, observability, AI-powered tooling adoption, and cross-functional collaboration.