Skip to content

Senior Software Engineer, Infrastructure

Designs and operates scalable AWS-based cloud infrastructure including Kubernetes, serverless, and data stores. Collaborates on platform vision, builds productivity tools, manages monitoring/on-call, and requires 6+ years in cloud-native environments with IaC expertise.

160k – 195kUnited StatesDevOps / SRERemote6+ YOE

About the role

What You'll Be Doing

  • Architect and evolve our cloud infrastructure (primarily on AWS) across container orchestration (Kubernetes, Elastic Container Service), serverless (e.g., Lambda), virtual machines (e.g., EC2), and data stores to support current and future products.
  • Collaborate with engineering leadership, machine learning, data science, and product partners to help shape our platform vision.
  • Develop and maintain tooling that improves engineering productivity and developer experience.
  • Promote sustainable incident response and lead blameless post-incident reviews.
  • Manage network and systems monitoring, design alert strategies, and participate in an equitable on‑call rotation.

Who We're Looking For

  • A data‑informed approach and a track record of effective problem solving.
  • Bring 6+ years of hands-on infrastructure / platform development experience (or equivalent practical experience) in modern, cloud-native environments, with a track record of owning critical systems in production.
  • Extensive experience operating Kubernetes in production, including familiarity with cluster lifecycle management, scaling, and container security.
  • Proficiency in one or more languages—Python (preferred) and/or Bash—and familiarity with Infrastructure as Code (IaC) tools such as CDK, Terraform or Pulumi.
  • Strong familiarity with AWS and/or GCP services.
  • Networking fundamentals and comfort working in command‑line Linux environments.
  • Clear, respectful communication and collaboration skills, including organizing work, delegating when appropriate, and giving/receiving feedback.
  • Experience designing complex systems and mentoring others in technical design.
  • Ability to scope project-level work, execute independently, and bring projects to completion while collaborating with teammates.

Nice to Haves

  • Proficiency in deep Linux troubleshooting, including debugging kernel and driver issues.
  • Experience in regulated environments (e.g., HIPAA) or at early‑stage startups.
  • Background in healthcare, security, or machine learning.
  • Familiarity with HL7 or radiology workflows.
  • Experience with OpenTelemetry or similar tracing services.
  • Familiarity with Grafana or similar logging services.
  • Experience with Spark (e.g., EMR, Dataproc, HDInsight) and Hadoop‑related technologies.

Compensation & Benefits

For US-Based Full-Time Roles, Rad AI offers a variety of benefits, including:

  • Comprehensive Medical, Dental, Vision & Life insurance
  • HSA (with employer match), FSA, & DCFSA
  • 401(k)
  • 11 Paid Company Holidays
  • Flexible PTO policy
  • Annual company-wide offsite
  • Periodic team offsites
  • Annual equipment stipend

Skills

KubernetesAWSPythonTerraformCdkPulumiLinuxGCPBashOpenTelemetryGrafanaSpark

Similar roles

DevOps / SRE jobs

Lead DevOps Engineer

Lead a team of DevOps engineers to design, implement, and maintain CI/CD pipelines, cloud infrastructure, monitoring, and security best practices. Requires 7+ years of DevOps experience including 2 years in leadership.

160k – 215kNew York, NYDevOps / SREOn-site7+ YOES3AWS

Senior Central Cloud Infrastructure Engineer

Senior engineer responsible for architecting and maintaining scalable AWS cloud infrastructure, leading modernization initiatives, and ensuring PCI/SOC2 compliance. Requires 5+ years experience with Terraform, Kubernetes, observability, and production cloud systems.

160k – 200kNew York, NYDevOps / SREOn-site5+ YOEAWSEKS

Senior Cloud Engineer

Architect and scale secure Azure cloud infrastructure supporting spacecraft control systems and autonomous satellite operations. Requires 5+ years in cloud/SRE/DevOps, deep Azure expertise, IAM proficiency, and compliance framework experience.

160k – 213kIrvine, CADevOps / SREOn-site5+ YOEGoAks

Senior Developer Experience Engineer

Senior Platform Engineer focused on Developer Experience building tools, automation, CI/CD systems, and AI tooling to improve developer productivity and workflows. Requires 7+ years cloud experience, containerization, and proficiency in Ruby, Go, or Python.

160k – 190kUnited StatesDevOps / SRERemote7+ YOEGoRuby

Senior Software Engineer - SRE

As a Senior Site Reliability Engineer, you will own the end-to-end reliability and scalability of AWS infrastructure and Kubernetes platforms. This role involves designing, operating, and continuously improving production systems with a strong focus on automation and observability.

160k – 180kCarson City, NV +3DevOps / SREHybrid5+ YOEGoAWS