Skip to content

Sr. Software Engineer, DevOps

Builds and scales reliable infrastructure for SaaS applications using Kubernetes, Terraform, and GitHub CI/CD. Focuses on observability with Grafana/Prometheus, automation to reduce toil, production troubleshooting, and cross-team collaboration. Requires 5+ years Python experience.

180k – 220kSouth San Francisco, CADevOps / SREHybrid5+ YOE

About the role

What You'll Do

Infrastructure Management: Build, manage, and optimize infrastructure using Terraform, GitHub CI/CD, and Kubernetes.

Monitoring & Observability: Create visualizations and alerts that provide actionable insights using tools like Grafana, Prometheus/Mimir, OpenSearch, and Sentry.

Automation & Reliability: Identify manual or error-prone processes and replace them with automated, repeatable systems.

Production Troubleshooting: Diagnose and resolve production issues across application and infrastructure layers.

Documentation: Capture knowledge in runbooks, setup guides, and architecture diagrams to support operational maturity.

Collaboration: Partner with engineers across teams to drive adoption of DevOps and infrastructure best practices.

Scalability Planning: Help scale infrastructure and monitoring systems to meet growing demands.

Incident Participation: Participate in an on-call rotation and support incident response processes as needed.

Skills & Qualifications

Observability: Experience with metrics, logs, and traces using tools such as Grafana, Prometheus/Mimir, OpenSearch, Sentry, or similar.

Infrastructure as Code: Proficient with Terraform, Kubernetes, and containerization tools.

Programming Skills: 5+ years of experience with Python.

Linux Systems: Comfortable working with Linux-based environments and writing shell scripts.

Communication: Strong collaboration skills with a focus on asynchronous, written communication.

Documentation: Commitment to clear, comprehensive documentation and process standardization.

Initiative: Self-starter mindset with a proactive approach to solving operational challenges.

Version Control: Skilled in Git/GitHub-based workflows.

Nice to haves

  • Cloud Experience: AWS (preferred), GCP, or Azure cloud infrastructure management.
  • Networking Fundamentals: Familiarity with TCP/IP, DNS, routing, and load balancing concepts.
  • Security: Understanding of cloud and infrastructure security best practices.
  • Performance Tuning: Experience tuning application or infrastructure performance in production environments.

What We Offer

  • Flexible paid time off (PTO)
  • Expansive coverage for health, dental, and vision
  • Employer contribution to Health Savings Accounts (HSA)
  • Generous parental leave policy
  • Full employee coverage for life insurance
  • Home office stipend
  • Cell phone/internet reimbursement
  • Company-paid holidays
  • 401(K) plan

Compensation

Based on market data and other factors, the salary range for this position is $180,000-$220,000 + Equity. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

Skills

TerraformKubernetesPythonGrafanaPrometheusOpensearchSentryGitHubCI/CDLinuxAWSGit

Similar roles

DevOps / SRE jobs

Senior Software Engineer, Infrastructure

Senior Infrastructure Engineer responsible for building and operating platform primitives including Kubernetes, CI/CD, observability, and developer tooling at a high-growth AI and data platform company.

180k – 250kBoston, MADevOps / SREHybrid5+ YOEGoGCP

Senior Infrastructure Engineer

As a Senior Infrastructure Engineer, you will own and evolve the foundational systems powering Casca's AI-driven lending platform. You will build and maintain infrastructure, CI/CD pipelines, cloud infrastructure, deployment automation, and observability systems, ensuring the platform is secure, compliant, and highly available.

180k – 215kSan Francisco, CADevOps / SREOn-site5+ YOEGoAWS

Senior Release Engineer

As a Senior Release Engineer, you will design, maintain, and improve deployment processes, build and scale CI/CD systems, and manage production environments. You will partner with engineering teams to eliminate bottlenecks and enhance platform resilience.

180k – 200kUnited StatesDevOps / SRERemote4+ YOEHelmLinux

Senior DevOps Engineer (Infrastructure & MLOps)

Designs and manages scalable AWS infrastructure, builds CI/CD pipelines for web and ML applications, implements MLOps practices, and ensures reliability through monitoring. Requires 5+ years DevOps experience, AWS expertise, containerization, and Python scripting.

180k – 225kUnited StatesDevOps / SRERemote5+ YOEAWSECS

Senior DevEx Engineer

Steward Replit's TypeScript monorepo, Go services, and developer tooling to accelerate engineering velocity and reduce friction. Partner with AI team to enhance agent-generated code, requiring senior-level expertise in build systems and large-scale codebases.

180k – 250kFoster City, CADevOps / SREHybridGoNix