Site Reliability Engineer II (Remote, US)
115k – 173kUnited StatesDevOps / SRERemote2+ YOE
Summary
DevOps/SRE II building and maintaining infrastructure for an insurance platform using GCP, Kubernetes, and Terraform. Focus on automation, monitoring, incident response, and security best practices.
About the role
Key Responsibilities
- Build internal tooling to help other engineers and the rest of the company understand and operate our system
- Design and implement security best practices for our team and infrastructure
- Reduce toil through automation, including building and maintaining CI/CD infrastructure
- Build infrastructure as code using declarative provisioning tools
- Develop high signal-to-noise ratio monitoring and alerting policies and technology to help us meet our SLOs
- Lead incident response and postmortems
- Contribute to important architectural and operational decisions like microservices vs. monoliths, deployment techniques, technologies, policies, etc.
Stack
- Backend: Go & PostgreSQL
- Frontend: Browser-based, VueJS, Webpack, Nuxt, Tailwind
- Research/Data Science: R, ArcGIS, Jupyter Notebooks, Python
- Data: GCP GCS, BigQuery, Composer/Airflow, Cloud Functions, Postgres, SQL, Python, Go, Aiven Debezium and Kafka, Fivetran
- Infrastructure: Google Cloud (Cloud Run, Kubernetes, Pub/Sub, BigQuery, CloudSQL), managed with Terraform. GitHub for code hosting, DataDog for monitoring, CircleCI for CI/CD pipelines.
Requirements
- 2+ years of professional/production experience developing and using infrastructure automation tools and techniques
- Proven track record of creating improvements in business-critical systems around stability, performance, and scalability
- Demonstrated ability to deliver complete systems from start to finish in a reasonable time frame
- Understands the consequences of running software in production and are willing to share your knowledge with the rest of the team
- Ability to explain complex technical challenges to non-technical audiences
- Strong scripting skills in one or more of the following: Python, Go
- Experience working with Infrastructure as Code (IaC) tooling, preferably Terraform
Compensation & Benefits
- Budgeted Salary Range: $115,200—$129,600 USD
- Full Salary Range: $115,200—$172,800 USD
- Remote-First Culture
- Competitive Salary & Equity
- Comprehensive Medical, Dental, and Vision Plan Offerings
- Life and disability coverage including voluntary options
- Parental Leave - up to 8 weeks (320 hours) of paid parental leave
- 401K Company Contribution - Openly contributes 3% of the employee's gross income
- Work-from-home stipend - $1,500 allowance
- Annual Professional Development Fund: $2,000 per employee
- Be Well Program - $50 per month
- Paid Volunteer Service Hours
- Referral Program and Reward
Skills
TerraformKubernetesGoogle CloudPythonGoCI/CDInfrastructure as CodeDataDogCircleCIPostgreSQL
Similar roles at this salary range
All DevOps / SRE jobs →Software Engineer, Cloud Infrastructure
Build and operate AWS cloud and LLM infrastructure powering retrieval-augmented generation, vector search, and ML pipelines for aviation AI systems. Requires strong AWS depth, Python data pipelines, and production LLM experience.
135k – 260kSan Carlos, CADevOps / SREHybrid4+ YOES3AWS
Senior Network Engineer
Design, deploy, and operate enterprise network infrastructure for corporate facilities and hybrid cloud environments with zero-trust architecture and compliance requirements. Requires 5+ years enterprise networking experience and ability to obtain TS/SCI clearance.
133k – 215kLos Angeles, CA +1DevOps / SREOn-site5+ YOEAWSVLAN