Senior Site Reliability Engineer
Builds and scales infrastructure platforms including compute, storage, and networking on AWS with Kubernetes. Designs monitoring/alerting systems, collaborates on scalable app designs, and improves systems for global expansion using Python, Terraform, and observability tools.
Responsibilities
- Build and scale internal platform offerings (compute, storage, networking) to ensure reliability and performance of applications.
- Design and implement monitoring, alerting, and incident response systems.
- Collaborate with application software engineers to guide scalable designs.
- Act as an agent of change to incrementally improve systems for global expansion.
Requirements
- Extensive experience with cloud services: AWS (EC2, S3, RDS, Lambda), Google Cloud Platform, or Azure.
- Proficient in Infrastructure as Code: Terraform, Ansible, or CloudFormation.
- Kubernetes or other container orchestration.
- Networking: CNI, network policies, proxies, service mesh.
- Monitoring/observability: Prometheus, Grafana, ELK Stack, Datadog.
- Proficiency in Python for efficient, scalable code.
- API services: RESTful/GraphQL design, deployment, maintenance.
- AI fluency: using AI tools daily, building agents to reduce toil.
- Experience with CI/CD best practices appreciated.
Compensation
- Seattle, WA: $181,688 - $213,750
- Santa Clara, CA or San Francisco, CA: $191,250 - $225,000
- Includes equity and exceptional benefits.
Senior Site Reliability Engineer - Government Cloud
Build and operate AWS GovCloud infrastructure for federal customers, owning IaC, container pipelines, compliance documentation, and operational tooling. Requires 5+ years AWS experience and FedRAMP familiarity.
AI Enablement Engineer
Design and build AI automations, integrations, and agents that connect company systems and eliminate manual work. Requires 4+ years building production automations and LLMs, strong API skills, and experience with RAG, agents, and observability.
AI Enablement Engineer
Design and build AI automations, integrations, and agents that connect company systems and eliminate manual work. Requires 4+ years building production automations and LLMs, strong API skills, and experience with RAG, agents, and observability.
AI Enablement Engineer
Design and build AI automations, integrations, and agents that connect company systems and eliminate manual work. Requires 4+ years building production automations and LLMs, strong API skills, and experience with RAG, agents, and observability.
AI Enablement Engineer
Design and build AI automations, integrations, and agents that connect company systems and eliminate manual work. Requires 4+ years building production automations and LLMs, strong API skills, and experience with RAG, agents, and observability.