Senior Software Engineer, Infrastructure
Senior Infrastructure Engineer responsible for building and operating platform primitives including Kubernetes, CI/CD, observability, and developer tooling at a high-growth AI and data platform company.
Builds and scales reliable infrastructure for SaaS applications using Kubernetes, Terraform, and GitHub CI/CD. Focuses on observability with Grafana/Prometheus, automation to reduce toil, production troubleshooting, and cross-team collaboration. Requires 5+ years Python experience.
Infrastructure Management: Build, manage, and optimize infrastructure using Terraform, GitHub CI/CD, and Kubernetes.
Monitoring & Observability: Create visualizations and alerts that provide actionable insights using tools like Grafana, Prometheus/Mimir, OpenSearch, and Sentry.
Automation & Reliability: Identify manual or error-prone processes and replace them with automated, repeatable systems.
Production Troubleshooting: Diagnose and resolve production issues across application and infrastructure layers.
Documentation: Capture knowledge in runbooks, setup guides, and architecture diagrams to support operational maturity.
Collaboration: Partner with engineers across teams to drive adoption of DevOps and infrastructure best practices.
Scalability Planning: Help scale infrastructure and monitoring systems to meet growing demands.
Incident Participation: Participate in an on-call rotation and support incident response processes as needed.
Observability: Experience with metrics, logs, and traces using tools such as Grafana, Prometheus/Mimir, OpenSearch, Sentry, or similar.
Infrastructure as Code: Proficient with Terraform, Kubernetes, and containerization tools.
Programming Skills: 5+ years of experience with Python.
Linux Systems: Comfortable working with Linux-based environments and writing shell scripts.
Communication: Strong collaboration skills with a focus on asynchronous, written communication.
Documentation: Commitment to clear, comprehensive documentation and process standardization.
Initiative: Self-starter mindset with a proactive approach to solving operational challenges.
Version Control: Skilled in Git/GitHub-based workflows.
Nice to haves
Based on market data and other factors, the salary range for this position is $180,000-$220,000 + Equity. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.
Senior Infrastructure Engineer responsible for building and operating platform primitives including Kubernetes, CI/CD, observability, and developer tooling at a high-growth AI and data platform company.
As a Senior Infrastructure Engineer, you will own and evolve the foundational systems powering Casca's AI-driven lending platform. You will build and maintain infrastructure, CI/CD pipelines, cloud infrastructure, deployment automation, and observability systems, ensuring the platform is secure, compliant, and highly available.
As a Senior Release Engineer, you will design, maintain, and improve deployment processes, build and scale CI/CD systems, and manage production environments. You will partner with engineering teams to eliminate bottlenecks and enhance platform resilience.
Designs and manages scalable AWS infrastructure, builds CI/CD pipelines for web and ML applications, implements MLOps practices, and ensures reliability through monitoring. Requires 5+ years DevOps experience, AWS expertise, containerization, and Python scripting.