Skip to content

Software Engineer, Compute Platform

Build and optimize Replit's cloud infrastructure for scalable application deployment, focusing on reliability, cost efficiency, and global performance using distributed systems expertise.

130k – 290kFoster City, CADevOps / SREHybrid

About the role

Responsibilities

  • Expand Replit's cloud infrastructure offerings: Launch new cloud products to be used by Replit Agent to build complex apps. Collaborate with cross-functional teams to design and implement these features.
  • Enhance reliability and scalability: Identify bottlenecks, optimize critical paths, and implement robust monitoring and alerting systems. Work closely with the SRE team to ensure high availability.
  • Improve utilization of cloud infrastructure: Analyze infrastructure costs and identify opportunities for optimization. Implement strategies to reduce cloud expenses without compromising performance.

Required Skills and Experience

  • Distributed systems: Track record of working with platform-as-a-service, distributed storage, or information retrieval systems. Experience in designing scalable architectures and optimizing systems for latency or cost.
  • Problem-solving mindset: Ability to approach complex challenges pragmatically and devise effective solutions.
  • Self-directed and autonomous: Able to work independently, set priorities, and drive projects forward.
  • Versatility and flexibility: Able to wear multiple hats and tackle a wide range of challenges.
  • Continuous learning and adaptability: Passionate about staying up-to-date with industry trends.

Nice to Have

  • Experience working on cloud infrastructure or platform products, particularly in the areas of application deployment, serverless computing, or container orchestration.
  • Familiarity with Google Cloud Platform (GCP) services and tools, such as GCE, GKE, Cloud Run, or Cloud Storage.
  • Contributions to open-source projects related to cloud technologies, deployment frameworks, or developer tools.

Tools + Tech Stack

  • Golang
  • Rust

Benefits

  • Competitive Salary & Equity
  • 401(k) Program with a 4% match
  • Health, Dental, Vision and Life Insurance
  • Paid Parental, Medical, Caregiver Leave
  • Flexible Time Off (FTO) + Holidays

Skills

Distributed SystemsGoRustGCPKubernetesCloud RunGceMonitoringAlertingLinux

Similar roles

DevOps / SRE jobs

Linux Systems Engineer (USA)

Hands-on Linux Systems Engineer builds and maintains bare-metal servers, manages storage like ZFS, automates with Ansible and Bash, and ensures production reliability. Requires 3+ years Linux experience, physical server management, and on-call rotation with data center travel.

130k – 150kStamford, CT +1DevOps / SREOn-site3+ YOEZfsBash

Software Engineer - Developer Infrastructure

Develop and maintain developer tooling and infrastructure for Nominal's platform, scaling across air-gapped, cloud, and on-prem environments. Requires 4+ years experience with cloud services, Docker, Kubernetes, CI/CD, and ability to mentor engineers.

130k – 230kNew York, NY +2DevOps / SREOn-site4+ YOEAWSGCP

Vault Application Engineer/Administrator (Hashicorp)

Designs, deploys, and manages HashiCorp Vault clusters for secure secret management in on-premises and cloud (AWS/GCP) hybrid environments with Kubernetes integration. Requires 3+ years experience, zero trust principles, IaC tools like Terraform, and automation scripting.

130k – 180kBethesda, MDDevOps / SREHybrid3+ YOEAWSGCP

Site Reliability Engineer

Owns production reliability for critical systems, builds SRE function from scratch, introduces modern practices like SLIs/SLOs and error budgets. Requires 5+ years SRE experience with large-scale distributed systems.

130k – 500kSan Francisco, CADevOps / SREOn-site5+ YOEAWSIac

Infrastructure Engineer

Builds and scales highly available infrastructure using AWS, Terraform, and Docker to support rapid growth and AI workloads. Collaborates with product and research teams on architectures, CI/CD, monitoring, and performance optimization.

130k – 500kSan Francisco, CADevOps / SREOn-siteGoAWS