Skip to content

Software Engineer, Production Engineering

Builds and operates scalable infrastructure for compute, storage, messaging, and observability to support high-volume financial transactions. Partners with product teams on architecture, reliability, and AI enablement with 2+ years software engineering experience in distributed systems and cloud (AWS preferred).

168k – 325kNew York, NYSan Francisco, CADevOps / SREHybrid2+ YOE

About the role

What You'll Do

  • Build and operate critical infrastructure across Ramp's compute, storage, messaging, and observability stack — owning the systems that handle real financial transactions at scale.
  • Drive architectural change — not just flag problems. When you surface a reliability or scalability issue, you own the path forward: you propose the solution, find the owners across engineering, and stay in until it's resolved.
  • Partner with product teams at the design phase — reviewing architectures, embedding golden paths, and making it easy to build correctly the first time.
  • Build Ramp's next level of scale — you'll be a hands-on contributor to the most consequential infrastructure shift happening right now: our move to a cellular architecture, enabling Ramp to scale, reach international markets, operate in highly regulated and constrained environments (e.g. FedRAMP), and deliver on enterprise-grade SLAs.
  • Enable AI-native engineering — as Ramp builds increasingly AI-powered products, PE is the team that makes sure the platform can support them. You'll proactively partner with product teams on AI infrastructure patterns, define the golden paths that turn one-off solutions into reusable foundations, and stay ahead of emerging challenges before they become blockers.
  • Build developer tooling and self-service infrastructure — so that other teams can answer their own questions (cost, performance, reliability) without involving PE.
  • Participate in on-call rotation — and more importantly, use every incident as a signal to eliminate the root cause, not just resolve the symptom.
  • Lead across the company — PE doesn't just review designs or show up when called. We proactively initiate cross-team architectural reviews, and we take full ownership of company-wide reliability and scalability initiatives: identifying the problem, proposing the solution, aligning the stakeholders, and staying in until it's done.

What We Look For

Experience profile:

  • 2+ years of software engineering experience shipping high-quality architectures for critical systems
  • Strong software engineering fundamentals — you write clean, well-tested, production-ready code
  • Hands-on experience with distributed systems at production scale
  • Experience with at least one major cloud provider (AWS preferred)
  • Familiarity with observability practices (SLOs, error budgets, alerting, dashboards)
  • Track record of leading technical projects end-to-end, including cross-team coordination
  • Comfortable using AI tooling and coding agents as part of your everyday engineering workflow — we expect our engineers to leverage these tools to move faster and think bigger

Bonus (not required):

  • Experience with cellular or multi-tenant architecture patterns
  • Prior work on workflow orchestration systems (Temporal)
  • Contributions to developer experience or internal platform tooling
  • Experience in fintech, payments, or regulated industries (FedRAMP, SOC 2)

Skills

AWSDistributed SystemsObservabilitySLOsTerraformTemporalCellular ArchitectureContainer OrchestrationKubernetesCI/CD

Similar roles

DevOps / SRE jobs

Velocity Solutions Engineer

Technical advisor partnering with sales to demo CodeRabbit's AI code review platform, develop custom solutions, and support pre-sales PoVs. Requires 2+ years customer-facing experience, cloud knowledge (AWS/GCP/Azure), and command line comfort.

175k – 185kSan Francisco, CADevOps / SREHybrid2+ YOEAWSGCP

Software Engineer, Offboard Infrastructure

Software Engineer building infrastructure for data platforms, simulation, or technical services in autonomous driving technology. Requires 2+ years experience, strong Python/C++/Go skills, and expertise in distributed systems or related areas.

160k – 241kMountain View, CADevOps / SREOn-site2+ YOEGoC++

Infrastructure Engineer, Observe by Snowflake

Builds and operates scalable AWS infrastructure for Observe by Snowflake's observability platform, focusing on reliability, CI/CD pipelines, and developer tooling. Requires 2+ years in infrastructure/SRE/DevOps with Kubernetes, IaC tools, and cloud experience.

160k – 210kMenlo Park, CADevOps / SREOn-site2+ YOEGoAWS

Production Engineer

Production Engineer builds and operates large-scale systems, focusing on automation, monitoring, infrastructure management, and resilient operations. Requires 2+ years in SRE/DevOps, expertise in Linux, AWS, Kubernetes, and programming in Python or Golang.

155k – 185kMountain View, CADevOps / SREHybrid2+ YOEGoAWS

Software Engineer – Infrastructure Tooling

Design, build, and maintain developer infrastructure and tooling for Android Automotive OS, including Gerrit, static analysis, and build systems. Requires 2+ years of software engineering experience and automotive domain exposure.

149k – 164kSunnyvale, CADevOps / SREOn-site2+ YOEC++ROS