Skip to content

Platform Engineer

Owns and builds core parts of the Forge platform end-to-end, including IDE, compiler, runtime, and infra for AI-powered English programming language. Requires staff+ engineer experience building products 0-1 with deep technical craft.

175k – 225kSan Francisco, CANew York, NYDevOps / SREOnsite

About the role

Responsibilities

  • Own and build core parts of the Forge platform end-to-end.
  • Design powerful and elegant abstractions that scale.

Requirements

  • Built multiple products 0 → 1 or 1 → n / staff+ engineer experience.
  • Deep care for your craft - explain how your tools work under the hood and have opinions about them.
  • Ability to run independently and build what you need on your own.
  • Someone who can vouch that you're the best engineer they've worked with (or something you've built that speaks for itself).

Locations

  • In-person only in San Francisco, CA or New York City, NY.

Skills

CompilersIdesRuntimesInfrastructureLLMsImmutable SystemsAnomaly DetectionProgramming Languages

Similar roles

DevOps / SRE jobs

Production Engineer, IaaS

Own observability, API surface, and control plane for a hyperscale AI compute fleet. Build production-grade data pipelines, stateful APIs, and Kubernetes infrastructure that other teams depend on.

175k – 300kSan Francisco, CA +3DevOps / SREOn-site5+ YOEGoPython

Production Engineer, Compute

Own end-to-end health, repair automation, and qualification of a hyperscale GPU/TPU compute fleet. Build metrics pipelines, firmware tooling, and self-healing repair workflows across Kubernetes and bare metal.

175k – 300kSan Francisco, CA +3DevOps / SREHybrid5+ YOEGoBmc

SWE - Backend Infrastructure Engineer

Builds and scales core infrastructure including ML training/serving, Kubernetes clusters, and low-latency voice/audio pipelines. Requires 3+ years in infrastructure/ML systems, hands-on reliability engineering, and Kubernetes expertise.

175k – 280kSan Francisco, CA +2DevOps / SREOn-site3+ YOEAPIsSeldon

Site Reliability Engineer

Builds and operates reliable, scalable AI infrastructure including observability, SLOs, incident response, automation, and performance tuning for ultra-low-latency serverless compute. Requires 3+ years SRE/DevOps experience with cloud, Kubernetes, programming (Go/Rust/Python), and observability tools.

175k – 250kSan Francisco, CADevOps / SREOn-site3+ YOEGoAWS

Infrastructure Engineer / SRE

Designs and operates large-scale infrastructure for secure, scalable AI agent runtimes, untrusted code execution, and multi-cloud deployments. Requires strong expertise in distributed systems, containers, Kubernetes, and security.

175k – 275kNew York, NYDevOps / SREOn-siteVpcKeda