Software Engineer - Member of Technical Staff (Consumption Team)

Build and maintain Python backend APIs, orchestration for a GPU marketplace, billing systems, and developer tools across the full stack of an AI infrastructure platform. Requires strong backend experience with distributed systems, databases, and end-to-end feature ownership in a startup environment.

170k – 230k | Palo Alto, CA | San Francisco, CA | Backend Engineering | Hybrid

About the role

Core Responsibilities

Backend & API

  • Build and maintain backend Python APIs powering Mithril's reservation, bidding, billing, and usage surfaces — including usage visualization and the billing platform.
  • Write and maintain database migrations, design relational schemas for complex multi-entity models (orders, allocations, grants, quotas), and optimize queries for performance.
  • Own features end-to-end: from design doc through backend API, database migration, feature flag rollout, and monitoring.

Platform & Marketplace

  • Design and implement orchestration primitives for flexible reservations: pause/return credits, reservation extensions, capacity algorithm updates, and the auction model connecting spot and reserved capacity.
  • Contribute to operational maturity — quota systems, financial controls, fraud prevention, and inventory management automation — so Mithril can scale supply sources without scaling headcount linearly.

Supply Integrations

  • Contribute to supply-side integrations, bringing GCP, Nebius, and OCI resources under Mithril management through the Mithril API.
  • Build tooling to give operators visibility into managed capacity and surface supply constraints early.

Consumption & Developer Tooling

  • Build consumption tooling to help developers make better decisions: reservation calculators, bid modeling, cost dashboards, and CLI extensions.
  • Work across the client frontend to ship customer-facing features including usage graphs, billing views, and reservation management UIs.

Operations

  • Participate in on-call rotations, triage production incidents, and contribute to operational runbooks and automation that reduce toil over time.

Requirements

  • Strong Python backend skills — you've built and maintained production APIs at real scale, not just prototypes.
  • Proven experience with distributed systems; comfortable with messaging systems (RabbitMQ, Kafka) and RPC tooling (gRPC, protobuf, tRPC).
  • Fluency with relational databases: schema design, query optimization, and migrations in production environments.
  • Ability to own a feature end-to-end — from design doc through backend API, database migration, feature flag rollout, and monitoring.
  • Strong debugging instincts: calm and systematic when triaging production incidents, with a bias toward durable fixes.
  • Comfortable in small teams where ownership is high and process is lightweight.

Nice to Have

  • Experience with marketplace and billing systems — especially variable-usage, auction-based, or complex reservation models.
  • Familiarity with Kubernetes, AWS infrastructure, or cloud provider APIs (GCP, OCI, Nebius).
  • Familiarity with Linux containers and container orchestration (Docker Swarm, Nomad).
  • Comfort working across the stack — Python backend and TypeScript/React frontend.
  • Experience with developer tools or CLIs (bonus: SkyPilot or similar frameworks).
  • Proficiency with infrastructure-as-code (Terraform, Kustomize) or observability tooling (Grafana, Prometheus).
  • Background in real-time systems: SSE, WebSockets, or event-driven architectures.
  • Prior experience at a startup where you wore multiple hats and shipped across domains.

Benefits

  • Health, dental, and vision coverage for you and your dependents
  • 401(k) plan with 4% company match
  • 21 days of PTO and 14 company holidays, including 2 floating holidays

Skills

Python, Postgres, RabbitMQ, Kafka, gRPC, Kubernetes, GCP, OCI, Terraform, Prometheus, React, TypeScript
