Skip to content

Full-Stack Software Engineer, Reinforcement Learning

Build full-stack platforms, tools, and UIs for RL environment creation, data collection at scale, and training observability to improve AI models like Claude. Requires strong Python, modern web stack proficiency, high agency, and ability to ship reliable systems quickly in a fast-paced environment.

300k – 405kSan Francisco, CANew York, NYFullstack EngineeringHybrid

About the role

What You'll Do

  • Build and extend web platforms for RL environment creation, management, and quality review — including environment configuration, versioning, and validation workflows
  • Develop vendor-facing interfaces and tooling that let external partners create, submit, and iterate on training environments with minimal friction
  • Design and implement platforms for human data collection at scale, including labeling workflows, quality assurance systems, and feedback mechanisms that surface reward signal integrity issues early
  • Build evaluation dashboards and observability UIs that give researchers real-time insight into environment quality, training run health, and reward hacking
  • Create backend services and APIs that connect environment authoring tools, data collection systems, and RL training infrastructure
  • Build and expand scalable code data generation pipelines, producing diverse programming tasks with robust reward signals across languages and difficulty levels
  • Develop onboarding automation and documentation tooling so new vendors and internal users ramp up in hours, not weeks
  • Partner closely with RL researchers, data operations, and vendor management to translate ambiguous requirements into well-scoped, well-designed products

You May Be a Good Fit If You

  • Have strong software engineering fundamentals and real full-stack range — you're comfortable owning a surface from database schema to frontend
  • Are proficient in Python and a modern web stack (React, TypeScript, or similar)
  • Have a track record of shipping systems that solved a hard problem, not just shipped on time — e.g. you built the thing that made your team 10x faster, or the internal tool nobody thought was possible
  • Operate with high agency: you identify what needs to be done and drive it forward without waiting for a ticket
  • Have found yourself wondering "why isn't this moving faster?" in previous roles — and then have done something about it
  • Care about UX and can build interfaces that are intuitive for both technical researchers and non-technical labelers
  • Communicate clearly with researchers, operations teams, and engineers, and can turn vague asks into well-scoped work
  • Thrive in a fast-moving environment where priorities shift, Claude is your pair programmer, and the next problem is often one nobody has solved before

Strong Candidates May Also Have

  • Built data collection, labeling, or annotation platforms — ideally ones that had to scale across many vendors or many task types
  • Background building multi-tenant platforms with role-based access, audit trails, and vendor management workflows
  • Experience with cloud infrastructure (GCP or AWS), Docker, and CI/CD pipelines
  • Familiarity with LLM training, fine-tuning, or evaluation workflows
  • Experience with async Python (Trio, asyncio) or high-throughput API design
  • Background in dashboards, monitoring, or observability tooling
  • Experience working directly with external vendors or partners on technical integrations

Skills

PythonReactTypeScriptGCPAWSDockerAsyncioTrioAPIsCI/CD

Software Engineer, Full-stack

Full-stack engineers build and ship AI product experiences including developer tools, enterprise systems, vertical apps for industries like healthcare and finance, and growth features scaling to millions of users. Requires 5+ years experience, expertise in React/TypeScript, and product sense in fast-paced AI environments.

300k – 320kSan Francisco, CA +2Fullstack EngineeringHybrid5+ YOEAWSAPIs

Founding Engineer

Founding engineer owns full stack development for AI wearable hardware platform, including mobile apps, backend, firmware, and BLE. Builds magical prototypes, architects SDK/platform, and shapes product direction with founder.

300k – 300kSan Francisco, CAFullstack EngineeringOn-siteBleFlask

Software Engineer, Accelerators

Develop and optimize low-level software kernels and systems for new AI accelerator platforms to enable efficient large-scale training and inference of models like LLMs. Requires 3+ years in AI infrastructure, experience with data center-scale accelerators like TPUs, and strong systems skills.

295k – 380kSan Francisco, CAFullstack EngineeringOn-site3+ YOETpusLLMs

Full Stack Software Engineer, API Experience

Full stack engineer building developer-facing products like the API Playground, docs, SDKs, and onboarding flows. Requires 5+ years experience, strong TypeScript/React skills, and backend proficiency.

293k – 385kNew York, NYFullstack EngineeringHybrid5+ YOEGoRust

Full Stack Engineer, ChatGPT Finances

Full-stack engineer building consumer-facing financial features for ChatGPT, spanning polished frontends, backend APIs, data integrations, and AI capabilities.

293k – 325kSan Francisco, CAFullstack EngineeringHybrid5+ YOEGoReact