Skip to content

AI Researcher, Core ML (Turbo)

Develops efficient inference engines and RL/post-training pipelines for production-scale LLMs, optimizing algorithms, systems, and performance across the stack. Requires 3+ years in ML systems/RL/inference and advanced degree.

200k – 280kSan Francisco, CAAI ResearchOnsite3+ YOE

About the role

Requirements

  • Strong expertise in at least one area, with interest to grow across: large-scale inference systems (SGLang, vLLM, FasterTransformer, TensorRT), GPU performance, distributed serving; RL/post-training for LLMs (GRPO, RLHF/RLAIF, DPO); Transformer architectures; distributed systems/HPC for ML.
  • Comfortable from algorithms to engines: strong Python coding, profiling/optimizing GPU/networking/memory, implementing production-grade features.
  • Solid research foundation: track record in ML systems/RL/large-scale training (papers, open-source, production); ability to read papers and implement changes.
  • Full-stack problem-solving: identify bottlenecks, collaborate across teams.

Minimum qualifications:

  • 3+ years in ML systems, large-scale model training/inference, or equivalent.
  • Advanced degree in CS, EE, or related field, or equivalent experience.
  • Experience owning complex technical projects end-to-end.

Responsibilities

  • Advance inference efficiency: Design/prototype algorithms/architectures/scheduling; implement in engines (SGLang/vLLM, ATLAS, quantization); profile/optimize GPU/networking/memory.
  • Unify inference with RL/post-training: Design/operate RL pipelines (RLHF, RLAIF, GRPO, DPO); optimize with inference-aware techniques (async rollouts, speculative decoding); train/evaluate frontier models; co-design algorithms/infra; run ablations.
  • Own production systems: Profile/debug/optimize services; drive engine modifications (kernels, scheduling, APIs); establish metrics/benchmarks.
  • Technical leadership (Staff level): Set direction for cross-team efforts; mentor engineers/researchers.

Compensation

US base salary: $200,000 - $280,000 + equity + benefits.

Skills

PythonSglangvLLMTensorRTFastertransformerRLHFDpoGrpoGpu OptimizationSpeculative Decoding

Similar roles

AI Research jobs

Research Scientist

Leads original research in action-conditioned world models, physical AI, and generative modeling for embodied systems. Requires PhD in ML/CS/Robotics with top publications and expertise in generative models and large-scale training.

200k – 325kSan Francisco, CAAI ResearchOn-siteDpoRLHF

Founding ML Researcher

Founding ML Researcher shapes ML research direction for document AI, owns end-to-end lifecycle from research to production deployment. Requires expertise in VLMs, computer vision, unstructured data parsing; PhD preferred.

200k – 300kSan Francisco, CAAI ResearchOn-sitePyTorchDocument Ai

ML Scientist

Pioneers innovative ML techniques and builds foundation models for clinical information extraction and synthesis from medical records. Requires PhD in CS/math with NLP/ML focus, high-impact publications, and experience with large-scale model training using PyTorch/JAX.

200k – 250kBoston, MA +1AI ResearchOn-siteJAXLLMs

AI Research Engineer

Develops advanced AI agents for healthcare revenue recovery, focusing on human-like conversational AI, model improvements, LLM orchestration, and evaluation frameworks to handle insurance interactions and billing tasks.

200k – 300kNew York, NY +1AI ResearchOn-siteLLMsPython

Research Scientist, Frontier Risk Evaluations

Designs evaluation measures, harnesses, and datasets to assess risks from frontier AI systems, including dangerous capabilities testing. Collaborates with agencies, publishes methodologies for policymakers; requires 3+ years ML experience and publications in generative AI.

197k – 247kSan Francisco, CA +2AI ResearchOn-site3+ YOELLMsAi Safety