AI Researcher, Core ML (Turbo)

Develops efficient inference engines and RL/post-training pipelines for production-scale LLMs, optimizing algorithms, systems, and performance across the stack. Requires 3+ years in ML systems/RL/inference and advanced degree.

200k – 280kSan Francisco, CAAI ResearchOnsite3+ YOE

Apply

About the role

Requirements

Strong expertise in at least one area, with interest to grow across: large-scale inference systems (SGLang, vLLM, FasterTransformer, TensorRT), GPU performance, distributed serving; RL/post-training for LLMs (GRPO, RLHF/RLAIF, DPO); Transformer architectures; distributed systems/HPC for ML.
Comfortable from algorithms to engines: strong Python coding, profiling/optimizing GPU/networking/memory, implementing production-grade features.
Solid research foundation: track record in ML systems/RL/large-scale training (papers, open-source, production); ability to read papers and implement changes.
Full-stack problem-solving: identify bottlenecks, collaborate across teams.

Minimum qualifications:

3+ years in ML systems, large-scale model training/inference, or equivalent.
Advanced degree in CS, EE, or related field, or equivalent experience.
Experience owning complex technical projects end-to-end.

Responsibilities

Advance inference efficiency: Design/prototype algorithms/architectures/scheduling; implement in engines (SGLang/vLLM, ATLAS, quantization); profile/optimize GPU/networking/memory.
Unify inference with RL/post-training: Design/operate RL pipelines (RLHF, RLAIF, GRPO, DPO); optimize with inference-aware techniques (async rollouts, speculative decoding); train/evaluate frontier models; co-design algorithms/infra; run ablations.
Own production systems: Profile/debug/optimize services; drive engine modifications (kernels, scheduling, APIs); establish metrics/benchmarks.
Technical leadership (Staff level): Set direction for cross-team efforts; mentor engineers/researchers.

Compensation

US base salary: $200,000 - $280,000 + equity + benefits.

Skills

PythonSglangvLLMTensorRTFastertransformerRLHFDpoGrpoGpu OptimizationSpeculative Decoding

Similar roles

AI Research jobs

Hedra

Research Scientist

Leads original research in action-conditioned world models, physical AI, and generative modeling for embodied systems. Requires PhD in ML/CS/Robotics with top publications and expertise in generative models and large-scale training.

200k – 325kSan Francisco, CAAI ResearchOn-siteDpoRLHF

Unsiloed AI

Founding ML Researcher

Founding ML Researcher shapes ML research direction for document AI, owns end-to-end lifecycle from research to production deployment. Requires expertise in VLMs, computer vision, unstructured data parsing; PhD preferred.

200k – 300kSan Francisco, CAAI ResearchOn-sitePyTorchDocument Ai

Layer Health

ML Scientist

Pioneers innovative ML techniques and builds foundation models for clinical information extraction and synthesis from medical records. Requires PhD in CS/math with NLP/ML focus, high-impact publications, and experience with large-scale model training using PyTorch/JAX.

200k – 250kBoston, MA +1AI ResearchOn-siteJAXLLMs

Amperoshealth

AI Research Engineer

Develops advanced AI agents for healthcare revenue recovery, focusing on human-like conversational AI, model improvements, LLM orchestration, and evaluation frameworks to handle insurance interactions and billing tasks.

200k – 300kNew York, NY +1AI ResearchOn-siteLLMsPython

Scale AI

Research Scientist, Frontier Risk Evaluations

Designs evaluation measures, harnesses, and datasets to assess risks from frontier AI systems, including dangerous capabilities testing. Collaborates with agencies, publishes methodologies for policymakers; requires 3+ years ML experience and publications in generative AI.

197k – 247kSan Francisco, CA +2AI ResearchOn-site3+ YOELLMsAi Safety