Applied AI Scientist, Small Language Model and AI Training

Leads R&D on small language models and AI training, developing efficient architectures, optimizing performance, and ensuring safety. Collaborates with research, engineering, and product teams using Python, PyTorch, TensorFlow, or JAX.

219k – 276kSan Francisco, CAAI ResearchHybrid

Apply

About the role

The Opportunity

As an Applied Scientist specializing in Small Language Models and AI Training, you will lead research and development efforts focused on building efficient, high-performance language models tailored for practical applications. You will work closely with research, engineering, and product teams to advance model training techniques, optimize architectures, and scale AI solutions. Your work will directly contribute to AI systems that are safe, interpretable, and impactful across diverse usage scenarios.

What You’ll Do

Lead research and development of novel training methodologies and architectures for small and efficient language models.
Design, implement, and evaluate model training experiments to improve performance, robustness, and generalization of language models.
Collaborate closely with research scientists and engineers on scalable training pipelines and model deployment strategies.
Develop techniques for model compression, fine-tuning, and domain adaptation to optimize models for real-world applications.
Ensure AI safety, fairness, and alignment principles are integrated into model training processes and evaluated rigorously.
Mentor and support cross-functional teams on applied machine learning methods and best practices.
Evaluate and integrate new tools, frameworks, and datasets to accelerate AI training workflows.
Partner with product teams to translate model capabilities into actionable features aligned with user needs and ethical standards.

About You

Have demonstrated experience in applied research or engineering roles focused on training language models, ideally small or efficient models.
Strong programming skills in Python and familiarity with machine learning frameworks such as PyTorch, TensorFlow, or JAX.
Deep understanding of language model architectures, training techniques, and optimization strategies.
Experience with distributed training, data pipeline design, and scalable AI infrastructure.
Passion for AI safety, interpretability, and delivering user-centered AI technology.
Excellent communication skills with proven ability to collaborate across research, engineering, and product teams.

Preferred

Prior experience working with large and small language models in production or research settings.
Background in reinforcement learning, prompt engineering, or transfer learning techniques.
Experience with developer tools, APIs, or frameworks related to AI model integration and delivery.
Knowledge of AI alignment, fairness, and ethical AI training methodologies.

Compensation: Base salary $218,500 - $276,000 plus equity.

Skills

PythonPyTorchTensorFlowJAXLanguage ModelsDistributed TrainingModel CompressionFine-TuningReinforcement LearningAi Safety

Similar roles

AI Research jobs

Scale AI

Research Scientist, Safety Post Training

Develop and apply post-training methods and interpretability techniques to improve safety and understanding of frontier AI systems. Requires 3+ years of ML experience, expertise in RL techniques like RLHF and DPO, and published research in generative AI.

216k – 270kSan Francisco, CA +1AI ResearchHybrid3+ YOEDpoRLHF

Retell AI

Research Scientist - LLM

Conducts ML research to advance LLMs and audio models for real-time voice AI agents, focusing on reasoning, latency, and conversational quality. Prototypes models, designs evaluations, and bridges research to production systems requiring strong PyTorch expertise and experimental mindset.

225k – 400kRedwood City, CAAI ResearchOn-siteLLMsPyTorch

character.ai

Research Engineer, AI Safety & Alignment

Develops evaluation methods, alignment techniques, and adversarial testing for large language models to ensure safety and alignment with human values. Requires PhD in ML/CS, production code skills, GPU experience, and transformers/RL expertise.

225k – 400kRedwood City, CAAI ResearchOn-siteRLHFGpus

Baseten

Post-Training Research Scientist

Conducts research on post-training methodologies and performant inference for AI models, balancing pure research with applied work for production systems. Requires PhD in ML with top publications and ability to design rigorous experiments at scale.

210k – 285kSan Francisco, CAAI ResearchHybridJAXLLMs

Abridge

Machine Learning Scientist (All Levels)

Conducts machine learning research in medical NLP for conversation summarization, evidence extraction, and outcome prediction. Publishes at top AI conferences, deploys models to production, and requires MS/PhD plus strong PyTorch/TensorFlow experience.

205k – 300kSan Francisco, CA +2AI ResearchHybridJAXPyTorch