Research Intern, Model Shaping

121k – 131kSan Francisco, CAML EngineeringOnsiteEntry levelJun 15

Summary

Research intern on the Model Shaping team working on post-training methods, efficient neural network training, and foundation model evaluation. Requires strong ML fundamentals and PyTorch/JAX experience.

About the role

Responsibilities

Research and implement novel techniques in one or more of our focus areas
Design and conduct rigorous experiments to validate hypotheses
Document findings in scientific publications and blog posts
Integrate the research results into Together products

Requirements

Currently pursuing a Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or a related field
Strong knowledge of Machine Learning and Deep Learning fundamentals
Experience with deep learning frameworks (PyTorch, JAX, etc.)
Familiarity with the Transformer architecture and recent developments in foundation models

Preferred Requirements

Prior research experience with training foundation models or efficient machine learning
Publications at leading ML and NLP conferences (such as NeurIPS, ICML, ICLR, ACL, or EMNLP)
Understanding of model optimization techniques and hardware acceleration approaches
Contributions to open-source machine learning projects

Internship Program Details

Fall internship program spans 12 to 16 weeks
Internship dates: September 14th to December 18th
Located in San Francisco or Amsterdam office

Skills

PyTorchJAXMachine LearningDeep LearningTransformer architectureSupervised LearningPreference OptimizationReinforcement LearningDistributed TrainingFoundation Models

Similar roles at this salary range

All ML Engineering jobs →

Mozilla

Jun 19

Senior Machine Learning Engineer

Senior ML Engineer focused on fine-tuning and deploying LLMs and generative AI features into Firefox, emphasizing privacy, latency, and user experience.

139k – 218kUnited StatesML EngineeringRemote4+ YOERayLangChain

Twilio

Jun 16

Senior / Staff Applied Research Software Engineer

Senior or Staff Applied Research Software Engineer building AI/ML prototypes and production solutions. Requires 3-5+ years full-stack experience with modern web frameworks, databases, and strong AI-assisted coding skills.

142k – 252kUnited StatesML EngineeringRemote5+ YOEAISQL

Docker

Jun 15

ML Engineer

Founding ML Engineer building production ML systems for governance, security, and agentic platform capabilities at Docker. Requires 5+ years applied ML experience shipping systems and 4+ years backend/infra engineering.

139k – 226kPalo Alto, CA +1ML EngineeringRemote5+ YOELLMsRetrieval

Together AI

Jun 12

Systems Research Engineer Intern - GPU Programming

Intern developing and optimizing GPU-accelerated kernels for ML/AI applications. Requires strong GPU programming background (CUDA/Triton) and knowledge of performance optimization.

121k – 131kSan Francisco, CAML EngineeringOn-siteEntry levelCUDATriton

Together AI

Jun 12

Research Intern, Inference

Research intern on the Inference team building efficient serving systems for large foundation models. Focus on distributed inference, compiler-aware optimization, and novel inference-time strategies.

121k – 131kSan Francisco, CAML EngineeringOn-siteEntry levelJAXCUDA

Apply