Skip to content

Research Intern, Model Shaping

121k – 131kSan Francisco, CAML EngineeringOnsiteEntry level
Summary

Research intern on the Model Shaping team working on post-training methods, efficient neural network training, and foundation model evaluation. Requires strong ML fundamentals and PyTorch/JAX experience.

About the role

Responsibilities

  • Research and implement novel techniques in one or more of our focus areas
  • Design and conduct rigorous experiments to validate hypotheses
  • Document findings in scientific publications and blog posts
  • Integrate the research results into Together products

Requirements

  • Currently pursuing a Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or a related field
  • Strong knowledge of Machine Learning and Deep Learning fundamentals
  • Experience with deep learning frameworks (PyTorch, JAX, etc.)
  • Familiarity with the Transformer architecture and recent developments in foundation models

Preferred Requirements

  • Prior research experience with training foundation models or efficient machine learning
  • Publications at leading ML and NLP conferences (such as NeurIPS, ICML, ICLR, ACL, or EMNLP)
  • Understanding of model optimization techniques and hardware acceleration approaches
  • Contributions to open-source machine learning projects

Internship Program Details

  • Fall internship program spans 12 to 16 weeks
  • Internship dates: September 14th to December 18th
  • Located in San Francisco or Amsterdam office
Skills
PyTorchJAXMachine LearningDeep LearningTransformer architectureSupervised LearningPreference OptimizationReinforcement LearningDistributed TrainingFoundation Models
Similar roles at this salary range
All ML Engineering jobs →
Mozilla

Senior Machine Learning Engineer

Senior ML Engineer focused on fine-tuning and deploying LLMs and generative AI features into Firefox, emphasizing privacy, latency, and user experience.

139k – 218kUnited StatesML EngineeringRemote4+ YOERayLangChain
Twilio

Senior / Staff Applied Research Software Engineer

Senior or Staff Applied Research Software Engineer building AI/ML prototypes and production solutions. Requires 3-5+ years full-stack experience with modern web frameworks, databases, and strong AI-assisted coding skills.

142k – 252kUnited StatesML EngineeringRemote5+ YOEAISQL
Docker

ML Engineer

Founding ML Engineer building production ML systems for governance, security, and agentic platform capabilities at Docker. Requires 5+ years applied ML experience shipping systems and 4+ years backend/infra engineering.

139k – 226kPalo Alto, CA +1ML EngineeringRemote5+ YOELLMsRetrieval
Together AI

Systems Research Engineer Intern - GPU Programming

Intern developing and optimizing GPU-accelerated kernels for ML/AI applications. Requires strong GPU programming background (CUDA/Triton) and knowledge of performance optimization.

121k – 131kSan Francisco, CAML EngineeringOn-siteEntry levelCUDATriton
Together AI

Research Intern, Inference

Research intern on the Inference team building efficient serving systems for large foundation models. Focus on distributed inference, compiler-aware optimization, and novel inference-time strategies.

121k – 131kSan Francisco, CAML EngineeringOn-siteEntry levelJAXCUDA