Research Intern, Model Shaping
Research intern on the Model Shaping team working on post-training methods, efficient neural network training, and foundation model evaluation. Requires strong ML fundamentals and PyTorch/JAX experience.
Responsibilities
- Research and implement novel techniques in one or more of our focus areas
- Design and conduct rigorous experiments to validate hypotheses
- Document findings in scientific publications and blog posts
- Integrate the research results into Together products
Requirements
- Currently pursuing a Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or a related field
- Strong knowledge of Machine Learning and Deep Learning fundamentals
- Experience with deep learning frameworks (PyTorch, JAX, etc.)
- Familiarity with the Transformer architecture and recent developments in foundation models
Preferred Requirements
- Prior research experience with training foundation models or efficient machine learning
- Publications at leading ML and NLP conferences (such as NeurIPS, ICML, ICLR, ACL, or EMNLP)
- Understanding of model optimization techniques and hardware acceleration approaches
- Contributions to open-source machine learning projects
Internship Program Details
- Fall internship program spans 12 to 16 weeks
- Internship dates: September 14th to December 18th
- Located in San Francisco or Amsterdam office
Senior / Staff Applied Research Software Engineer
Senior or Staff Applied Research Software Engineer building AI/ML prototypes and production solutions. Requires 3-5+ years full-stack experience with modern web frameworks, databases, and strong AI-assisted coding skills.
ML Engineer
Founding ML Engineer building production ML systems for governance, security, and agentic platform capabilities at Docker. Requires 5+ years applied ML experience shipping systems and 4+ years backend/infra engineering.
Systems Research Engineer Intern - GPU Programming
Intern developing and optimizing GPU-accelerated kernels for ML/AI applications. Requires strong GPU programming background (CUDA/Triton) and knowledge of performance optimization.