Research Engineer, FlexOlmo

129k – 193kSeattle, WAML EngineeringOnsite4+ YOEApr 22

Summary

Designs and implements infrastructure for training next-generation LLM architectures focused on Mixture-of-Experts and long-context models. Requires 4+ years building ML pipelines with PyTorch/JAX/TF, deep learning expertise, and strong software engineering skills.

About the role

Responsibilities

Building infrastructure to facilitate the next generation of LLM research.
Optimizing training and inference for language models.
Triaging between experiments and executing on the most impactful.
Bridging the gap between cutting-edge research and a widely adopted product.
Bringing software engineering best practices to a research environment.
Supporting and collaborating with an open-source community.
Releasing contributions back to the broader community in the form of open source software, model releases, and additions to Ai2's public API and open research datasets, as well as technical reports.

Requirements

A bachelor's degree in Data Science/CS/EE/Applied Mathematics/Statistics/ML/NLP, or a related field, or equivalent relevant experience, and expertise at building ML infrastructure.
4+ years of experience building infrastructure that handles data preprocessing/transformation and model training, evaluation, inference, and deployment. Experience in the complete model development cycle, including data set construction, training, tuning, evaluation, performance profiling, and monitoring.
Knowledge of modern deep learning and natural language processing techniques.
Strong software engineering skills, particularly around building performant systems and debugging.
Experience with Python and PyTorch/Jax/Tensorflow as well as feel at ease in picking up new programming languages, libraries, or APIs as project needs evolve.
Familiarity working with cloud compute resources (e.g. AWS) and containerization (e.g. Docker).
Strong collaboration and communication skills.

Nice-to-Haves

Advanced degree in Data Science/CS/EE/Applied Mathematics/Statistics/ML/NLP or related fields and/or relevant and equivalent engineering experience.
Contributions to open-source ML or research libraries (e.g. spaCy, AllenNLP, transformers).
Experience successfully operating models at scale in a production setting.
Experience in HPC settings.
Curiosity about AI research.

Compensation

Base salary range: $128,880 - $193,320, plus generous bonus plans.

Skills

PyTorchJAXTensorFlowPythonAWSDockerDeep LearningNatural Language ProcessingTransformersMixture-of-ExpertsLLMsMachine Learning Infrastructure

Similar roles at this salary range

All ML Engineering jobs →

Mozilla

Jun 19

Senior Machine Learning Engineer

Senior ML Engineer focused on fine-tuning and deploying LLMs and generative AI features into Firefox, emphasizing privacy, latency, and user experience.

139k – 218kUnited StatesML EngineeringRemote4+ YOERayLangChain

Distyl AI

Jun 18

AI Engineer, Evaluation

Design and implement evaluation frameworks and pipelines for AI systems using Evaluation-Driven Development. Build Python-based test suites, LLM graders, and measurement systems that guide prompt iteration and production deployment decisions.

150k – 250kSan Francisco, CA +1ML EngineeringHybrid2+ YOEPythonAI Systems

Grafana Labs

Jun 18

Senior AI Engineer

Senior Engineer building multi-agent AI systems, LLM integrations, and backend automation services that power Marketing Operations. Owns technical direction for agentic infrastructure connecting models to business systems.

154k – 185kUnited StatesML EngineeringRemote8+ YOERAGGit

Twilio

Jun 16

Senior / Staff Applied Research Software Engineer

Senior or Staff Applied Research Software Engineer building AI/ML prototypes and production solutions. Requires 3-5+ years full-stack experience with modern web frameworks, databases, and strong AI-assisted coding skills.

142k – 252kUnited StatesML EngineeringRemote5+ YOEAISQL

Together AI

Jun 15

Research Intern, Model Shaping

Research intern on the Model Shaping team working on post-training methods, efficient neural network training, and foundation model evaluation. Requires strong ML fundamentals and PyTorch/JAX experience.

121k – 131kSan Francisco, CAML EngineeringOn-siteEntry levelJAXPyTorch

Apply