Research Scientist / Engineer — Multimodal Agent

250k – 450kPalo Alto, CAML EngineeringHybridMar 31

Summary

Builds and trains large-scale multimodal agentic models involving reasoning, planning, coding, and tool calling. Requires strong ML foundations, PyTorch expertise, and experience with distributed training on massive datasets.

About the role

What You'll Do

Modeling

Architect large-scale multimodal agentic models that use reasoning, planning, coding, and tool calling to achieve complex, multi-step multimodal work.

Data

Hillclimbing existing tasks and formulating new tasks through data.
Design, implement, and run robust data pipelines for constructing, enriching, and filtering massive pixel datasets.

Systems

Train large-scale multimodal models on massive datasets and GPU clusters.

Evaluation

Define and build novel evaluation frameworks to measure multimodal agents.

Who You Are

Strong foundation in machine learning, foundation models and agentic systems.
Deep understanding of agentic systems and approaches in LLM/VLM reasoning, coding models, LLM/VLM tool calling.
Hands-on experience with PyTorch and large-scale training (distributed, mixed precision, large datasets).

What Sets You Apart (Bonus Points)

Experience in the following around data, modeling, or evaluation: State-of-the-art foundation models in reasoning, State-of-the-art foundation models in coding, State-of-the-art foundation models in tool calling, State-of-the-art multimodal agents.

Compensation

The base pay range for this role is $250,000 – $450,000 per year.

Skills

PyTorchMachine LearningFoundation ModelsLLMVLMDistributed TrainingMixed Precision TrainingMultimodal ModelsAgentic SystemsReasoning ModelsCoding ModelsTool CallingData PipelinesGPU Clusters

Similar roles at this salary range

All ML Engineering jobs →

OpenAI

Jun 19

Research Engineer / Research Scientist

Research and develop improvements to models' personalization and agentic capabilities through reinforcement learning, dataset creation, and post-training methods. Requires strong ML engineering skills and research experience with novel models.

295k – 555kSan Francisco, CAML EngineeringHybrid7+ YOEPythonPyTorch

Plaid

Jun 18

Machine Learning Engineer - Embedded Insights

Drive ML initiatives from concept to production on the Embedded Insights team. Identify opportunities, build and deploy models using Plaid's financial datasets, and partner with product teams to deliver scalable customer-facing intelligence products.

212k – 272kSan Francisco, CA +2ML EngineeringHybrid5+ YOESQLMLOps

Plaid

Jun 18

Machine Learning Engineer

Advance Plaid’s foundation models by developing novel architectures, pretraining objectives, and fine-tuning strategies. Work across the full ML stack from data engineering to production serving and monitoring.

212k – 272kSan Francisco, CA +2ML EngineeringHybrid1+ YOELLMsPython

Decagon

Jun 18

Staff Software Engineer, Agents

Build and own end-to-end AI agents for enterprise customers, integrating latest text/voice models and iterating based on real-world usage. Requires 8+ years of software engineering experience with Python and TypeScript.

200k – 400kSan Francisco, CAML EngineeringOn-site8+ YOEPythonAI Agents

Jun 17

Staff Machine Learning Engineer, Notifications Relevance

Technical leader for Reddit's Notifications Relevance ML systems, driving large-scale recommendation systems spanning retrieval, ranking, budget optimization, and LLM-powered experiences.

230k – 322kUnited StatesML EngineeringRemote8+ YOEPythonGolang

Apply