Skip to content

Research Scientist / Engineer — Multimodal Agent

250k – 450kPalo Alto, CAML EngineeringHybrid
Summary

Builds and trains large-scale multimodal agentic models involving reasoning, planning, coding, and tool calling. Requires strong ML foundations, PyTorch expertise, and experience with distributed training on massive datasets.

About the role

What You'll Do

Modeling

  • Architect large-scale multimodal agentic models that use reasoning, planning, coding, and tool calling to achieve complex, multi-step multimodal work.

Data

  • Hillclimbing existing tasks and formulating new tasks through data.
  • Design, implement, and run robust data pipelines for constructing, enriching, and filtering massive pixel datasets.

Systems

  • Train large-scale multimodal models on massive datasets and GPU clusters.

Evaluation

  • Define and build novel evaluation frameworks to measure multimodal agents.

Who You Are

  • Strong foundation in machine learning, foundation models and agentic systems.
  • Deep understanding of agentic systems and approaches in LLM/VLM reasoning, coding models, LLM/VLM tool calling.
  • Hands-on experience with PyTorch and large-scale training (distributed, mixed precision, large datasets).

What Sets You Apart (Bonus Points)

  • Experience in the following around data, modeling, or evaluation: State-of-the-art foundation models in reasoning, State-of-the-art foundation models in coding, State-of-the-art foundation models in tool calling, State-of-the-art multimodal agents.

Compensation

  • The base pay range for this role is $250,000 – $450,000 per year.
Skills
PyTorchMachine LearningFoundation ModelsLLMVLMDistributed TrainingMixed Precision TrainingMultimodal ModelsAgentic SystemsReasoning ModelsCoding ModelsTool CallingData PipelinesGPU Clusters
Similar roles at this salary range
All ML Engineering jobs →
OpenAI

Research Engineer / Research Scientist

Research and develop improvements to models' personalization and agentic capabilities through reinforcement learning, dataset creation, and post-training methods. Requires strong ML engineering skills and research experience with novel models.

295k – 555kSan Francisco, CAML EngineeringHybrid7+ YOEPythonPyTorch
Plaid

Machine Learning Engineer - Embedded Insights

Drive ML initiatives from concept to production on the Embedded Insights team. Identify opportunities, build and deploy models using Plaid's financial datasets, and partner with product teams to deliver scalable customer-facing intelligence products.

212k – 272kSan Francisco, CA +2ML EngineeringHybrid5+ YOESQLMLOps
Plaid

Machine Learning Engineer

Advance Plaid’s foundation models by developing novel architectures, pretraining objectives, and fine-tuning strategies. Work across the full ML stack from data engineering to production serving and monitoring.

212k – 272kSan Francisco, CA +2ML EngineeringHybrid1+ YOELLMsPython
Decagon

Staff Software Engineer, Agents

Build and own end-to-end AI agents for enterprise customers, integrating latest text/voice models and iterating based on real-world usage. Requires 8+ years of software engineering experience with Python and TypeScript.

200k – 400kSan Francisco, CAML EngineeringOn-site8+ YOEPythonAI Agents
Reddit

Staff Machine Learning Engineer, Notifications Relevance

Technical leader for Reddit's Notifications Relevance ML systems, driving large-scale recommendation systems spanning retrieval, ranking, budget optimization, and LLM-powered experiences.

230k – 322kUnited StatesML EngineeringRemote8+ YOEPythonGolang