Applied Research Scientist - Foundation Models

Develops and optimizes transformer-based vision-language foundation models for physical security, owning full-cycle training, fine-tuning, compression, and deployment for real-time inference on images, videos, and text. Requires PhD/Master's in CS/EE, hands-on ML expertise with PyTorch/TensorFlow, Transformers, and ViTs.

140k – 175kRedwood City, CAML EngineeringHybrid

Apply

About the role

What you'll do

Develop & Optimize VLMs: Design and optimize transformer-based vision-language models to understand images, videos, and text, and optimize for real-time inference.
Pre-training & Fine-tuning: Own the full training pipeline—from pre-training on image-text data to fine-tuning for Ambient.ai’s physical security domain and use cases.
Model Compression & Optimization: Apply techniques like distillation, quantization, and pruning to reduce model size and latency, enabling efficient edge deployment.
Leverage Open-Source & Innovate: Use and extend state-of-the-art open-source models. Prototype new architectures and training methods to advance Ambient.ai’s multimodal AI research.
Cross-Team Collaboration: Work with engineering and product teams to integrate models into the platform. Iterate based on real-world feedback and deployment data to improve performance.
Research and Experimentation: Stay current with vision, NLP, and multimodal AI research. Design experiments to test new algorithms and continually enhance our core AI systems.

What you'll bring

Ph.D. or Master’s in CS, EE, or related field, with a strong foundation in AI/ML (Ph.D. preferred or Master’s with strong experience)
Proficient in Python/C++ and deep learning frameworks like PyTorch or TensorFlow. Comfortable with large-scale training pipelines
Hands-on experience with CNNs, Transformers, and Vision Transformers (ViT). Strong understanding of vision-language models and how to fine-tune or adapt them
Proven skills in model training and optimization, including fine-tuning on large datasets and applying distillation, quantization, or similar techniques. Experience with foundation or multimodal models is a plus.
Strong problem-solving ability: quick prototyping, diagnosing failure cases, and iterating on solutions
Startup experience preferred: Comfortable with ambiguity, fast iteration, and owning projects end-to-end

Skills

PyTorchTensorFlowPythonC++Vision TransformersTransformersCnnsVision-Language ModelsModel DistillationQuantizationModel PruningMultimodal Ai

Similar roles

ML Engineering jobs

Reality Defender

Applied Scientist II (Audio)

Builds, tunes, and deploys state-of-the-art audio deepfake detection models for production, ensuring robustness against real-world conditions like noise and compression. Requires Master's/PhD in ML or related field with 3+ years experience in ML model deployment, Python, PyTorch/JAX, and audio processing expertise.

140k – 180kNew York, NYML EngineeringRemote3+ YOEJAXPython

Artera

Machine Learning Engineer (AI Platform Lead)

Build and scale ML compute infrastructure and distributed training pipelines for foundation models. Optimize GPU/CPU efficiency and data throughput for large-scale model training and inference.

140k – 180kUnited StatesML EngineeringRemote5+ YOEJAXAWS

Applied Intuition

Robotic Software Engineer, Perception

As a Perception Autonomy Engineer, you will develop, integrate, and maintain real-time sensor software solutions for autonomous vehicles. This involves designing and deploying AI/ML sensor algorithms, creating interfacing software for sensor control, and collaborating with cross-functional teams for seamless deployment and testing.

140k – 220kSan Diego, CAML EngineeringOn-site5+ YOEC++Onnx

Applied Intuition

Robotic Software Engineer, Perception

As a Perception Autonomy Engineer, you will develop, integrate, and maintain real-time sensor software solutions for autonomous vehicles. This involves designing and deploying AI/ML sensor algorithms and collaborating with various teams to ensure seamless deployment and customer satisfaction.

140k – 220kAnn Arbor, MIML EngineeringOn-site5+ YOEC++Onnx

Twilio

Machine Learning Engineer

Design, build, and operate cloud-native data and ML infrastructure powering real-time intelligence for Twilio products. Requires 3-5 years of production ML/data systems experience and strong Python/SQL skills.

139k – 204kUnited StatesML EngineeringRemote3+ YOESQLAWS