Applied Research Scientist - Foundation Models

Develops and optimizes transformer-based vision-language foundation models for physical security, owning full-cycle training, fine-tuning, compression, and deployment for real-time inference on images, videos, and text. Requires PhD/Master's in CS/EE, hands-on ML expertise with PyTorch/TensorFlow, Transformers, and ViTs.

140k – 175k · Redwood City, CA · ML Engineering · Hybrid

About the role

What you'll do

  • Develop & Optimize VLMs: Design transformer-based vision-language models that understand images, videos, and text, and optimize them for real-time inference.
  • Pre-training & Fine-tuning: Own the full training pipeline—from pre-training on image-text data to fine-tuning for Ambient.ai’s physical security domain and use cases.
  • Model Compression & Optimization: Apply techniques like distillation, quantization, and pruning to reduce model size and latency, enabling efficient edge deployment.
  • Leverage Open-Source & Innovate: Use and extend state-of-the-art open-source models. Prototype new architectures and training methods to advance Ambient.ai’s multimodal AI research.
  • Cross-Team Collaboration: Work with engineering and product teams to integrate models into the platform. Iterate based on real-world feedback and deployment data to improve performance.
  • Research and Experimentation: Stay current with vision, NLP, and multimodal AI research. Design experiments to test new algorithms and continually enhance our core AI systems.

What you'll bring

  • Ph.D. or Master’s in CS, EE, or a related field, with a strong foundation in AI/ML (Ph.D. preferred, or Master’s with strong experience)
  • Proficient in Python/C++ and deep learning frameworks like PyTorch or TensorFlow. Comfortable with large-scale training pipelines
  • Hands-on experience with CNNs, Transformers, and Vision Transformers (ViT). Strong understanding of vision-language models and how to fine-tune or adapt them
  • Proven skills in model training and optimization, including fine-tuning on large datasets and applying distillation, quantization, or similar techniques. Experience with foundation or multimodal models is a plus.
  • Strong problem-solving ability: quick prototyping, diagnosing failure cases, and iterating on solutions
  • Startup experience preferred: Comfortable with ambiguity, fast iteration, and owning projects end-to-end

Skills

PyTorch · TensorFlow · Python · C++ · Vision Transformers · Transformers · CNNs · Vision-Language Models · Model Distillation · Quantization · Model Pruning · Multimodal AI

Similar roles

ML Engineering jobs

Applied Scientist II (Audio)

Builds, tunes, and deploys state-of-the-art audio deepfake detection models for production, ensuring robustness against real-world conditions like noise and compression. Requires Master's/PhD in ML or related field with 3+ years experience in ML model deployment, Python, PyTorch/JAX, and audio processing expertise.

140k – 180k · New York, NY · ML Engineering · Remote · 3+ YOE · JAX · Python

Machine Learning Engineer (AI Platform Lead)

Build and scale ML compute infrastructure and distributed training pipelines for foundation models. Optimize GPU/CPU efficiency and data throughput for large-scale model training and inference.

140k – 180k · United States · ML Engineering · Remote · 5+ YOE · JAX · AWS

Robotic Software Engineer, Perception

As a Perception Autonomy Engineer, you will develop, integrate, and maintain real-time sensor software solutions for autonomous vehicles. This involves designing and deploying AI/ML sensor algorithms, creating interfacing software for sensor control, and collaborating with cross-functional teams for seamless deployment and testing.

140k – 220k · San Diego, CA · ML Engineering · On-site · 5+ YOE · C++ · ONNX

Robotic Software Engineer, Perception

As a Perception Autonomy Engineer, you will develop, integrate, and maintain real-time sensor software solutions for autonomous vehicles. This involves designing and deploying AI/ML sensor algorithms and collaborating with various teams to ensure seamless deployment and customer satisfaction.

140k – 220k · Ann Arbor, MI · ML Engineering · On-site · 5+ YOE · C++ · ONNX

Machine Learning Engineer

Design, build, and operate cloud-native data and ML infrastructure powering real-time intelligence for Twilio products. Requires 3-5 years of production ML/data systems experience and strong Python/SQL skills.

139k – 204k · United States · ML Engineering · Remote · 3+ YOE · SQL · AWS