Skip to content

Senior Machine Learning Engineer - Enrichment & Content Intelligence

New York, NYHybrid
Summary

As a Senior Machine Learning Engineer, you will evolve large-scale ML pipelines and multimodal embedding frameworks to power Spotify's content-resolution systems. You will improve entity-resolution across music and video content, design experiments, and build scalable ML evaluation and monitoring infrastructure.

About the role

What You'll Do

  • Own and evolve large-scale ML pipelines powering Spotify’s content-resolution systems
  • Lead development of multimodal embedding frameworks supporting multimodal understanding, music video matching, SongDNA
  • Improve entity-resolution systems across music and video content, helping Spotify better understand relationships between recordings, versions, and content formats
  • Design and run experiments to improve precision, recall, and overall content-quality outcomes using offline evaluation, golden datasets, A/B testing, and impact analysis
  • Build scalable ML evaluation and monitoring infrastructure, including standardized datasets, retraining workflows, and continuous improvement systems
  • Contribute to the evolution of the Music Knowledge Graph by improving production ML capabilities, observability, and model lifecycle management
  • Partner closely with Product Managers, Data Scientists, and engineering teams across Content Platform and the wider Experience Mission
  • Help shape technical strategy for the squad and contribute to long-term ML direction across the product area
  • Mentor engineers and contribute to a strong culture of technical collaboration and experimentation

Who You Are

  • You have solid experience building, deploying, and maintaining machine learning systems in production at scale
  • You have strong experience training, evaluating, and operating ML models using modern frameworks such as PyTorch or TensorFlow
  • You have experience working with multimodal machine learning systems across audio, computer vision, text embeddings, or related domains
  • You understand entity resolution, deduplication, record linkage, or large-scale matching problems, ideally across multiple content modalities
  • You know how to design evaluation systems that balance model quality, operational performance, and real-world impact
  • You are experienced working with large-scale distributed data processing systems and ML infrastructure
  • You communicate effectively across engineering, product, and data science stakeholders
  • You are comfortable leading technical initiatives and influencing engineering direction within a team
  • Experience with Scio, Dataflow, Flyte, BigQuery, or similar distributed processing frameworks is a plus
  • Experience with Scala is a plus
  • Experience with computer vision, video understanding, multimodal embeddings, or recommendation systems is a strong plus
Skills
PyTorchTensorFlowMultimodal Machine LearningEntity ResolutionDistributed Data ProcessingScioDataflowFlyteBigQueryScalaComputer VisionVideo UnderstandingRecommendation Systems