Skip to content

Machine Learning Engineer

140k – 190kUnited StatesRemote4+ YOE
Summary

Build and deploy large-scale ML models for real-time fraud detection, engineering features from 1T+ events and maintaining production MLOps infrastructure on GCP. Requires 4+ years experience with Java/Scala, Python, Spark/Flink, and distributed systems.

About the role

What You'll Do

Model Development & Refinement

  • Design, build, and deploy online machine learning models (including ensemble methods, deep learning, transformer architectures and graph-based models) to catch evolving fraud vectors in real time

Feature Engineering at Scale

  • Engineer high-frequency time-series features from over 1 trillion behavioral events, optimizing for low-latency signal extraction and pattern recognition

Production MLOps

  • Maintain and enhance automated model training and deployment infrastructure, ensuring frictionless CI/CD of newly trained models

System Optimization

  • Write high-performance code to minimize scoring latency at runtime, ensuring core ML services scale seamlessly across distributed databases

Collaborative Innovation

  • Work cross-functionally with Core Infrastructure, Product Management, and Data Science teams to translate business-level fraud patterns into robust algorithmic solutions

Requirements

  • 4+ years of professional experience building and deploying large-scale machine learning models into high-traffic production environments
  • Strong proficiency in Java or Scala (for production backend) as well as Python (for data analysis and model prototyping)
  • Practical experience with Databricks and big data processing frameworks like Apache Spark, Apache Flink, or Hadoop, and working with NoSQL data stores like Bigtable
  • Deep understanding of statistical modeling, probability, and standard machine learning algorithms (e.g., XGBoost, Random Forests, Neural Networks, and Clustering techniques)
  • Ability to reason through data consistency, pipeline failures, and performance constraints in a distributed, multi-tenant cloud environment (GCP)

Preferred Qualifications

  • Experience explicitly in the fraud detection, risk mitigation, or cyber-security domains
  • Deep knowledge of streaming architectures (e.g., Apache Kafka)
  • Familiarity with containerization and orchestration tools like Docker and Kubernetes
  • Familiarity with leveraging AI coding assistants (e.g., Claude Code) to accelerate development and model prototyping
Skills
JavaScalaPythonDatabricksApache SparkApache FlinkHadoopBigtableXGBoostRandom ForestsNeural NetworksClusteringGCPApache KafkaDocker
Similar roles at this salary range
All ML Engineering jobs →
Together AI

Systems Research Engineer Intern - GPU Programming

Intern developing and optimizing GPU-accelerated kernels for ML/AI applications. Requires strong GPU programming background (CUDA/Triton) and knowledge of performance optimization.

121k – 131kSan Francisco, CAML EngineeringOn-siteEntry levelCUDATriton
Together AI

Research Intern, Inference

Research intern on the Inference team building efficient serving systems for large foundation models. Focus on distributed inference, compiler-aware optimization, and novel inference-time strategies.

121k – 131kSan Francisco, CAML EngineeringOn-siteEntry levelJAXCUDA
Pinterest

Machine Learning Engineer II, Computer Vision Applied Science

Build and fine-tune vision-centric VLMs and generative models using Pinterest's visual-text datasets. Requires 2+ years industry computer vision experience and an M.S. or Ph.D.

139k – 286kSan Francisco, CAML EngineeringRemote2+ YOELLMsRLHF
Mariana Minerals

Staff Machine Learning Engineer

Staff ML Engineer setting technical direction for autonomous mineral refining using reinforcement learning and simulation. Owns modeling, validation, and deployment of control systems on live industrial equipment.

160k – 200kAnn Arbor, MIML EngineeringOn-site8+ YOESimulationDigital Twins
Mariana Minerals

Machine Learning Engineer

Build and deploy reinforcement learning models to autonomously control mineral refining facilities, optimizing recovery rates, energy use, and uptime in real operating plants.

120k – 160kAnn Arbor, MI +2ML EngineeringOn-siteEntry levelPythonDeep Learning