Senior Machine Learning Engineer

Build and productionize large-scale recommendation systems, NLP/embedding models, and agentic AI workflows. Requires 6+ years ML/NLP experience and expertise in PyTorch/TensorFlow, RAG, and vector search.

165k – 259k | United States | ML Engineering | Remote | 6+ YOE


About the role

What you'll do

Recommendation Systems

Build large-scale recommendation systems using embeddings generated from structured and unstructured data, with methods such as a two-tower architecture
Design performant recommendation systems that scale to millions of recommendations per day across different product features
Use graph-based structures, search, and scoring to enhance recommendation quality

Advanced NLP & Embedding Systems

Fine-tune (LoRA/PEFT), customize, and deploy embedding models (LLMs/SLMs) for multilingual text understanding and semantic search
Architect vector search solutions that enable language-agnostic clustering and classification across global datasets
Build and optimize high-performance retrieval systems using vector databases

MLOps Lifecycle Management

Architect and manage scalable MLOps and LLMOps infrastructure for robust model training, evaluation, deployment, and monitoring systems
Design comprehensive CI/CD pipelines, implement model monitoring frameworks to identify drift patterns, and ensure high availability and fault tolerance
Help establish metrics, experimentation frameworks, and statistical validation approaches for AI system performance

Agentic Workflows & Evaluation

Design and implement agentic systems for automated web extraction, NER, and entity resolution tasks
Build comprehensive evaluation frameworks for agent performance across data acquisition and processing workflows
Create feedback loops that continuously improve agent decision-making and data quality outcomes
Build and scale MCP servers and integrate them into broader AI and product ecosystems

Cross-Functional Collaboration

Own production workflows end to end, collaborating closely with the engineering teams that manage the data, application, API, and MCP layers so that models integrate seamlessly and scale with business needs
Work with Product Management to translate business requirements into scalable ML solutions

What you bring

6+ years hands-on ML/NLP experience (or 3+ years post-PhD/Master's) with at least two delivered, revenue-impacting products in production environments
Expertise in modern AI architectures, including transformer stacks, prompt engineering, RAG systems, vector-based information retrieval, and context engineering
Proven track record of building and managing production systems: architecting and deploying scalable distributed systems of REST- and MCP-based microservices for applications and agents, with observability and monitoring of latency, token utilization, and system reliability
Strong applied research capabilities (PyTorch or TensorFlow) paired with software-engineering rigor (Python) and familiarity with open-weight LLMs (Qwen, Gemma, OSS), embedding models, and vector search technologies (FAISS, Pinecone)
Executive communication skills, with the ability to persuade technical and non-technical audiences through data-driven storytelling; comfortable owning strategy, budget, and cross-functional collaboration
Experience using modern AI development tools (Claude Code, Codex, Cursor) in your engineering workflow to maximize development velocity and code quality

Skills

PyTorch, TensorFlow, Python, RAG, FAISS, Pinecone, Transformers, LoRA, PEFT, LLMs
