Applied AI/ML Engineer

Builds state-of-the-art document processing infrastructure using LLMs, including QA agents, optimizers, multimodal models, and self-correcting systems. Monitors production models, runs experiments, and owns large product areas for real-world customer impact.

200k – 350kNew York, NYML EngineeringOnsite

Apply

About the role

Responsibilities

Design novel LLM techniques for increasing the complexity of use cases and data streams that Extend can be applied to.
Monitor existing models in production, understand what’s working well (and what isn’t), and run experiments to solve those issues.
Build a QA Agent to flag low confidence results.
Deploy an optimizer agent in a loop that improves document performance in the background.
Create multimodal models for document layout awareness.
Develop novel chunking strategies for handling long documents.
Build data pipelines and evaluate model performance.
Build a self-correcting system that automatically gets better over time.
Have complete ownership over the work you do and direct relationship with customers.

Benefits

90% health insurance premium coverage.
Relocation for candidates based outside of NYC.
Unlimited PTO policy.
Daily lunch & snacks covered (plus dinner if late).
Unlimited token / tooling access.
Learning and development investment.

Skills

LLMsMachine LearningAIPythonPyTorchTransformersMultimodal ModelsData PipelinesProduction MlLlm Agents

Similar roles

ML Engineering jobs

Snowflake

AI System Research and Development Engineer - Optimization

Develop and optimize GPU kernels and deep learning systems for LLM training and inference at Snowflake AI Research. Requires 5+ years in GPU/HPC optimization and strong proficiency in PyTorch, TensorFlow, JAX, and CUDA.

200k – 265kBellevue, WAML EngineeringOn-site5+ YOEJAXCUDA

Baseten

Post-Training Research Engineer

Build in-house tooling for post-training custom ML models using advanced techniques like RL and finetuning. Requires deep expertise in transformer training, PyTorch distributed systems, parallelism strategies, GPU performance optimization, and HPC platforms.

200k – 275kSan Francisco, CAML EngineeringHybridJAXRay

Glean

Machine Learning Engineer, Enterprise Brain

Develop ML systems for the Enterprise Brain, focusing on proactive AI for task prediction, automation, and agentic workflows using LLMs and advanced techniques. Requires 3+ years ML experience, Python proficiency, and expertise in evaluation and production systems.

200k – 300kPalo Alto, CA +1ML EngineeringHybrid3+ YOELLMsPython

Cantina

Machine Learning Engineer, Images

Designs, fine-tunes, and deploys image generation models for photorealistic AI bots, optimizing for consistency, latency, and quality. Requires 5+ years software engineering, 2+ years production ML, and expertise in diffusion models like Stable Diffusion and PyTorch.

200k – 265kSan Francisco, CAML EngineeringRemote5+ YOEGCPAWS

Together AI

Research Engineer, Core ML

Research Engineer building production ML systems at the intersection of efficient inference, RL/post-training, and serving engines. Translates algorithms into scalable infrastructure improving latency, throughput, and model quality. Requires 3+ years ML systems experience and advanced degree.

200k – 280kSan Francisco, CAML EngineeringOn-site3+ YOEDpovLLM