AI Product Engineer
Build agentic capabilities on a petabyte-scale observability platform. Own the full agent stack including context engineering, tool design, evals, and production reliability for incident investigation.
Builds and deploys custom integrations, APIs, and scalable solutions for enterprise clients using Mercor's AI platform. Requires strong engineering skills in modern languages and cloud environments, with customer-facing experience.
Build agentic capabilities on a petabyte-scale observability platform. Own the full agent stack including context engineering, tool design, evals, and production reliability for incident investigation.
As a Senior Software Engineer, AI Data & Evaluation, you will build data infrastructure and evaluation systems for frontier AI models. This role involves designing evaluation methodologies, building synthetic data generation systems, and architecting operational automation.
Build ML models and decision systems for search, ranking, candidate-job matching, and marketplace optimization at a fast-growing AI talent platform.
Builds and operates scalable data pipelines and systems for post-training workflows, model evaluations, and synthetic data generation. Partners with frontier AI labs and customers, requiring strong backend skills in Python/Go/Rust and ML evaluation expertise.
Designs and maintains benchmarks, evaluation systems, and failure analysis workflows for frontier LLMs, focusing on tool use, agentic behavior, and reasoning. Builds rubrics, scorers, and dashboards while collaborating with AI research teams in a fast-paced environment.