Skip to content

Senior Research Engineer - Safety Tooling and Data

230k – 380kNew York, NYML EngineeringRemote5+ YOE
Summary

Senior Research Engineer building data synthesis, analysis, and management tooling for AI safety model training and evaluation. Requires strong software engineering, statistics, and ML framework expertise.

About the role

Key Responsibilities

  • Design and implement robust data pipeline tooling that enables frequent, low-friction data generation and annotation
  • Create cohesive data infrastructure that supports continuous parallel operation with model experimentation
  • Establish standardized processes for data validation, analysis, and improvement of data both training and evaluation
  • Collaborate with the ML modeling team to align data capabilities with experimental needs
  • Maintain opinionated, well-documented solutions that become team standards
  • Develop systematic analysis frameworks to identify incoming data sources and benchmark coverage gaps

Qualifications

  • Extremely strong software engineering skills
  • Strong statistical skills and experience evaluating scientific experiments related to data collection and model performance
  • Proficiency in programming languages such as Python and ML frameworks (e.g., PyTorch, TensorFlow, JAX)
  • Demonstrated ability to own complex technical projects from conception to deployment
  • Opinionated approach to technical architecture with ability to make principled decisions
  • Understanding of ML data requirements and the intersection of data engineering with modeling workflows
Skills
PythonPyTorchTensorFlowJAXData PipelinesStatistical AnalysisML Data EngineeringData ValidationExperiment EvaluationSoftware Architecture
Similar roles at this salary range
All ML Engineering jobs →
Airbnb

Staff Machine Learning Engineer

Build and deploy cutting-edge ML and Generative AI systems to transform Airbnb's customer support experience, focusing on LLM fine-tuning, RAG, and intelligent service automation.

212k – 260kSan Francisco, CAML EngineeringRemote9+ YOELLMRAG
Mixpanel

Senior Software Engineer, AI Platform

Senior Software Engineer building scalable AI infrastructure, agent orchestration frameworks, evaluation systems, and high-performance LLM serving at Mixpanel. Requires 5+ years experience and hands-on LLM/agent work.

226k – 306kSan Francisco, CAML EngineeringHybrid5+ YOELLMsMLOps
Twilio

Tech Lead, Applied Research

Tech Lead driving AI R&D and end-to-end delivery of production-ready prototypes using full-stack development, LLMs, and emerging technologies. Requires 10+ years experience and strong autonomy.

228k – 335kUnited StatesML EngineeringRemote10+ YOEGoSQL
Reddit

Senior Machine Learning Systems Engineer

Build large-scale ML experimentation and training orchestration platforms, including agentic AI execution systems, to accelerate Ads ML development at Reddit. Requires 5+ years infrastructure experience and 2+ years building production ML platforms.

217k – 303kUnited StatesML EngineeringRemote5+ YOERayArgo
Axion

Staff Software Engineer, Agentic Platform

Senior individual contributor architecting and scaling agentic LLM systems that turn messy manufacturing data into reliable root-cause insights. Owns orchestration, retrieval, evaluation, and guardrails for non-deterministic production systems.

250k – 270kSan Francisco, CA +1ML EngineeringHybrid7+ YOEMCPobservability