Senior Research Engineer - Safety Tooling and Data
Senior Research Engineer building data synthesis, analysis, and management tooling for AI safety model training and evaluation. Requires strong software engineering, statistics, and ML framework expertise.
Key Responsibilities
- Design and implement robust data pipeline tooling that enables frequent, low-friction data generation and annotation
- Create cohesive data infrastructure that supports continuous parallel operation with model experimentation
- Establish standardized processes for data validation, analysis, and improvement of data both training and evaluation
- Collaborate with the ML modeling team to align data capabilities with experimental needs
- Maintain opinionated, well-documented solutions that become team standards
- Develop systematic analysis frameworks to identify incoming data sources and benchmark coverage gaps
Qualifications
- Extremely strong software engineering skills
- Strong statistical skills and experience evaluating scientific experiments related to data collection and model performance
- Proficiency in programming languages such as Python and ML frameworks (e.g., PyTorch, TensorFlow, JAX)
- Demonstrated ability to own complex technical projects from conception to deployment
- Opinionated approach to technical architecture with ability to make principled decisions
- Understanding of ML data requirements and the intersection of data engineering with modeling workflows
Senior Software Engineer, AI Platform
Senior Software Engineer building scalable AI infrastructure, agent orchestration frameworks, evaluation systems, and high-performance LLM serving at Mixpanel. Requires 5+ years experience and hands-on LLM/agent work.
Senior Machine Learning Systems Engineer
Build large-scale ML experimentation and training orchestration platforms, including agentic AI execution systems, to accelerate Ads ML development at Reddit. Requires 5+ years infrastructure experience and 2+ years building production ML platforms.
Staff Software Engineer, Agentic Platform
Senior individual contributor architecting and scaling agentic LLM systems that turn messy manufacturing data into reliable root-cause insights. Owns orchestration, retrieval, evaluation, and guardrails for non-deterministic production systems.