Computational Linguist and Data Annotation Lead

Leads the definition of voice dictation customization and personalization, working with ML scientists on user style preferences. Coordinates an external annotation team for large datasets and analyzes stylistic variations. Requires a Linguistics PhD and multilingual expertise.

120k – 160k | San Francisco, CA | AI Research | Onsite

About the role

Responsibilities

Lead work in defining customization and personalization for Wispr Flow's voice dictation outputs and style.
Work closely with ML scientists to define user preferences, ranging from casual English and Gen Z messaging styles to polished academic writing.
Coordinate a team of external experts to annotate large datasets.
Analyze data to determine variations in style.

Requirements

Linguistics PhD.
Ability to mentor, manage, and give feedback to expert labelers.
Self-awareness about stylistic choices and user preferences.
Multilingual speaker.
Meticulous attention to detail.
Eagerness to learn and build familiarity with LLM development.

Nice-to-haves

Experience in professional editing, teaching, or translation.

Skills

Computational Linguistics, Data Annotation, LLM Development, Multilingual, Style Analysis, Dataset Annotation, ML Collaboration, Linguistics, Personalization, Customization

Similar roles

AI Research jobs

Ataraxis AI

Research Scientist

Develops novel ML models for self-supervised learning, survival analysis, multi-modal learning, causal inference, and interpretability in oncology precision medicine. Requires PhD in ML/statistics, PyTorch expertise, and research publications.

120k – 210k | New York, NY | AI Research | On-site | Python | PyTorch

Bland AI

Machine Learning Researcher, Multimodal LLMs

Develops next-generation multimodal LLMs integrating speech, text, tools, and real-time reasoning for conversational AI agents. Requires strong background in LLMs, multimodal models, fast experimentation, and production deployment experience.

140k – 250k | San Francisco, CA | AI Research | Remote | LLMs | Prompting

Bland AI

Copy of Machine Learning Researcher, Audio

Conducts foundational research and develops scalable ML models for speech-to-text, text-to-speech, and neural audio codecs in real-time voice AI agents. Requires deep expertise in voice modeling, self-supervised learning, and production deployment at enterprise scale.

140k – 250k | San Francisco, CA | AI Research | Remote | TTS | STT

Labelbox

Forward Deployed Research Scientist

Collaborates with frontier AI labs on data strategies, fine-tunes open-weight LLMs, runs ablation studies, and validates data impact for client projects. Requires MS/PhD in ML/NLP/CS, hands-on LLM fine-tuning, and fast-paced experimental rigor.

140k – 200k | San Francisco, CA | AI Research | Hybrid | DPO | LLMs

Astera

Research Scientist - Simplex

Develops theories of intelligence grounded in neural network internal structures, focusing on belief geometries in LLMs and biological brains. Conducts experiments bridging mathematics, ML interpretability, and safety research; requires PhD-level quantitative depth and hands-on coding.

140k – 200k | Emeryville, CA | AI Research | On-site | LLMs | PyTorch