Member of Technical Staff - Voice Model

Develop voice AI models for natural, low-latency spoken interactions on the Grok team. Handle data pipelines, model training with JAX/PyTorch, evaluations, and product integrations. Requires Python expertise, large-scale data processing, and distributed systems experience.

150k – 450k · Palo Alto, CA · ML Engineering · Onsite

About the role

Responsibilities

  • Design and execute large-scale speech data curation and processing pipelines, including collection of diverse real-world audio, synthetic data generation, and automated annotation workflows.
  • Work on pre-training and post-training of speech-language models, with targeted enhancements through supervised fine-tuning, reinforcement learning, and other techniques.
  • Build and iterate a comprehensive evaluation framework covering objective metrics, human preference studies, content factuality assessments, real-time interaction quality, and experimentation infrastructure.
  • Work closely with product teams to integrate voice models into applications and real-time environments, define spoken interaction specifications, and handle the full lifecycle from prototype to global-scale deployment.

Basic Qualifications

  • Python expert with deep proficiency in writing clean, efficient code for AI/ML systems.
  • Hands-on experience processing large-scale datasets using tools like Spark and Ray for cleaning, augmentation, and feature extraction.
  • Proficiency in pre-training and post-training speech-language models using JAX/PyTorch, including supervised fine-tuning, reinforcement learning, and optimizations for accuracy, factuality, natural spoken style, detail, and multilingual fluency.
  • Ability to set up and run rigorous evaluation pipelines: objective metrics, human preference studies, content factuality checks, and iterative A/B testing.
  • Experience building or working with large-scale distributed training and inference systems on Kubernetes.
  • Proactive, self-driven attitude; ready to grind in a fast-paced, high-caliber team.

Compensation and Benefits

$150,000 - $450,000 USD base salary, plus equity; comprehensive medical, vision, and dental coverage; 401(k); short- and long-term disability insurance; life insurance; and various perks.

Skills

Python, Spark, Ray, JAX, PyTorch, Kubernetes, Supervised Fine-Tuning, Reinforcement Learning, Speech-Language Models, Data Curation
