Skip to content

Software Engineering Lead, Machine Learning

Leads development and deployment of ML models for NLP, retrieval, ranking, reasoning, dialog, and code-generation systems. Requires Master's/PhD, production ML experience, deep NLP expertise, Python proficiency, and MLOps knowledge in a fast-paced startup.

135k – 300kCaliforniaML EngineeringHybrid

About the role

Responsibilities

  • Conceptualize, develop, and deploy machine learning models that underpin our NLP, retrieval, ranking, reasoning, dialog and code-generation systems.
  • Implement advanced machine learning algorithms, such as Transformer-based models, reinforcement learning, ensemble learning, and agent-based systems to continually improve the performance of our AI systems.
  • Lead the processing and analysis of large, complex datasets (structured, semi-structured, and unstructured), and use your findings to inform the development of our models.
  • Work across the complete lifecycle of ML model development, including problem definition, data exploration, feature engineering, model training, validation, and deployment.
  • Implement A/B testing and other statistical methods to validate the effectiveness of models. Ensure the integrity and robustness of ML solutions by developing automated testing and validation processes.
  • Clearly communicate the technical workings and benefits of ML models to both technical and non-technical stakeholders, facilitating understanding and adoption.

Requirements

  • Master’s degree or Ph.D. in Computer Science, Machine Learning, or a related quantitative field.
  • Proven industry experience in building and deploying production-level machine learning models.
  • Deep understanding and practical experience with NLP techniques and frameworks, including training and inference of large language models.
  • Deep understanding of any of retrieval, ranking, reinforcement learning, and agent-based systems and experience in how to build them for large systems.
  • Proficiency in Python and experience with ML libraries such as TensorFlow or PyTorch.
  • Excellent skills in data processing (SQL, ETL, data warehousing) and experience working with large-scale data systems.
  • Experience with machine learning model lifecycle management tools, and an understanding of MLOps principles and best practices.
  • Familiarity with cloud platforms like GCP or Azure.
  • Familiarity with the latest industry and academic trends in machine learning and AI, and the ability to apply this knowledge to practical projects.
  • Good understanding of software development principles, data structures, and algorithms.

Compensation (California based)

  • Standard base salary: $135,000-$300,000 annually. Compensation determined by location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for variable compensation, equity, and benefits.

Skills

PythonTensorFlowPyTorchNLPTransformersReinforcement LearningRetrievalRankingMLOpsSQLETLGCPAzure

Similar roles

ML Engineering jobs

Machine Learning Platform Engineer

Build and operate the ML platform to productionize models, enable real-time inference, and manage the full ML lifecycle with MLOps best practices. Requires 3+ years platform engineering and 1+ years owning ML systems end-to-end.

135k – 160kUnited StatesML EngineeringRemote3+ YOEGoKafka

Software Engineer - Forecasting & Scheduling

Builds forecasting interfaces, data pipelines, and scheduling systems for thousands of support agents using ML models. Requires Python ML libraries experience and background in ML/algorithmic teams with focus on performance optimization.

135k – 280kUnited StatesML EngineeringRemoteScipyMLOps

Software Engineer - Forecasting & Scheduling

Develops forecasting interfaces, data pipelines, and scheduling systems to predict support contact volume and optimize agent schedules for thousands, incorporating ML models and constraints like labor laws. Requires Python/ML experience and performance focus.

135k – 280kSan Francisco, CAML EngineeringHybridScipyMLOps

Software Engineer - AI Agents & Platform

Builds autonomous AI agents and platform infrastructure for customer support, enhancing LLM performance with RAG techniques, scaling systems with Golang, and integrating STT/TTS. Requires 5+ years software engineering experience with LLMs.

135k – 280kSan Francisco, CAML EngineeringOn-site5+ YOEGoRAG

ML Engineer

Founding ML Engineer building production ML systems for governance, security, and agentic platform capabilities at Docker. Requires 5+ years applied ML experience shipping systems and 4+ years backend/infra engineering.

139k – 226kPalo Alto, CA +1ML EngineeringRemote5+ YOELLMsRetrieval