Software Engineering Lead, Machine Learning

Leads development and deployment of ML models for NLP, retrieval, ranking, reasoning, dialog, and code-generation systems at a fast-paced startup. Requires a Master's or PhD, production ML experience, deep NLP expertise, Python proficiency, and MLOps knowledge.

135k – 300k | California | ML Engineering | Hybrid

About the role

Responsibilities

Conceptualize, develop, and deploy machine learning models that underpin our NLP, retrieval, ranking, reasoning, dialog, and code-generation systems.
Implement advanced machine learning techniques, such as Transformer-based models, reinforcement learning, ensemble learning, and agent-based systems, to continually improve the performance of our AI systems.
Lead the processing and analysis of large, complex datasets (structured, semi-structured, and unstructured), and use your findings to inform the development of our models.
Work across the complete lifecycle of ML model development, including problem definition, data exploration, feature engineering, model training, validation, and deployment.
Implement A/B testing and other statistical methods to validate the effectiveness of models. Ensure the integrity and robustness of ML solutions by developing automated testing and validation processes.
Clearly communicate the technical workings and benefits of ML models to both technical and non-technical stakeholders, facilitating understanding and adoption.

Requirements

Master’s degree or Ph.D. in Computer Science, Machine Learning, or a related quantitative field.
Proven industry experience in building and deploying production-level machine learning models.
Deep understanding and practical experience with NLP techniques and frameworks, including training and inference of large language models.
Deep understanding of at least one of retrieval, ranking, reinforcement learning, or agent-based systems, and experience building them for large systems.
Proficiency in Python and experience with ML libraries such as TensorFlow or PyTorch.
Excellent skills in data processing (SQL, ETL, data warehousing) and experience working with large-scale data systems.
Experience with machine learning model lifecycle management tools, and an understanding of MLOps principles and best practices.
Familiarity with cloud platforms like GCP or Azure.
Familiarity with the latest industry and academic trends in machine learning and AI, and the ability to apply this knowledge to practical projects.
Good understanding of software development principles, data structures, and algorithms.

Compensation (California-based)

Standard base salary: $135,000-$300,000 annually. Compensation determined by location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for variable compensation, equity, and benefits.

Skills

Python, TensorFlow, PyTorch, NLP, Transformers, Reinforcement Learning, Retrieval, Ranking, MLOps, SQL, ETL, GCP, Azure

Similar roles

ML Engineering jobs

PrizePicks

Machine Learning Platform Engineer

Build and operate the ML platform to productionize models, enable real-time inference, and manage the full ML lifecycle with MLOps best practices. Requires 3+ years platform engineering and 1+ years owning ML systems end-to-end.

135k – 160k | United States | ML Engineering | Remote | 3+ YOE | Go | Kafka

Assembled

Software Engineer - Forecasting & Scheduling

Builds forecasting interfaces, data pipelines, and scheduling systems for thousands of support agents using ML models. Requires Python ML libraries experience and background in ML/algorithmic teams with focus on performance optimization.

135k – 280k | United States | ML Engineering | Remote | SciPy | MLOps

Assembled

Software Engineer - Forecasting & Scheduling

Develops forecasting interfaces, data pipelines, and scheduling systems to predict support contact volume and optimize agent schedules for thousands, incorporating ML models and constraints like labor laws. Requires Python/ML experience and performance focus.

135k – 280k | San Francisco, CA | ML Engineering | Hybrid | SciPy | MLOps

Assembled

Software Engineer - AI Agents & Platform

Builds autonomous AI agents and platform infrastructure for customer support, enhancing LLM performance with RAG techniques, scaling systems with Golang, and integrating STT/TTS. Requires 5+ years software engineering experience with LLMs.

135k – 280k | San Francisco, CA | ML Engineering | On-site | 5+ YOE | Go | RAG

Docker

ML Engineer

Founding ML Engineer building production ML systems for governance, security, and agentic platform capabilities at Docker. Requires 5+ years applied ML experience shipping systems and 4+ years backend/infra engineering.

139k – 226k | Palo Alto, CA +1 | ML Engineering | Remote | 5+ YOE | LLMs | Retrieval