Member of Technical Staff
Hands-on technical leader building and scaling large language models and AI systems. Requires 3-5+ years of AI/ML experience with strong Python and deep learning frameworks.
Responsibilities
- Build collaborative relationships with the Principal and Staff engineering community as well as with engineering and product management leaders, and partner to deliver impact.
- Define the vision and strategy for the organization and have a substantial impact on the vision and strategy of customer and partner organizations.
- Plan and deliver projects that impact multiple organizations, including models and large language model training, pipeline parallelism training of large language models, and fine tuning large language models with truthful data.
- Identify opportunities for technological differentiation, investment, or divestment.
- Ensure organization’s work is aligned with broader company objectives.
- Introduce innovative techniques and analyses to the AI field to facilitate breakthroughs in quantitative reasoning and language understanding.
- Provide mentorship and guidance to senior technical leaders and managers.
- Work on hands-on technical problems including design and implementation.
- Perform cutting-edge research on advanced techniques from AI and deep learning, including neural network architectures, language modeling, and speech recognition.
- Work closely with leaders across the company to deliver impactful projects which may involve work in areas such as machine learning, applied data science, recommendation systems, and information retrieval systems.
Requirements
- Bachelor’s degree in Computer Science, Artificial Intelligence, Machine Learning, Information Technology, or a related field, plus 5 years of progressive post-baccalaureate experience in AI/ML research or development; OR Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, Information Technology or a related field, plus 3 years of experience in AI/ML research or development.
- 3 years of experience in each of the following:
- Developing and applying AI models, including large language models, neural networks, or reinforcement learning.
- Experience in Python and other relevant languages (such as C++, Java), using ML frameworks (such as PyTorch, TensorFlow, or JAX).
- Assessing system performance and designing scalable, efficient solutions for inference and model deployment.
- Optimizing computational efficiency and memory usage through advanced algorithmic techniques or quantization.
- Building and maintaining robust, high-availability infrastructure for AI/ML services.
- Collaborating with cross-functional teams to define technical vision, strategy, and deliver impactful projects.
- Mentoring and guiding senior technical leaders and managers in hands-on technical problem solving.
- Driving innovation by introducing new techniques and analyses to advance AI capabilities in quantitative reasoning and language understanding.
- Leading efforts in resource management, automation, and orchestration for large-scale ML infrastructure.
Compensation
- $324,000 - $396,000 per year
Research Engineer/Research Scientist
Research Engineer/Scientist improving model capabilities for personalized AI experiences. Focus on tool-use, instruction following, evaluations, and training improvements. Requires strong ML engineering and research experience.
Member of Technical Staff
Hands-on technical contributor focused on stabilizing and advancing large language model training, fine-tuning, and research in AI/deep learning. Requires a bachelor's degree and 2+ years of experience with distributed systems, ML infrastructure, and programming in Rust/C++/Python.
Principal Research Engineer, Post-Training
Lead technical vision and execution for post-training systems that transform foundation models into intelligent, engaging products. Drive alignment algorithms, RL, and infrastructure for large-scale LLM training and serving.
Research Engineer / Research Scientist
Research and develop improvements to models' personalization and agentic capabilities through reinforcement learning, dataset creation, and post-training methods. Requires strong ML engineering skills and research experience with novel models.