Skip to content

Senior AI Engineer

160k – 170kNew York, NYML EngineeringOnsite5+ YOE
Summary

Senior AI Engineer building and scaling production GenAI systems for CreditAI, including multi-agent workflows, RAG pipelines, LLM fine-tuning, and AWS deployments. Requires 5+ years of AI/ML experience and deep Python expertise.

About the role

Responsibilities

  • Design and implement multi-agent and agentic orchestration frameworks using agent SDKs such as the Claude Agent SDK, Google ADK, or AWS AgentCore, incorporating tools, external data sources, memory, and state management
  • Build and maintain MCP servers and integrations to extend AI system capabilities with structured tool use and external context
  • Build and optimize RAG pipelines including embedding strategies, vector database, retrieval quality tuning, and cost-aware ingestion design
  • Integrate with managed LLM services across cloud providers to support diverse deployment and cost optimization strategies
  • Fine-tune, optimize, and deploy open-source deep learning models for production use cases, leveraging GPU infrastructure for training and inference
  • Apply systems thinking to design and optimize AI and LLM systems, balancing quality, scalability, latency, cost, and operational complexity, while implementing efficiency improvements using model selection, prompt design, batching, caching, and retrieval strategies
  • Design and implement automated evaluation frameworks to assess LLM system quality, accuracy, and performance across production workloads
  • Apply reinforcement learning techniques (e.g., RLHF, RLAIF) to improve model alignment and task-specific performance
  • Architect and manage high-throughput, real-time data pipelines using Kafka
  • Design, deploy, and scale production AI services on AWS (Batch, Lambda, ECS, S3, etc), applying modern containerization, CI/CD, and infrastructure-as-code practices
  • Implement comprehensive observability frameworks using Datadog — tracking token usage, pipeline latency, error rates, consumer lag, and model performance with actionable alerting
  • Identify and resolve production bottlenecks across distributed systems, including database query optimization, consumer scaling, and LLM throughput tuning
  • Conduct code reviews; contribute to team standards around reliability, testing, and operational excellence
  • Communicate progress, trade-offs, and outcomes to relevant stakeholders
  • Continuously learn and adapt to advancements in NLP and Generative AI

Requirements

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
  • 5+ years of experience as an AI Engineer, Machine Learning Engineer, or applied AI practitioner, with a strong foundation in computer science and algorithms
  • Deep Python expertise with a track record of shipping production systems at scale; strong software engineering practices including clean code, testing, code review, and CI/CD
  • Hands-on experience designing, building, and deploying LLM-driven or GenAI applications, including multi-agent architectures and agentic workflows, with familiarity with vector databases, embeddings pipelines, or semantic search systems
  • Hands-on experience designing and implementing automated evaluation frameworks for LLM systems
  • Solid understanding of machine learning and applied AI concepts, with the ability to take solutions from prototype to production and translate research ideas into scalable, real-world systems
  • Experience with GPUs for model training or inference, including tuning and deploying open-source deep learning models in production; proficiency with PyTorch or TensorFlow for model development and fine-tuning
  • Practical experience with cloud-based deployments and infrastructure tools (e.g., AWS, Docker, GitHub) and an understanding of modern DevOps practices, containerization, orchestration, and caching strategies
  • Strong problem-solving and systems thinking, with the ability to balance trade-offs across model quality, scalability, inference latency, and cost
  • Excellent communication and collaboration skills, with experience working closely with product managers, engineers, and domain experts to deliver actionable technical solutions
  • Strong ownership and initiative, with the ability to independently drive projects from problem definition to delivery

Nice-to-Haves

  • Experience with reinforcement learning techniques (RLHF, RLAIF)
  • Experience with Kafka for real-time data pipelines
  • Experience with Datadog for observability and monitoring
Skills
PythonPyTorchTensorFlowAWSDockerKafkaRAGLLMMulti-agent systemsDatadogGitHubCI/CD
Similar roles at this salary range
All ML Engineering jobs →
Zoox

Machine Learning Engineer - Simulation Framework

Machine Learning Engineer focused on GPU-based simulation frameworks, reinforcement learning, and bridging sim-to-real gaps for autonomous vehicle safety validation. Requires MS/PhD and strong C++/Python experience.

151k – 257kFoster City, CA +1ML EngineeringHybrid7+ YOEJAXC++
Talkiatry

Senior AI Engineer

Build full-stack AI systems including agentic workflows, RAG pipelines, and production infrastructure for mental healthcare applications. Requires 2+ years software engineering experience and 1+ year with LLMs or agentic AI.

170k – 195kUnited StatesML EngineeringRemote2+ YOERAGReact
Grafana Labs

Staff AI Engineer

Staff AI Engineer building and shipping LLM/agent-powered observability features for incident detection, triage, and resolution. Requires strong production software engineering experience plus practical GenAI/LLM application skills.

175k – 220kUnited StatesML EngineeringRemote7+ YOEAWSGCP
Pinterest

Staff Software Engineer, Trends Machine Learning Infrastructure

Lead technical direction for Pinterest's unified AI-powered Trends and Audience Insights platform. Architect scalable ML data pipelines and LLM capabilities while mentoring engineers and driving cross-team integrations.

177k – 365kSan Francisco, CAML EngineeringHybrid8+ YOELLMsCodex
Airbnb

Machine Learning Engineer

Build and deploy cutting-edge Agentic AI and LLM systems to transform Airbnb's customer service experience. Requires PhD or equivalent experience and production ML/AI deployment expertise.

170k – 180kSan Francisco, CA +1ML EngineeringOn-site3+ YOELLMSFT