Skip to content

Senior Software Engineer, Applied AI (Forward Deployed)

180k – 264kNew York, NYHybrid7+ YOE
Summary

Senior forward-deployed engineer builds and deploys AI-powered workflows using LLMs and ML frameworks for client operational problems. Requires 7+ years experience in Python, data/ML systems, deployment infrastructure, and client-facing skills; hybrid in NYC area.

About the role

What You’ll Do

  • Collaborate with delivery leaders to scope technical solutions to operational problems
  • Identify workflow optimizations through deep engagement with customer problems and work to build into a stable and scalable solution
  • Design and implement AI-powered workflows using LLMs, embedding models, retrieval systems, and automation tools
  • Translate messy real-world constraints (e.g., inconsistent data, latency requirements) into elegant engineering solutions
  • Iterate quickly based on real-time feedback from operators and clients
  • Build reusable tooling and infrastructure that accelerates future deployments

What We Need

  • 7+ years of software engineering experience, including significant time spent building data, ML, or backend systems
  • Python & ML/LLM Frameworks: Deep proficiency in Python with hands-on experience using Hugging Face, LangChain, OpenAI, Pinecone, and related ecosystems
  • Deployment & Infrastructure: Skilled in full-stack and API-based deployment patterns, including Docker, FastAPI, Kubernetes, and cloud environments (GCP, AWS)
  • Platform Orchestration: Experienced with workflow orchestration libraries, pub/sub systems (Kafka), and schema governance
  • Data Management: Expertise in data governance and operations, including Unity Catalog and policy management, cluster/job orchestration, data contracts and quality enforcement, Delta/ETL pipelines, and replay processes
  • Strong product and distributed systems skills — you understand business needs and how to translate them into technical architecture
  • Experience building usable systems from messy data and ambiguous requirements
  • Excellent communication and client-facing skills; you’ve led conversations with technical and non-technical stakeholders alike
  • Proven experience owning projects from scoping through deployment in ambiguous, high-stakes environments
  • Be willing to be on-call for our customers when situations arise
  • Strong engineering background demonstrated by a Bachelor’s degree in Data Science, Computer Science and related fields OR equivalent professional experience
Skills
PythonHugging FaceLangChainOpenAIPineconeDockerFastAPIKubernetesGCPAWSKafkaUnity CatalogLLMs
Similar roles at this salary range
All ML Engineering jobs →
Notable

AI Platform Engineer

Design, build, and maintain LLM integrations powering AI features. Own end-to-end delivery from requirements through production monitoring with focus on scalability and reliability.

170k – 205kSan Mateo, CAML EngineeringHybrid5+ YOEGKEHelm
Hinge Health

Staff Machine Learning Scientist

Own ML systems for send-time optimization, propensity modeling, and nudge decisions at consumer scale. Set experimentation standards and mentor a small ML team.

205k – 307kSan Francisco, CAML EngineeringHybrid7+ YOESQLdbt
Docker

Staff ML Engineer

Founding Staff ML Engineer building production ML systems for governance, security, and agentic platform capabilities at Docker. Owns architecture, data pipelines, evaluation, and model lifecycle while mentoring the growing team.

205k – 330kPalo Alto, CA +1ML EngineeringRemote8+ YOELLMsRetrieval
Nuance Labs

Member of Technical Staff - Research Fellow

3-month research fellowship for early-career researchers working on frontier Multimodal LLMs, generative modeling, and real-time audiovisual AI. Own a research problem in pretraining, post-training, RL, evaluation, or multimodal modeling. Strong PyTorch and first-author tier-1 paper required.

200k – 250kSeattle, WAML EngineeringOn-sitePyTorchDeep Learning
Snowflake

Senior Software Engineer — LLM Post-Training Platform

Build and scale Snowflake's Cortex Training LLM post-training platform, handling distributed GPU scheduling, orchestration, and productionizing research for enterprise-scale model adaptation.

200k – 288kBellevue, WAML EngineeringOn-site5+ YOERayFSDP