Skip to content

Staff Data Scientist | ML

United StatesData ScienceRemote
Summary

Staff Data Scientist on the Pay team building and productionizing ML/LLM models to detect erroneous healthcare claims payments. Owns end-to-end data science work, leads high-impact initiatives, and sets technical direction.

About the role

What You'll Do

  • Solve real business problems with data: Own end-to-end data science work, from problem framing and data exploration to modeling, validation, and production impact.
  • Lead high-impact initiatives: Drive complex efforts such as vendor leakage detection and prevention, delivering measurable revenue or cost improvements.
  • Innovate on methods and approaches: Go beyond standard analyses by developing new metrics, models, or workflows when existing approaches fall short.
  • Build and productionize models: Design, build, and support production ML or LLM-powered solutions in collaboration with engineering and product partners.
  • Work deeply with data: Use SQL fluently to explore large datasets, build reliable data assets, and validate results.
  • Influence cross-functionally: Partner with Product, Engineering, Finance, and Operations to align on goals, tradeoffs, and execution plans.
  • Set technical direction (especially at L7): Raise the bar for data science quality, scalability, and impact across the organization through mentorship, design reviews, and technical leadership.

What You'll Bring

  • Proven track record of hands-on data science impact on real-world problems, not just research or dashboards.
  • Strong proficiency in SQL, with experience working directly on complex, large-scale datasets.
  • Experience building, shipping, or supporting production ML systems, preferably some exposure to LLM-based products or workflows.
  • Ability to independently scope ambiguous problems, identify the right data and methods, and drive work to completion.
  • Strong communication skills, with the ability to explain complex analyses and models to non-technical stakeholders.
Skills
SQLMachine LearningLLMPythonData ExplorationData ModelingProduction ML SystemsData ValidationLarge-Scale DatasetsCross-functional Collaboration