
Software Engineer, Data

240k – 280k · Palo Alto, CA · Onsite · 1+ YOE
Summary

Build scalable data platforms for AI model training pipelines, covering data acquisition, preparation, quality evaluation, and delivery. Collaborate with ML and data engineers. Requires 1+ years of software experience; strong coding skills in Python, Rust, or similar languages are preferred.

About the role

Responsibilities

  • Develop a highly reliable and scalable enterprise data platform to orchestrate data acquisition, preparation, training, quality evaluation, and delivery for model training
  • Create new features such as data lineage, visibility, and monitoring for end-to-end training that improve the quality of the data and model performance
  • Collaborate with peers on architecture, design, and code reviews
  • Build prototypes to prove out key design concepts and quantify technical constraints
  • Own all aspects of software engineering and product development
  • Deep dive into business problems, find efficient solutions and apply first principles thinking

Basic Qualifications

  • Bachelor's degree in computer science, data science, engineering, math, physics, or another scientific discipline; OR 2+ years of professional experience building software in lieu of a degree
  • 1+ years of experience in application development, software engineering, data engineering, or data science

Preferred Skills and Experience

  • Programming experience in Python, Rust, Java, C#, Scala, Go or similar languages
  • Frontend experience in Angular, React, or similar JavaScript frameworks
  • Hands-on experience with Kubernetes and containerized deployments
  • Experience with Ray, AI training and orchestration
  • Experience with relational and non-relational databases and data lakes, e.g. PostgreSQL, Iceberg, ClickHouse, or similar
  • Experience with data exploration tools like Grafana, Superset, or similar
  • Good understanding of version control, testing, continuous integration, build, deployment and monitoring
  • Good understanding of statistics, machine learning algorithms and frameworks

Compensation and Benefits

  • Annual Salary Range: $240,000 - $280,000 USD
  • Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks
Skills
Python, Rust, Java, Kubernetes, Ray, PostgreSQL, Iceberg, ClickHouse, Grafana, Superset