Databricks DevOps / SRE Jobs
Open devops / sre roles at Databricks, pulled live from their hiring system.
View devops / sre jobs across all companies
79% of open devops / sre roles call out Kubernetes; AWS and Azure appear in roughly a third. Most of these devops / sre roles are on-site or hybrid; 7% are fully remote.
Staff Software Engineer - AI Research Infrastructure
Build and operate the large-scale training and inference infrastructure that powers Databricks AI Research, enabling researchers to run experiments across thousands of GPUs. Partner with ML scientists and platform teams to deliver reliable, high-performance orchestration and tooling.
Staff Software Engineer - AI Research Infrastructure
Builds and operates research infrastructure for large-scale AI model training and inference across GPU fleets. Partners with scientists and engineers to create scheduling, orchestration, and dev tooling for efficient experimentation. Requires 5+ years in distributed systems and systems programming.
Senior Software Engineer, Compute Infrastructure
Designs and builds compute abstractions, workload orchestration, and fleet management systems to power Databricks' large-scale infrastructure across clouds. Requires 5+ years in distributed systems, proficiency in Java/Scala/Go/C++, and cloud/container experience.
Sr Staff Production Engineer- Public Sector
Senior Production Engineer owns secure cloud infrastructure, IAM, and automation across AWS, Azure, GCP for public sector and regulated environments. Requires 12+ years experience, cloud expertise, and TS/SCI clearance eligibility.
Staff Production Engineer- Public Sector
Owns secure cloud infrastructure, IAM, and automation across AWS, Azure, GCP for public sector environments. Requires 8+ years experience, deep cloud expertise, IaC tools like Terraform, and TS/SCI clearance eligibility.
Sr Software Engineer, Infrastructure
Senior Software Engineer builds and automates scalable AWS infrastructure, manages Kubernetes clusters, and implements observability frameworks. Requires 5+ years Python experience, IaC expertise, and strong cloud/DevOps skills.
Sr Production Engineer- Public Sector
Senior Production Engineer owns secure cloud infrastructure, IAM, networking, and automation across AWS, Azure, and GCP for public sector and regulated environments. Requires 5+ years experience, cloud expertise, security clearance eligibility, and strong operational skills.
Sr. IT Systems/Automation Engineer
Senior IT Systems/Automation Engineer owns and evolves IT automation platforms, leads mobile security and MDM programs, and delivers automation for onboarding, offboarding, and compliance workflows. Requires 5+ years experience with tools like Tines, Jamf, and Okta.
Sr. Manager - Production Engineering
Lead engineering team managing cloud IAM operations, CSP provisioning, compliance, and security data pipelines across AWS, Azure, GCP. Requires 8+ years in security/cloud engineering, 5+ years management, and BS in technical field.
Sr. Staff Software Engineer, Observability
Develops observability platforms handling billions of time series and petabytes of logs across global cloud regions. Requires 15+ years in systems languages, distributed systems, cloud tech, and mentoring engineers.
Principal Engineer, Compute Fleet Management
Leads compute fleet management across AWS, Azure, and GCP, optimizing billions of resources for peak performance, 99.99% availability, and 60%+ utilization. Requires deep distributed systems expertise and cross-team leadership for mission-critical infrastructure.
Senior Software Engineer - Infrastructure and Tools
Build and extend scalable infrastructure for Databricks' data and AI platform, including multi-cloud systems and Kubernetes at massive scale. Requires 5+ years experience in Java/Scala/Go/C++/Python, distributed systems, and cloud technologies.