ClickHouse Operations Engineer

Automate, manage, and optimize large-scale ClickHouse clusters handling trillions of events and 100+ PB data. Build provisioning systems with Terraform, Ansible, Kubernetes; focus on performance, scaling, and bleeding-edge features.

United StatesDevOps / SRERemote

Apply

About the role

What you'll be doing

ClickHouse is the core piece of infrastructure at PostHog. Every product and customer relies on it to ingest, store, and query data.

We need someone to automate, manage, and maintain ClickHouse as we grow towards capturing trillions of events per year and having one of the world’s largest clusters.

This includes ClickHouse operations and scaling infrastructure, as well as node and instance-level performance optimization. Ensure the right hardware deployed at the right time for each workload on ClickHouse.

Build systems and automations for provisioning and scaling of large ClickHouse clusters, handling over 100 PB's of data. Investigate and experiment using the latest hardware that cloud providers have to offer. Use Terraform, Ansible, and Kubernetes to automate dynamic provisioning of instances and work on bleeding edge ClickHouse implementation, like open format backed tables, and query performance tooling.

You’ll fit right in if:

OLAP Database Experience. Focused on ClickHouse, but strong experience with other OLAP Databases is great. Experience with internals of ClickHouse and other OLAP Databases, not high level users.
Automating Dynamic Provisioning Instances. Strong experience with Terraform, Ansible and Kubernetes.
Experience with Scale and Complexity! Building and operating high-scale complex data storage solutions.
The Stack we need. Python, Terraform, Ansible, Kubernetes, AWS, and Zookeeper (or alternative).

Skills

ClickHouseTerraformAnsibleKubernetesPythonAWSZookeeperOlap Databases

Similar roles

DevOps / SRE jobs

Cursor

Software Engineer, Services Platform

Build platform primitives for service provisioning, deploy tooling, workflow orchestration, and service ownership at a fast-scaling AI coding tool company. Requires experience with durable workflows like Temporal, internal dev platforms, and strong focus on developer experience and reliability.

San Francisco, CA +1DevOps / SREOn-site5+ YOECI/CDTemporal

Beacon AI

Software Engineer, Cloud Infrastructure

Build and operate AWS cloud and LLM infrastructure powering RAG, inference, and data pipelines for an aviation AI platform. Requires strong AWS depth, Python data pipelines, and production LLM experience.

135k – 260kSan Carlos, CADevOps / SREHybrid4+ YOEAWSVpc

Figma

Software Engineer, Traffic

Design, build, and operate scalable distributed systems and edge networks on AWS to handle Figma's growing customer traffic and services. Requires 4+ years building infrastructure at scale, experience with TypeScript or Go, and distributed/traffic systems.

153k – 376kSan Francisco, CA +1DevOps / SRERemote4+ YOEGoAWS

Clickhouse

Cloud Engineer - Product Metrics

Design, build, and operate petabyte-scale distributed systems for product metrics using Golang, Kubernetes, and ClickHouse. Requires 5+ years building scalable systems and 2+ years with Golang.

141k – 230kUnited StatesDevOps / SRERemote5+ YOEGoAWS

Supabase

Postgres Deployment Engineer

Own stability and deployment of PostgreSQL products. Package software with Nix, manage upgrades, optimize CI/CD, and resolve production issues. Requires 3+ years PostgreSQL experience and Nix proficiency.

United StatesDevOps / SRERemote3+ YOECGo