What you'll be doing
ClickHouse is the core piece of infrastructure at PostHog. Every product and customer relies on it to ingest, store, and query data.
We need someone to automate, manage, and maintain ClickHouse as we grow towards capturing trillions of events per year and having one of the world’s largest clusters.
This includes ClickHouse operations and scaling infrastructure, as well as node and instance-level performance optimization. Ensure the right hardware deployed at the right time for each workload on ClickHouse.
Build systems and automations for provisioning and scaling of large ClickHouse clusters, handling over 100 PB's of data. Investigate and experiment using the latest hardware that cloud providers have to offer. Use Terraform, Ansible, and Kubernetes to automate dynamic provisioning of instances and work on bleeding edge ClickHouse implementation, like open format backed tables, and query performance tooling.
You’ll fit right in if:
- OLAP Database Experience. Focused on ClickHouse, but strong experience with other OLAP Databases is great. Experience with internals of ClickHouse and other OLAP Databases, not high level users.
- Automating Dynamic Provisioning Instances. Strong experience with Terraform, Ansible and Kubernetes.
- Experience with Scale and Complexity! Building and operating high-scale complex data storage solutions.
- The Stack we need. Python, Terraform, Ansible, Kubernetes, AWS, and Zookeeper (or alternative).