Skip to content

Principal Engineer, Online Systems

Leads reliability, scalability, and modernization of Pinterest's critical online systems including storage, caching, and real-time analytics at massive scale. Drives strategic vision, cross-functional initiatives like Kubernetes migration, requiring 12+ years in distributed systems and expertise in C++, Java, or Python.

285k – 500kSan Francisco, CAPalo Alto, CADevOps / SREHybrid12+ YOE

About the role

What you’ll do:

  • Improve reliability, scalability and infra efficiency for Pinterest’s critical online systems across storage and caching, online service and realtime analytics systems to support continued business growth.
  • Lead cross-functional strategic initiatives to modernize Pinterest’s online serving stack through multi-region deployment, Kubernetes migration, and the adoption of cutting-edge technologies and industry best practices.
  • Quality champion for online systems and the broad engineering organization, holding a high bar on execution quality and operational excellence.
  • Drive the online systems vision and strategy for the next 3 years and beyond, and translate that strategy into cross-functional roadmaps with measurable outcomes.
  • Foster a diverse and inclusive engineering culture that makes all feel welcome, and invest in mentorship, critical thinking, and candid feedback to strengthen individual and organizational growth.

What we’re looking for:

  • 12+ years of software engineering experience with deep expertise in distributed systems, especially online serving, storage, and caching systems: hands-on experience building and operating highly available, reliable, production-grade systems at large scale; strong technical judgment in reliability, scalability, performance, and infrastructure efficiency; proficiency in at least one of C++, Java, or Python.
  • Proven track record driving large-scale technical impact: experience leading reliability and scalability improvements, cost efficiency initiatives, and modernization efforts across critical production systems; able to define technical strategy, influence architecture decisions, and drive execution through cross-functional collaboration, strong communication, and operational excellence.
  • Strong ownership, quality mindset, and thoughtful use of AI: demonstrates high standards for engineering quality, integrity, and accountability for final outcomes; able to use AI to accelerate analysis, debugging, design exploration, or operational workflows while applying critical thinking, validating correctness, and maintaining sound technical judgment rather than outsourcing ownership to tools.
  • Exceptional collaboration skills with cross-functional partners, with the ability to navigate ambiguity, make tradeoffs, and keep stakeholders aligned on priorities and progress.
  • Bachelor’s degree in Computer Science, a related technical field, or equivalent experience.

Skills

KubernetesC++JavaPythonDistributed SystemsStorage SystemsCaching SystemsMulti-Region Deployment

Similar roles

DevOps / SRE jobs

Principal Systems Engineer

Principal Systems Engineer sets technical direction for core infrastructure, owns architecture for reliability and performance at scale, and mentors senior engineers. Requires deep expertise in virtualization, distributed storage like Ceph, and Linux kernel primitives.

280k – 380kNew York, NYDevOps / SREOn-siteQemuCeph

Principal Engineer, Compute Fleet Management

Leads compute fleet management across AWS, Azure, and GCP, optimizing billions of resources for peak performance, 99.99% availability, and 60%+ utilization. Requires deep distributed systems expertise and cross-team leadership for mission-critical infrastructure.

264k – 322kBellevue, WADevOps / SREOn-siteAWSGCP

Principal Production Engineer

Owns reliability, scalability, and observability of cloud infrastructure including compute, storage, and networking at massive scale. Drives SLOs, incident response, tooling, and mentors engineers; requires 15+ years experience with data centers and internet-scale operations.

261k – 326kSan Francisco, CA +1DevOps / SREOn-site15+ YOEBGPOspf

Principal Systems Software Engineer

Leads architecture of next-generation AI infrastructure, unifying BMaaS, IaaS, and CaaS with focus on high-performance I/O paths, kernel optimizations, and GPU workloads. Requires 12+ years hyperscale experience, deep Linux/virtualization expertise, and hardware-software co-design skills.

260k – 340kSan Francisco, CA +1DevOps / SREOn-site12+ YOEKvmQemu

Senior Principal Software Engineer, Infrastructure

Technical visionary architecting Docker's foundational platform for accounts, billing, data, governance, and infrastructure. Drives cross-company strategy enabling enterprise growth, requiring 12+ years experience in large-scale distributed systems.

251k – 352kSeattle, WADevOps / SRERemote12+ YOEAWSGCP