Senior Software Engineer, Observability
Senior engineer on the Auth0 Platform Observability team responsible for designing, building, and maintaining scalable observability infrastructure (metrics, logs, traces) using Datadog, Terraform, and OpenTelemetry.
Responsibilities
- Champion observability best practices, acting as an educator who can effectively correct anti-patterns and teach other engineering teams how to build robust, standardized instrumentation.
- Be an expert in running services in production environments.
- Contribute to the process of designing services for high growth and high availability.
- Provision, configure, and monitor cloud-native infrastructure and services.
- Design, build, and maintain scalable observability infrastructure using tools like Terraform.
- Troubleshoot performance issues and operational issues.
- Automate operational tasks and improve scripts.
- Assist with and provide feedback for performance testing and automation.
- Actively participate in major incident response to diagnose root causes and identify critical gaps in current telemetry tooling.
- Act as a technical leader, driving cross-team initiatives to improve instrumentation and observability standards across the broader engineering organization.
Requirements
- 5+ years of platform engineering, SRE, or DevOps experience.
- Experience with cloud infrastructure like AWS, Google Cloud, or Azure.
- Expertise in the Datadog ecosystem (Metrics, Logs, Traces, and Error Tracking), including establishing alerting standards, implementing tagging taxonomies, and managing Datadog configurations via Terraform.
- Strong coding skills in Node.js or Golang.
- Experience with containerization and orchestration tools (e.g., Docker, Kubernetes).
- A data-driven approach to debugging complex, cross-service performance bottlenecks.
- Deep understanding of microservice architecture and best practices.
- Experience in coaching and mentoring more junior engineers.
- Proven ability to lead cross-functional technical initiatives and collaborate seamlessly with multiple engineering teams.
- Hands-on experience with OpenTelemetry (OTel), Vector, or similar frameworks for instrumenting applications.
Senior Manager, DevOps
Lead DevOps strategy and team to improve engineering velocity, platform reliability, and operational efficiency across multi-cloud (AWS/GCP) environments. Drive IaC, Kubernetes delivery, observability, AI-powered tooling adoption, and cross-functional collaboration.
Software Engineer, Dev Velocity
Build internal developer platform, tooling, and automation to accelerate engineering velocity. Focus on CI/CD pipelines, test infrastructure, build systems, and metrics to help engineers ship faster and more reliably.
Software Engineer, Cloud Infrastructure
Build and operate AWS cloud and LLM infrastructure powering retrieval-augmented generation, vector search, and ML pipelines for aviation AI systems. Requires strong AWS depth, Python data pipelines, and production LLM experience.
Senior DevOps Engineer
Senior DevOps Engineer managing CI/CD automation, infrastructure as code, and cloud-native deployments on Azure/AWS with Kubernetes, Terraform, and observability tooling. Requires 5+ years DevOps experience and a CS bachelor's or equivalent.