What You’ll Do
- Oversee the daily operations of the Network Operations Center. We have a team of engineers in the US and India.
- Monitor performance and system health to ensure we reach our uptime and reliability metrics.
- Manage incidents and outages. We are the go-to team for when something goes wrong and everyone looks to you to help get things back on track.
- Lead and mentor a team of NOC engineers. You’ve been in this role before and enjoy helping uplift and uplevel your teammates.
- Develop, implement and continually improve NOC processes and procedures. We iterate on everything until it’s working for everyone.
- Collaborate and coordinate with other departments to triage technical issues. You get to interact with the rest of the company more than your typical engineer and enjoy developing those relationships.
- Ensure all incidents are logged, tracked, and resolved in a timely manner. Without strong data, we can’t accurately report problems and hotspots so that we can focus on what matters most. You’ll help lead the charge in ensuring we understand and document our RCAs and get the fixes in place to make sure they don’t happen again.
- Provide regular reports on system performance and incidents. You can take the numbers and the data and craft them into actionable plans for teams to execute on.
- Help build and maintain business metrics databases and dashboards to aid in reporting on health and performance on our platform and processes. You’re comfortable working with databases and using tooling and AI to provide insight into how we’re operating as a business.
- Participate in an on-call rotation as needed.
The Skills and Experience You’ll Bring
- 1-3+ years of experience in network operations, project management, and business analytics with at least 2 years in a leadership role.
- Strong knowledge of cloud infrastructure and networking with providers such as AWS, Google Cloud or Azure.
- Experience with platform observability tools such as Datadog, New Relic, etc.
- Experience with an Incident Management tool such as Rootly, Pagerduty, Opsgenie, io.
- Experience working with business analytics tools such as AWS Quicksight, Looker, Tableau.
- Excellent problem-solving and troubleshooting skills.
- Strong leadership and team management abilities.
- Excellent communication skills, both verbal and written. Must be able to communicate effectively to both technical and non-technical audiences.
- Ability to work under pressure and manage multiple priorities.
Bonus Skills
- Experience with software development or using a scripting language such as python to automate systems.
- Familiarity with ITIL or other service management frameworks.
Compensation
For Washington-based candidates, salary range is $69,000 - $96,000 + target bonus.