Cloud Engineer
Cloud Engineer builds monitoring solutions across cloud and on-prem environments using Datadog, develops configurations in YAML/JSON, writes scripts in Python/shell/PowerShell, and collaborates with clients. Requires systems administration experience, Linux/Unix knowledge, and bachelor's in CS or equivalent.
What you'll do
- Develop quality configurations in YAML and JSON
- Work closely with clients and team members from diverse backgrounds to deliver projects
- Build cloud and on-prem monitoring solutions for Infrastructure, APM, and Logs
- Build grok patterns for custom logs
- Modify code to send traces and subtraces to Datadog
- Analyze and maintain existing software applications
- Write custom scripts in shell, Python, PowerShell
- Design highly scalable, testable configuration tools
What you have
- Bachelor's degree or equivalent experience in Computer Science or related field
- Experience with systems administration
- Linux/Unix experience with on-prem or cloud systems
- Experience working on multiple projects simultaneously
- Self-learner and eager to understand new technologies
- Experience in a modern monitoring platform is a plus
- Scripting/programming experience in shell, Python, Go, or PowerShell is a plus
- Willingness to work from our Boston office three days a week (Tuesday, Wednesday & Thursday)
Compensation
Base Salary: $80,000-$170,000 annually, depending on experience
Benefits
- 100% Employee Healthcare Coverage (Medical, Dental & Vision)
- Retirement Plan (5% 401k Match, IRA)
- Unlimited Paid Time Off (4-week minimum) (Vacation, Sick & Public Holidays)
- Family Leave (Maternity, Paternity)
- Equity
- Hybrid Work Opportunities
- Fitness & Commuter Subsidies available
- SL & LT Disability
Site Reliability Engineer - AI Agents
Design, build, and operate reliable infrastructure for AI agent workflows and model serving on AWS and Kubernetes. Build platform APIs, SDKs, and self-service tooling while ensuring observability and incident response for production AI systems.
Cloud Operations Engineer
As a Cloud Operations Engineer, you will ensure the operational success of MongoDB Atlas customers by monitoring, detecting, and resolving incidents. This role involves coordinating with a global team, automating tasks, and contributing to documentation.
Software Engineer, Developer Productivity
Software Engineer on the Build Team improving developer tooling for MongoDB's database, focusing on build systems like Bazel, performance optimization, and support for multi-language stacks including C++, Rust, Python, and Java. Requires internship experience and interest in AI tools for development acceleration.
Site Reliability Engineer - 2
Designs, implements, and operates cloud infrastructure, automates toil, and ensures system reliability using SRE practices for a high-growth travel platform. Requires 2+ years SRE experience, hands-on with AWS, Java, Terraform, and AI/ML operations.
NOC Engineer I
Leads NOC operations monitoring platform health, managing incidents, mentoring engineers, and creating performance dashboards/reports. Requires 1-3+ years in network ops/business analytics with 2+ years leadership, cloud/observability tool experience.