Skip to content

Cloud Engineer

80k – 170kBoston, MAHybrid
Summary

Cloud Engineer builds monitoring solutions across cloud and on-prem environments using Datadog, develops configurations in YAML/JSON, writes scripts in Python/shell/PowerShell, and collaborates with clients. Requires systems administration experience, Linux/Unix knowledge, and bachelor's in CS or equivalent.

About the role

What you'll do

  • Develop quality configurations in YAML and JSON
  • Work closely with clients and team members from diverse backgrounds to deliver projects
  • Build cloud and on-prem monitoring solutions for Infrastructure, APM, and Logs
  • Build grok patterns for custom logs
  • Modify code to send traces and subtraces to Datadog
  • Analyze and maintain existing software applications
  • Write custom scripts in shell, Python, PowerShell
  • Design highly scalable, testable configuration tools

What you have

  • Bachelor's degree or equivalent experience in Computer Science or related field
  • Experience with systems administration
  • Linux/Unix experience with on-prem or cloud systems
  • Experience working on multiple projects simultaneously
  • Self-learner and eager to understand new technologies
  • Experience in a modern monitoring platform is a plus
  • Scripting/programming experience in shell, Python, Go, or PowerShell is a plus
  • Willingness to work from our Boston office three days a week (Tuesday, Wednesday & Thursday)

Compensation

Base Salary: $80,000-$170,000 annually, depending on experience

Benefits

  • 100% Employee Healthcare Coverage (Medical, Dental & Vision)
  • Retirement Plan (5% 401k Match, IRA)
  • Unlimited Paid Time Off (4-week minimum) (Vacation, Sick & Public Holidays)
  • Family Leave (Maternity, Paternity)
  • Equity
  • Hybrid Work Opportunities
  • Fitness & Commuter Subsidies available
  • SL & LT Disability
Skills
YAMLJSONDatadogLinuxUnixPythonPowerShellshellGoGrokSREInfrastructureAPMLogssystems administration
Similar roles at this salary range
All DevOps / SRE jobs →
Kraken

Site Reliability Engineer - AI Agents

Design, build, and operate reliable infrastructure for AI agent workflows and model serving on AWS and Kubernetes. Build platform APIs, SDKs, and self-service tooling while ensuring observability and incident response for production AI systems.

96k – 192kUnited StatesDevOps / SRERemote5+ YOEAWSBash
MongoDB

Cloud Operations Engineer

As a Cloud Operations Engineer, you will ensure the operational success of MongoDB Atlas customers by monitoring, detecting, and resolving incidents. This role involves coordinating with a global team, automating tasks, and contributing to documentation.

90k – 176kUnited StatesDevOps / SRERemote2+ YOEGoAWS
MongoDB

Software Engineer, Developer Productivity

Software Engineer on the Build Team improving developer tooling for MongoDB's database, focusing on build systems like Bazel, performance optimization, and support for multi-language stacks including C++, Rust, Python, and Java. Requires internship experience and interest in AI tools for development acceleration.

78k – 154kNew York, NYDevOps / SREHybridEntry levelC++Rust
Navan

Site Reliability Engineer - 2

Designs, implements, and operates cloud infrastructure, automates toil, and ensures system reliability using SRE practices for a high-growth travel platform. Requires 2+ years SRE experience, hands-on with AWS, Java, Terraform, and AI/ML operations.

86k – 192kPalo Alto, CADevOps / SREOn-site2+ YOEAWSLLM
DAT Freight & Analytics

NOC Engineer I

Leads NOC operations monitoring platform health, managing incidents, mentoring engineers, and creating performance dashboards/reports. Requires 1-3+ years in network ops/business analytics with 2+ years leadership, cloud/observability tool experience.

69k – 96kSeattle, WA +2DevOps / SREHybrid1+ YOEAWSITIL