Senior Software Engineer, Platform

126k – 189k | Seattle, WA | DevOps / SRE | Onsite | 8+ YOE
Summary

Builds the foundational platform architecture for AI research agents, including SDKs, APIs, execution frameworks, and benchmarking infrastructure that enable researchers to develop intelligent systems over scholarly literature. Requires strong Python skills, 8+ years of experience, and expertise in cloud infrastructure and AI integration.

About the role

Your Next Challenge

  • Design and extend APIs that expose structured scholarly data to academic researchers and AI agent workflows
  • Contribute to dashboards and tools for evaluating data quality and model precision
  • Scale our systems in a cost-effective and sustainable manner and be a driving force behind continuous improvements in reliability and automation
  • Improve team velocity through contributions to software design, feature development, platform and framework enhancements, capacity planning, and deployment tooling
  • Help set and maintain best practices in how we measure, monitor, and respond to system availability, latency, and general application health
  • Engage in the entire software development lifecycle, from ideation and design to implementation and testing to deployment and operational support, with a strong sense of ownership for results
  • Collaborate across engineering and research teams to ensure maintainability, test coverage, and robust deployment
  • Communicate effectively across engineering, product, and research teams to align priorities and drive projects to completion

What You’ll Need

  • Bachelor's degree and 8+ years of technical experience; relevant experience may substitute for education
  • Strong Python engineering skills, with experience building production services and developer-facing tools
  • Experience with cloud infrastructure (AWS, GCP, or similar), containerization, and deployment automation
  • Solid understanding of integrating AI-powered capabilities into applications, including agent workflows, memory management, and external tool integrations
  • Experience with analytics and observability tools such as Datadog for monitoring performance, system health, and reliability
  • Familiarity with designing APIs, SDKs, or frameworks that other engineers build on top of
  • Strong communication skills and a sense of ownership for results

Compensation: Base salary range $126,000 - $189,000, plus generous bonus plans.

Skills
Python, APIs, SDKs, AWS, GCP, Containerization, Datadog, AI agent workflows, Observability tools, Deployment automation