Staff Platform Engineer

The Robert James Group

United States
Permanent
Remote
$150,000 - $500,000/year
AWS & Infrastructure as CodeTypeScript & Backend ToolingSite Reliability Engineering (SRE)

Role Overview

We are seeking a proactive Staff Platform Engineer to join our engineering team. In this pivotal role, you will design and build self-serve platforms, internal libraries, and automation systems that empower our product engineers to ship features with speed and reliability. You will be responsible for treating infrastructure as a product, ensuring our AI-native mortgage platform scales seamlessly while maintaining peak performance and observability.

Key Responsibilities

  • Infrastructure as Product: Own and evolve our AWS infrastructure using Terraform and Pulumi, focusing on developer experience and self-service capabilities.
  • Internal Tooling: Design and maintain shared TypeScript libraries, SDKs, and CLIs to standardise development workflows across the organisation.
  • Deployment Excellence: Build and manage CI/CD pipelines using Buildkite and implement per-pull-request ephemeral environments to ensure safe, rapid deployments.
  • Reliability & Performance: Drive platform stability through the implementation of SLOs, error budgets, and sophisticated observability stacks using Datadog and distributed tracing.
  • Scaling Operations: Solve complex scaling challenges involving Aurora PostgreSQL database tuning and compute autoscaling for AI-driven workloads.

Required Skills and Qualifications

  • 7–25 years of experience in platform engineering with a heavy focus on internal developer experience (DevEx).
  • Strong background in high-growth technology startups (e.g., Ramp) or a blend of Big Tech and startup experience.
  • Deep proficiency in TypeScript/Node.js for building backend internal libraries and APIs.
  • Expert-level knowledge of AWS (ECS, Lambda, RDS, MSK, IAM) and Infrastructure as Code (Terraform or Pulumi).
  • Proven track record of managing production incident response and reliability engineering.

Nice-to-Have Qualifications

  • Experience with AI-native platform architecture and specific scaling needs.
  • Advanced knowledge of networking security and zero-trust architectures.
  • Contributions to open-source developer tooling or infrastructure projects.