Catio

Senior SRE / DevOps Engineer

Catio
Engineering|Remote

Senior SRE / DevOps Engineer

Posted Oct 24
RemoteFull-timeSenior (5-8 years)40h/wkEquity

About the Role

Architect the Future of Tech Stack Intelligence

Catio is building the world’s first AI-powered Copilot for Tech Architecture—equipping CTOs, architects, and developers with a data-driven platform to evaluate, plan, and evolve their tech stacks with unparalleled intelligence. We're backed by top-tier VCs and led by industry experts, and we're looking for builders who want to define a new category.

We are seeking a seasoned Senior SRE / DevOps Engineer to join our foundational team. You will be instrumental in architecting the operational backbone of our platform, establishing the core of our AWS-based cloud operations and infrastructure-as-code strategy.

Your Mission

At Catio, SRE is not about tickets and manual operations; it's about engineering excellence. You will be a software engineer who builds the platforms, tools, and automation that enable reliability and velocity at scale. Your focus will be on engineering resilience into our systems and creating self-service infrastructure that empowers our entire engineering team to innovate quickly and securely.

This is a unique opportunity to shape the infrastructure of a deeply technical product built for engineering leaders. You will:

  • Design, implement, and administer a secure, scalable, and cost-effective AWS infrastructure that serves as the bedrock of our AI platform.
  • Develop and champion our infrastructure-as-code strategy using tools like Terraform and Helm to manage and evolve our cloud environments with precision and repeatability.
  • Define and deploy sophisticated observability pipelines and dashboards across metrics, logs, and traces, with a preference for Splunk, to provide deep insights into system health and performance.
  • Author clear, concise internal documentation and architectural decision records (ADRs) that become the blueprint for our infrastructure.
  • Collaborate closely with product and engineering teams to ensure our infrastructure capabilities are a catalyst, not a constraint, for product innovation.
  • Operate with a high degree of autonomy, proposing and executing on scalable, secure, and production-ready solutions that anticipate future needs.

Must Haves

Essential requirements for this position

  • 4+ years of experience managing Kubernetes clusters
  • 4+ years of experience in SRE, DevOps, or Cloud Infrastructure roles with a strong AWS and Kubernetes focus.
  • Advanced expertise in cloud architecture design and administration of core AWS services (VPC, IAM, ECS/EKS, RDS, CloudWatch, etc.).
  • Strong understanding of monitoring, logging and observability frameworks, preferably Splunk.
  • Proficient in infrastructure-as-code frameworks such as Terraform, Pulumi, or AWS CDK.
  • Proven track record of owning production infrastructure and driving operational excellence at high-growth startups or SaaS companies.
  • Experience setting up and managing CI/CD pipelines and security best practices in cloud environments.
  • Excellent communication skills with the ability to distill complex infrastructure topics into clear written reports and dashboards.
  • Self-starter mindset and thrives in fast-paced, early-stage environments.

Nice to Haves

Preferred qualifications and extras

  • Exposure to compliance frameworks such as SOC 2.
  • Experience with Steampipe.
  • Interest in architectural visualization, cloud governance, or cost optimization tooling.

Relevant Skills

AWSKubernetesInfrastructure as Code (IaC)TerraformHelmSplunkObservabilityCI/CDCloud SecurityPrometheusGrafanaCloudWatch

Equal Opportunity

Catio is an equal opportunity employer committed to diversity and inclusivity. We never discriminate based on race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. If you require reasonable accommodations due to religious beliefs, pregnancy, or disabilities, let us know at any time.