Find Your Dream Job

Search through thousands of job postings to find your next opportunity

Date Posted

Job Type

Technology

Work Setting

Salary Range

$0k $100k $200k+

Experience Level

Site Reliability Engineer (SRE) - grok.com & API

xAI

London, OH

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. The team is small, highly motivated, and focused on engineering excellence with a flat organizational structure. The Site Reliability Engineer will work on backend services powering grok.com and the API, focusing on highly scalable and reliable services processing tens of thousands of queries per second hosted on Kubernetes clusters (on-prem & cloud). The ideal candidate has expert knowledge of Kubernetes, continuous deployment systems (Buildkite, ArgoCD), monitoring technologies (Prometheus, Grafana, PagerDuty), and infrastructure as code technologies (Pulumi or Terraform). The role is based in London or Palo Alto, with usual office work 5 days a week but allowing work-from-home days when required. London candidates must attend late meetings weekly. The interview process includes a CV review, a 15-minute phone interview, and two technical interviews conducted via Google Meet. Benefits include competitive cash compensation, xAI equity, and private health and dental insurance.

New SRE Jobs

Connecting top SRE talent with leading companies.

For SRE Professionals

For Employers

Company