Find Your Dream Job

Search through thousands of job postings to find your next opportunity

Date Posted

Job Type

Technology

Work Setting

Salary Range

$0k $100k $200k+

Experience Level

Director of Site Reliability Engineering

Walmart Global Tech

Bentonville, AR

Are you passionate about pioneering cutting-edge technology leveraging GenAI and big data to revolutionize Walmart’s customer service experiences? Do you dream of working on innovative systems that make a significant impact on hundreds of millions of customers across the globe? We are seeking a visionary and hands-on Director of Site Reliability Engineering (SRE) to lead and scale a world-class SRE organization. This leader will be responsible for building a high-performing team, driving operational and engineering excellence, and ensuring the availability, scalability, and performance of our systems.



About Team: Customer Care Technology


The Customer Care Technology team builds best-in-class customer service experiences for hundreds of millions of Walmart customers and customer service agents globally. We are a group of software engineers, data scientists, and machine learning experts pushing the boundaries of GenAI technology in complex enterprise applications. The Customer Care Technology team is part of the Enterprise Business Systems organization in Walmart Global Tech. We partner with our product and business teams to drive significant measurable business impact. Our mission is to help customers save money and live better.


What you'll do:


  • Build and Lead a High-Impact Team: Recruit, mentor, and retain top-tier Distinguished, Principal, and Staff-level SREs. Foster a culture of ownership, innovation, and continuous improvement.
  • Champion Operational Excellence: Establish and uphold best practices to ensure system reliability, availability, and performance. Drive incident response, root cause analysis, and postmortem processes that raise the operational bar.
  • Drive Engineering Excellence: Implement and scale CI/CD pipelines, enforce robust unit and integration test coverage, and promote engineering practices that accelerate delivery without compromising quality.
  • Develop Scalable Systems and Processes: Build tools, dashboards, and metrics to proactively monitor system health, detect anomalies, and automate remediation. Lead retrospectives to identify systemic improvements.
  • Foster a Strong Team Culture: Create an inclusive, collaborative, and high-trust environment. Provide coaching and career development opportunities to help engineers grow and thrive.
  • Communicate Vision and Strategy: Effectively communicate vision and strategy to cross-functional teams, from senior leadership to partner teams and engineers.



What you'll bring:


  • Bachelors, Masters, or PhD from a reputed institution.
  • 10+ years of software engineering and/or site reliability experience in a related industry.
  • 5+ years of experience managing and mentoring engineering teams.
  • Proven experience leading SRE or infrastructure teams at scale.
  • Deep understanding of distributed systems, cloud infrastructure, and DevOps practices.
  • Strong leadership, communication, and cross-functional collaboration skills.
  • Track record of building high-performing teams and delivering reliable, scalable systems.
  • Excellent verbal and written communication skills, adept at communicating with executive levels, peers, and subordinates.
  • Demonstrated history of customer obsession and an agile mindset.
  • Strong sense of ownership and urgency.

New SRE Jobs

Connecting top SRE talent with leading companies.

For SRE Professionals

For Employers

Company