Application Deadline expected to close June 13, 2025
Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received.
Meet the Team
As Director, Site Reliability you’ll be joining a high-performing, multi-functional leadership group that includes engineering, product, design, and operations leaders. The broader team is composed of experienced engineers who value innovation and accountability. You'll lead and collaborate with teams that are deeply invested in building scalable, user-friendly products and infrastructure.
Your Impact
We are seeking an experienced leader to encourage and guide a high-performing team dedicated to ensuring the reliability and scalability of cloud services, with a focus on a rapidly growing next-generation project. The ideal candidate will have hands-on SRE or systems/network administration experience, with familiarity in AWS. This role involves close collaboration across product engineering, service engineering, and SRE teams in a high-trust, well-coordinated environment.
Responsibilities
Lead the SRE team to ensure high availability, scalability, and performance of cloud services
Serve as an escalation point for critical incidents, coordinating resolution efforts with multi-functional team
Develop and maintain service level objectives, service level indicators and key performance indicators.
Coordinate on-call rotations and manage the complexities of "the pager"
Collaborate with engineering team to implement reliability features and supportability improvements
Foster a culture of continuous improvement with a strong focus on automation
Build and implement strategies for capacity planning, performance tuning, and cost management in a cloud environment, particularly AWS
Minimum Qualifications
5 years of experience in an SRE, system administration, or network administration role
Strong understanding of cloud computing, particularly AWS
Proficiency in scripting languages (Python, Shell, Go) and infrastructure-as-code tools (Terraform, Ansible)
Conceptual knowledge of containerization and orchestration technologies like Docker and Kubernetes
Outstanding troubleshooting skills across network, application, and hardware systems
Preferred Qualifications
Ability to successfully connect with team members who have strong and individualistic personalities
Excellent collaboration skills and the ability to bring out the best in a technically diverse team
Pay Range: 231600 USD - 330000 USD
Why Cisco
At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint. Simply put – we power the future.
Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.