Find Your Dream Job

Search through thousands of job postings to find your next opportunity

Date Posted

Job Type

Technology

Work Setting

Salary Range

$0k $100k $200k+

Experience Level

Lead DevOps/SRE

Yomly

Dubai, Dubai, United Arab Emirates

Why Yomly?

At Yomly, we’re redefining HR technology by empowering leaders, people teams, and employees to work smarter and more effectively. Our platform delivers HR and payroll solutions that provide deeper insights and enhance operational efficiency. We take pride in building reliable, secure, and high-performance systems that scale globally.

As we grow, our focus on platform reliability is sharpening. You’ll join at a formative moment — helping shape the systems, tools, and team culture that make reliability real at scale.


What you'll do?

As Lead Reliability Engineer, you’ll play a dual role: part hands-on builder, part team enabler. You’ve chosen the management path because you care about people and systems — and you’ll do both here. You’ll lead a small but growing team of platform reliability engineers and devops specialists, working closely with security, QA, and product engineering to raise the bar across performance, scalability, and incident resilience.


Team and Practice Leadership

  • Line-manage a small team of engineers with a focus on support, growth, and psychological safety;
  • Shape ways of working: how we plan, prioritize, deliver and reflect;
  • Create an environment where engineers can do their best work, balancing quality and velocity in a fast-evolving platform;
  • Be a confident voice for reliability principles — not just what to do, but how to improve how we do it.


Platform and Tooling

  • Guide technical direction for our AWS-native stack (EKS, Aurora, Terraform, etc.);
  • Champion reliability design patterns — from resilience strategies and failover models to alerting and rollback standards — and guide their adoption across teams;
  • Lead delivery of scalable, automated, and observable infrastructure;
  • Actively contribute code and reviews — especially in early, complex, or cross-cutting work;
  • Drive improvements in CI/CD, incident response, and developer experience.


Observability and Operational Readiness

  • Advance our monitoring maturity: improve dashboards, alerts, and instrumentation;
  • Use incidents and reviews as coaching opportunities to build operational muscle across the organisation;
  • Help define and socialize early SLOs — bringing them to life in service of customer experience.


Strategy Support

  • Partner with the engineering leadership team to translate big-picture vision into grounded team plans;
  • Help the team focus on high-leverage work, supporting thoughtful tradeoffs that enable meaningful impact even when decisions are complex or imperfect, while pacing change responsibly;
  • Work directly with external vendors (e.g., AWS, New Relic, PagerDuty) to understand costs and capabilities. Keep the team informed on spend patterns and raise concerns and opportunities early;
  • Be intentional about tech debt, toil, and where we’re investing engineering energy.


Mindset & Culture Fit

We’re building a culture of ownership, urgency, and grit. We value people who run toward problems, not away from them. You’ll thrive here if you take initiative, hustle to unblock what’s in your way, and push for impact—even when the path isn’t fully clear.


Qualifications:

CRITICAL:

  • You will have played a key role in Platform, DevSecOps, or SRE work within AWS-based web architecture — ideally in a B2B environment serving at least 10,000 daily active users.
  • You’ve already progressed to senior engineer or beyond, with proven experience supporting a team both from the perspective of an individual contributor and more recently as a manager.
  • You’ve seen both sides — enabling others while still delivering high-impact technical work yourself.


Additional Skills and Experience That Will Help You Succeed

  • Strong foundation in AWS infrastructure, including EKS and RDS/Aurora;
  • Experience with infrastructure-as-code (preferably Terraform);
  • Familiarity with observability tooling and practices (e.g., CloudWatch, Prometheus);
  • Hands-on experience improving CI/CD workflows and release automation;
  • Understanding of SLOs, error budgets, and incident management practices;
  • Comfort balancing pragmatic delivery with long-term technical soundness;
  • Enthusiasm for coaching and building a high-trust, collaborative engineering culture.

New SRE Jobs

Connecting top SRE talent with leading companies.

For SRE Professionals

For Employers

Company