Search through thousands of job postings to find your next opportunity
No technologies match your search.
SAP
Bengaluru, Karnataka, India
Posted 1mo
Indegene
Poland
Posted 1mo
Readiness IT LATAM - una empresa CONKORD
Providencia, Santiago Metropolitan Region, Chile
Posted 1mo
Xero
Seattle, WA
Posted 1mo
Xero
Denver, CO
Posted 1mo
Akamai Technologies
United States
Posted 1mo
Hirenza
United States
Posted 1mo
Sanderson Government & Defence
England, United Kingdom (Remote)
$65,000.00 - $75,000.00
Posted 1mo
Cloudbeds
Romania
Posted 1mo
Informatech Pty Ltd
Canberra, Australian Capital Territory, Australia
$160,000.00 - $200,000.00
Posted 1mo
Position - Senior Site Reliability Engineer
Location – Charlotte, NC onsite
Contract to hire (C2H)
6 months contract after that convert into fulltime.
Job Description:
We are seeking a Senior Site Reliability Engineer (SRE) with deep expertise in AWS networking, infrastructure automation, and production system reliability and ability to lead when neededd. This role demands a strong grasp of observability, operational excellence, and the ability to drive the adoption of DevOps/SRE best practices across engineering teams. You will be instrumental in shaping SLIs/SLOs, defining our DevOps maturity roadmap, and building robust, scalable infrastructure using Terraform, Lambda, Step Functions, and more.
You’ll be leading a team of SREs and collaborating closely with DevOps, Security, and Application teams to ensure reliable delivery and availability of services. Lead and mentor a team of SREs in developing scalable infrastructure and operational processes. Design and implement SLIs, SLOs, and Error Budgets across critical services and evangelize them across product teams. Architect and manage AWS networking environments including VPCs, Transit Gateways, Route 53, VPNs, NACLs, and Security Groups. Manage and monitor Palo Alto and FortiGate firewalls, and integrate them with cloud environments for hybrid network visibility. Define and evolve DevOps maturity models, guiding teams toward higher automation and reliability. Build and manage observability dashboards using Grafana, CloudWatch and Datadog to track application and infrastructure health. Implement and maintain Infrastructure as Code (IaC) using Terraform to automate cloud deployments across environments. Develop and maintain serverless applications using AWS Lambda and Step Functions to support platform automation and operations. Collaborate with developers to define GitLab CI/CD pipelines and streamline the build, test, and deployment lifecycle. Champion incident response, blameless postmortems, and continuous improvement initiatives. Write scripts in Python or Bash to automate tasks and integrate systems
REQUIREMENTS
Nice to have Skills: