Find Your Dream Job

Search through thousands of job postings to find your next opportunity

Date Posted

Job Type

Technology

Work Setting

Salary Range

$0k $100k $200k+

Experience Level

Site Reliability Engineer

HCLTech

United States

HCLTech is looking for a highly talented and self- motivated Site Reliability Engineer (SRE) to join it in advancing the technological world through innovation and creativity.


Job Title: Site Reliability Engineer (SRE)

Job ID: (2540023)

Position Type: Full-time

Location: Frisco TX (Open to Remote USA)


Role/Responsibilities

  • SRE Devops
  • cloud-native solutions
  • OpenSearch (Elasticsearch)
  • CloudWatch
  • EC2
  • VPC
  • IAM
  • Lambda
  • Terraform

Qualifications & Experience

Job Description:

We are seeking a skilled and experienced Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a strong background in DevOps, cloud infrastructure, and observability solutions. You will be responsible for designing, implementing, and maintaining CI/CD pipelines, infrastructure as code, and monitoring systems to ensure the reliability and performance of applications.


Responsibilities:

  1. Define, measure, and automate various SRE metrics such as availability, latency, and system health.
  2. Implement Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs) to ensure system reliability.
  3. Develop and maintain automated solutions for monitoring, alerting, and incident response.
  4. Design, implement, and maintain observability solutions on AWS using OpenSearch, Grafana, and Prometheus.
  5. Develop and manage infrastructure as code using Terraform to provision observability resources.
  6. Configure OpenSearch for centralized logging and build search indices for monitoring and alerting.
  7. Set up monitoring tools like Prometheus to collect and store metrics from various AWS services and applications.
  8. Integrate monitoring data with Grafana for visualization, dashboards, and alerting mechanisms.
  9. Develop automation scripts and CI/CD pipelines for deploying and updating the observability stack.
  10. Collaborate with DevOps, SRE, and development teams to ensure seamless integration of the observability stack with other systems.
  11. Continuously optimize the observability stack for performance, scalability, and cost efficiency.
  12. Provide technical documentation and knowledge transfer to relevant teams.
  13. Ensure security, compliance, and best practices in all observability solutions deployed in AWS.
  14. Deliver engineering solutions with hands-on DevOps experience within Agile SDLC/teams using CI/CD.
  15. Design and implement full-scale transformations to enable automation and continuous integration/deployment for customer applications.
  16. Work with containers using Docker and orchestration tools like Kubernetes; set up Kubernetes clusters.


Required Skills and Qualifications:

  1. 12+ years of experience in DevOps and cloud-native solutions
  2. Strong experience with AWS services or similar cloud providers: OpenSearch (Elasticsearch), CloudWatch, EC2, VPC, IAM, Lambda, etc.
  3. Proficiency in infrastructure as code (IaC) tools like Terraform, including managing state, modules, and resources.
  4. Hands-on experience with monitoring and observability tools: Grafana, Prometheus, and OpenSearch.
  5. Expertise in managing logs, metrics, and tracing with observability solutions.
  6. Proficiency in creating dashboards, visualizations, and alerts using Grafana.
  7. Familiarity with CI/CD pipelines.
  8. Good understanding of networking and security within AWS/GCP/Azure (VPC, subnets, security groups, etc.).
  9. Experience in scripting (e.g., Python, Bash) to automate tasks.


Desired Experience:

  1. Cloud certification
  2. Experience working in large-scale cloud environments with complex cloud architecture.
  3. Familiarity with AWS Lambda for custom log processing or metrics extraction.
  4. Experience with Helm charts for deploying Prometheus and Grafana in a containerized environment.
  5. Experience with cloud cost management and optimization strategies.


Pay and Benefits

Pay Range Minimum: $73,000K per year

Pay Range Maximum: $149,000K per year


HCLTech is an equal opportunity employer, committed to providing equal employment opportunities to all applicants and employees regardless of race, religion, sex, color, age, national origin, pregnancy, sexual orientation, physical disability or genetic information, military or veteran status, or any other protected classification, in accordance with federal, state, and/or local law. Should any applicant have concerns about discrimination in the hiring process, they should provide a detailed report of those concerns to [email protected] for investigation.


Compensation and Benefits

A candidate’s pay within the range will depend on their work location, skills, experience, education, and other factors permitted by law. This role may also be eligible for performance-based bonuses subject to company policies. In addition, this role is eligible for the following benefits subject to company policies: medical, dental, vision, pharmacy, life, accidental death & dismemberment, and disability insurance; employee assistance program; 401(k) retirement plan; 10 days of paid time off per year (some positions are eligible for need-based leave with no designated number of leave days per year); and 10 paid holidays per year


How You’ll Grow


At HCLTech, we offer continuous opportunities for you to find your spark and grow with us. We want you to be happy and satisfied with your role and to really learn what type of work sparks your brilliance the best. Throughout your time with us, we offer transparent communication with senior-level employees, learning and career development programs at every level, and opportunities to experiment in different roles or even pivot industries. We believe that you should be in control of your career with unlimited opportunities to find the role that fits you best.

New SRE Jobs

Connecting top SRE talent with leading companies.

For SRE Professionals

For Employers

Company