Find Your Dream Job

Search through thousands of job postings to find your next opportunity

Date Posted

Job Type

Technology

Work Setting

Salary Range

$0k $100k $200k+

Experience Level

DevOps Engineer

LanceSoft, Inc.

Federal Territory of Kuala Lumpur, Malaysia

Role and Responsibilities

  • AIOps Strategy and Implementation: Develop and execute a comprehensive AIOps strategy to modernize IT operations, focusing on AI-driven incident management and predictive analytics. Collaborate with stakeholders to identify high impact use cases, prioritize initiatives, and define KPIs for success (e.g., reduced MTTR, increased uptime).
  • AIOps Platform Management: Evaluate, configure, and manage AIOps platforms (e.g., BigPanda) to aggregate, normalize, and correlate alerts from monitoring tools. Implement AI/ML models for anomaly detection, root cause analysis, and incident prioritization.
  • Automation Integration: Design and implement automation workflows using tools like Ansible and Rundeck, integrated with AIOps platforms via APIs and webhooks. Develop scripts and playbooks to automate incident remediation, provisioning, and configuration tasks for self-healing systems.
  • Infrastructure as Code (IaC) and Configuration Management: Apply IaC principles using tools like Terraform and Ansible to manage infrastructure resources programmatically. Utilize configuration management tools to ensure compliance and remediate configuration drift across servers and cloud instances.
  • Predictive Analytics and Monitoring: Leverage AIOps platforms to monitor IT environments, predict potential incidents, and optimize performance. Analyse metrics and trends to proactively address bottlenecks and scalability challenges.
  • CI/CD and DevOps Integration: Integrate AIOps and automation solutions with CI/CD pipelines to support automated testing, deployment, and infrastructure changes. Implement version control and audit trails for automation and AIOps artifacts.
  • Documentation and Knowledge Sharing: Document AIOps workflows, automation scripts, and best practices to enable knowledge transfer. Provide training to IT teams on AIOps tools, automation techniques, and incident management processes.

New SRE Jobs

Connecting top SRE talent with leading companies.

For SRE Professionals

For Employers

Company