Find Your Dream Job

Search through thousands of job postings to find your next opportunity

Date Posted

Job Type

Technology

Work Setting

Salary Range

$0k $100k $200k+

Experience Level

DevOps Support Engineer

Solv Kenya

Nairobi, Nairobi County, Kenya

DevOps – 80%

  • Infrastructure Automation: Develop, implement, and maintain infrastructure automation scripts using tools such as Terraform, Ansible, or Chef.
  • CI/CD Pipeline Management: Design, implement, and optimize continuous integration and continuous deployment (CI/CD) pipelines to ensure efficient and reliable code build and release using Bitbucket/ GitHub and Jenkins.
  • Cloud Infrastructure Management: Manage and scale cloud-based infrastructure (e.g., AWS) to meet the needs of Solv production environment, including provisioning, monitoring, and cost optimization.
  • Monitoring & Logging: Implement and maintain monitoring and logging systems to ensure system health, performance, and security. Use tools such as Prometheus, Grafana, ELK stack, CloudWatch or any other monitoring and logging tools adopted by Solv Kenya for the same purpose.
  • Collaboration: Work closely with developers, QA engineers, NOC engineer, project manager and system administrators to improve the development lifecycle, including setting up and maintaining version control systems, automating testing, and managing deployments.
  • System Performance Optimization: Identify bottlenecks in system performance and propose solutions for improvements, ensuring high availability and reliability of services.
  • 100% Security & Compliance: Ensure the security of the infrastructure by implementing best practices, including patching, vulnerability scanning, and access control management. Ensure compliance with security and regulatory standards in use at Solv Kenya as well as those cascaded by SCV.
  • Incident Management: Respond to incidents in production environments, troubleshoot and resolve issues, and contribute to root cause analysis and resolution.
  • Documentation: Maintain detailed, accurate and up-to-date documentation related to infrastructure, deployments, and procedures in Confluence.
  • Technical and Project Support: Support automations of on cloud infrastructure needed to deliver integrations with third parties as well as Identify technical problems and develop, test and deploy software updates and fixes.

Production Support – 20%

  • Work with the development team to automate manual support processes Provide strong incident management during outages to improve recovery time Analysis of incidents to identify underlying trends and focus areas.
  • Support the design and development of standard processes to eliminate pain points, issues and causes of current Production Management processes.
  • Facilitate industry-standard Root Cause Analysis (RCA) exercises because of, critical incidents and initiating the Problem Management cycle
  • Interact directly with IT leaders, managers and key stakeholders to communicate status on active major incidents proactively or problem tickets.
  • Provide ad-hoc reports to the business where this will add value or reduce costs.
  • Carries out detailed technical analysis of the production to identify performance, stability and resilience enhancements.
  • End- user Support on production systems with prompt solutions that allow end-users to provide the best customer experience
  • Perform any other duties assigned by CTO from time to time




Requirements

  • Bachelor’s Degree (B. Tech/ B.E/ BCA) degree
  • 2-3 Years of experience in NOC Support
  • Understanding on ITSM, JIRA
  • Good understanding of cloud technologies IT Infra, like- AWS, Azure
  • Strong knowledge of version control systems (e.g., Git).
  • Experience with CI/CD tools (e.g., Jenkins, GitLab CI, CircleCI, TravisCI).
  • Proficient in scripting languages such as Python, Bash, or PowerShell.
  • Familiarity with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes).
  • Experience with system monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack, Splunk).
  • Strong analytical skills and a desire to learn new concepts and technologies and apply them.
  • Strong attitude to take ownership and responsibility for the production servers/services. Linux: Good knowledge of Linux systems
  • AWS: Good knowledge of AWS services EC2, VPC, S3, ELB, IAM. Good to have any scripting knowledge such as Python, Bash, or PowerShell
  • Fundamentals: Basic Networking & Security, TCP/UDP, IP Routing, Application Protocols: SMTP, HTTP, HTTPS, SSH, FTP, SFTP
  • Experience in monitoring and system management of critical real-time applications Strong analytical skills
  • Understanding of commerce line of business
  • Strong communicator, able to describe complex technical issues to business users, both verbally and written.

New SRE Jobs

Connecting top SRE talent with leading companies.

For SRE Professionals

For Employers

Company