Search through thousands of job postings to find your next opportunity
No technologies match your search.
SAP
Bengaluru, Karnataka, India
Posted 1d
Indegene
Poland
Posted 1d
Readiness IT LATAM - una empresa CONKORD
Providencia, Santiago Metropolitan Region, Chile
Posted 1d
Xero
Seattle, WA
Posted 1d
Xero
Denver, CO
Posted 1d
Akamai Technologies
United States
Posted 1d
Hirenza
United States
Posted 1d
Sanderson Government & Defence
England, United Kingdom (Remote)
$65,000.00 - $75,000.00
Posted 1d
Cloudbeds
Romania
Posted 1d
Informatech Pty Ltd
Canberra, Australian Capital Territory, Australia
$160,000.00 - $200,000.00
Posted 1d
1. Ensure the stability, reliability, and efficient operation of the Xiaomi's global business, maintaining high availability of services at all times.
2. Responsible for core operational tasks such as resource provisioning and management, incident response, capacity management, monitoring, and reliability improvements.
3. Review technical architecture design, assess soundness of the design, and proactively identify and resolve reliability risks.
4. Conduct in-depth analysis of systemic deficiencies, identify bottlenecks and develop optimization strategies; plan and execute projects to improve system reliability and ensure cost-effectiveness and highly availability of the systems.
5. Participate in 24/7 on-call rotation, promptly respond to and resolve production incidents to ensure service availability.
6. Analyze and improve processes to build stable, highly available systems; drive continuous automation improvements, and minimize manual intervention.
1. Proficiency in one of the following programming languages: Python, Go, or shell scripting, with demonstrated ability to independently develop modules or platforms.
2. Familiar with cloud computing; experience in managing multi-cloud or hybrid cloud platforms (e.g., Alibaba Cloud, Azure, AWS) is preferred.
3. Strong foundation in computer science, with hands-on experience in Linux, networking, load balancing, and designing high-availability and disaster recovery architectures.
4. A good team player with a strong sense of responsibility, self-driven and highly motivated.
5. Minimum 3 years of working experience in operations and maintenance of large-scale web services is preferred; hands-on experience in managing or operating large-scale web services or projects is a plus.
6. Fluent in Mandarin (spoken) is a plus.