Site Reliability Engineer

Oracle

Romania

Job Description

We are looking for a Site Reliability Engineer (SRE) to join our team and help ensure the reliability, scalability, and performance of our systems. In this role, you will bridge the gap between development and operations by implementing best practices in automation, monitoring, incident response, and infrastructure management.

Key Responsibilities:

  • Design, implement, and maintain scalable, reliable, and high-performance infrastructure.
  • Develop and improve monitoring, alerting, and logging systems to ensure system health and performance.
  • Automate operational tasks, deployments, and infrastructure provisioning.
  • Collaborate with development and operations teams to improve system reliability and efficiency.
  • Identify and resolve production issues, ensuring minimal downtime and fast recovery.
  • Conduct root cause analysis and post-mortems for incidents, implementing preventive measures.
  • Optimize system performance, capacity planning, and cost efficiency.
  • Enhance security, compliance, and risk management practices for infrastructure and applications.


Qualifications & Skills:

  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).
  • Experience with cloud platforms such as OCI, AWS, GCP, or Azure.
  • Proficiency in scripting and automation using Python, Bash, or similar languages.
  • Hands-on experience with infrastructure-as-code tools like Terraform, Ansible, or CloudFormation.
  • Familiarity with containerization and orchestration (Docker, Kubernetes).
  • Strong knowledge of CI/CD pipelines and DevOps best practices.
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK, Datadog, etc.).
  • Understanding of networking, Linux system administration, and database management.
  • Strong problem-solving skills and a proactive approach to system reliability.


Career Level - IC4

Responsibilities

  • Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of services and/or technology areas.
  • Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.
  • Take responsibility for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance.
  • Hold authority for end-to-end performance and operability.
  • Partner with development teams in defining and implementing improvements in service architecture.
  • Articulate the technical characteristics of services and technology areas, and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio.
  • Understand and communicate the scale, capacity, security, and performance attributes and requirements of the service and technology stack.
  • Demonstrate a clear understanding of automation and orchestration principles.
  • Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs).
  • Use a deep understanding of service topology and dependencies to troubleshoot issues and define mitigations.
  • Understand and explain the effect of product architecture decisions on distributed systems.
  • Bring professional curiosity and a desire to develop a deep understanding of services and technologies.

About Us

As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s challenges. We’ve partnered with industry leaders in almost every sector, and we continue to thrive after 40+ years of change by operating with integrity.

We know that true innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing an inclusive workforce that promotes opportunities for all.

Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing [email protected] or by calling +1 888 404 2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
