XOPS is a fast-growing startup building the future of observability and automation for IT operations. Our platform unifies complex system data to deliver visibility, control, and intelligent workflows across the enterprise, empowering IT teams to manage the entire employee technology lifecycle with precision. As industries embrace AI to automate cars, rockets, and even farming, IT operations remain stuck in the past, reliant on spreadsheets and manual processes. We believe it is time for a change.
At XOPS, we are pioneering autonomous IT operations, freeing teams from tedious tasks and elevating them into strategic leadership roles. Our mission is to drive operational excellence, financial stewardship, and security across the enterprise, while transforming the employee experience. We are just getting started, and we are looking for exceptional teammates to help shape the future.
XperiencOps Inc. is seeking a talented and experienced Senior SRE/DevOps Engineer to join our team. In this role, you will be responsible for maintaining and building the infrastructure and tools necessary for our development and operations teams to deliver high-quality software solutions. You will collaborate closely with cross-functional teams to automate processes, improve scalability and reliability, and ensure the smooth operation of our systems, and provide on-call escalation support for issues that may arise.
Responsibilities:
Design, implement, and maintain CI/CD pipelines to automate the build, testing, and deployment of complex enterprise applications
Deploy, configure, and manage cloud infrastructure and services to support our enterprise applications and high-availability services
Identify and implement process improvements to increase efficiency, reliability, and scalability
Monitor and troubleshoot Production systems and respond to Incidents in a timely manner
Collaborate with development teams to understand their requirements and provide guidance on best practices during design reviews
Perform regular system and application performance tuning and optimization
Ensure proper security measures are in place to protect our systems and data, and meet our compliance needs
Document and communicate infrastructure design, configuration, and troubleshooting procedures to team members
Requirements
Bachelor's or Master's degree in Computer Science, Engineering, or a related field
10+ years of experience as a DevOps Engineer or Site Reliability Engineer with responsibility of Production environments (ideally with experience supporting Enterprise customers)
Experience with AWS as Cloud Platform is a must
Deep understanding of CI/CD pipelines and related tools such as GitHub Actions, GitLab CI/CD, CodePipeline, etc
Experience using Terraform to deploy and maintain infrastructure-as-code (experience using AFT is a plus)
Expertise in managing server-less architectures and AWS Serverless Services, experience with Serverless Framework preferred
Proficiency in scripting languages like Bash or Python
Experience with Observability tools such as New Relic (preferred), DataDog, or SumoLogic
Experience with Managed Relational Databases (Aurora MySQL is ideal)
Experience with containerization technologies such as Docker and orchestration tools like Kubernetes or ECS
Familiarity with version control systems like Git
Strong knowledge of networking, security, and monitoring tools and best practices
Strong problem-solving and troubleshooting skills
Excellent written and verbal communication and collaboration abilities
Experience working as part of a globally distributed team in an enterprise setting
Preferred: Certification as a Professional AWS Certified DevOps Engineer or Professional AWS Certified Solutions Architect
For this role, the estimated base salary range is between $170,000 - $208,000 USD. The actual base salary will vary based on various factors, including market and individual qualifications objectively assessed during the interview process. The listed range above is a guideline, and the base salary range for this role may be modified.
Benefits
Competitive Compensation: Salary, Equity, and 401K
Comprehensive Vision, Dental, and Healthcare plans
Discretionary Time off Policy (If you need time off, take time off!)
11 Company-paid Holidays
Hybrid Work Policy - 3 days in office/2 days remote
A chance to be part of a rapidly growing startup and make a real impact!