Livepeer is on a mission to build the world’s open video infrastructure. Founded in 2017, it is the world’s first open-source protocol for decentralized video streaming, built on Ethereum. The project has empowered developers to create scalable, cost-effective, and censorship-resistant video applications. The Livepeer network has transcoded billion of minutes, serving Web3 and Web2 platforms across gaming, entertainment, social media, and beyond. In 2024, Livepeer AI was introduced, unlocking Livepeer’s compute network for AI inference workflows. From real-time video transcription and object detection to scene recognition and AI-powered editing, Livepeer AI brings advanced machine learning directly into the decentralized video stack. These new tools not only reduce costs but also empower developers to build richer, smarter, and more engaging video experiences—whether for Web3 platforms, AI-powered dApps, or even traditional video use cases.
Your Role:
Livepeer AI is looking for an experienced, self-driven SRE Engineer – someone that loves to build tools automate everything and deliver the best production experiences for end users. They are passionate about keeping all our user-facing services and Livepeer production systems running smoothly. They specialise in systems (operating systems, storage subsystems, networking, GPU clusters, Docker), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems.
We value reliability. We approach the infrastructure with craft and think a lot about form and function. You should feel equally at home talking to developers and designers. We are looking for someone who cares about the reliability of the infrastructure as much as we do. You will ensure the final product is high quality and works as intended.
Responsibilities:
Provide tech leadership in SRE execution and planning
Lead complex infra projects for both internal and external stakeholders
Orchestrate and run our infrastructure
Add to and tune our monitoring
Reduce or automate manual processes
Be on an on-call (PagerDuty) rotation to respond to incidents that impact Livepeer’s availability
Plan the growth of our infrastructure as we continue to scale
Vendor management
Manage the technical roadmap for the SRE team
Infrastructure cost monitoring and optimisations
Supporting engineers and improving development workflows
Talk directly to large customers
Co-ordinate with team members across timezones
Experience Required:
Build a technical competent SRE team through a clear set of OKRs
Build essential tooling to improve the infra ops
Have run global mission-critical infrastructure
Have managed systems that handle high request volumes
Know your way around Linux and the Unix Shell
Have used configuration management systems
Have used infrastructure automation tools
Have implemented CI / CD pipelines
Have experience with some of the following technologies: