Cloud Operations Engineer
Redis
Who we are
We're Redis. We built the product that runs the fast apps our world runs on. (If you checked the weather, used your credit card, or looked at your flight status online today, you’re welcome.) At Redis, you’ll work with the fastest, simplest technology in the business—whether you’re building it, telling its story, or selling it to our 10,000+ worldwide customers. We’re creating a faster world with simpler experiences. You in?
Cloud Operations Engineer
Why would you love this job?
Join Redis’s Cloud Operations team and help drive the reliability, scalability, and performance of our global cloud platform. This is a highly technical, hands-on role focused on operating and evolving complex production systems at scale. You'll also collaborate with talented teams across engineering, customer support, and technical account management to deliver robust, world-class solutions.
Our culture is built on ownership, innovation, and continual improvement.
We thrive on solving technical problems in high-impact, collaborative environments. We value continuous optimization, automation, and seeing solutions through from idea to implementation. If this sounds like you, you’ll feel right at home here.
What you’ll do
- Monitor, maintain, and optimize large-scale, multi-cloud production environments for reliability and performance.
- Collaborate with multiple teams to support seamless releases and operational enhancements.
- Participate in troubleshooting and analysis of complex technical issues, implementing lasting solutions.
- Automate operational workflows and incident response to reduce manual intervention and accelerate recovery.
- Contribute to process improvement and participate in operational retrospectives.
- Share on-call responsibilities as part of a globally distributed, 24/7 team.
What will you need to have?
- 5+ years of experience in SRE, DevOps, Cloud Engineering, or System/Network Administration for production systems.
- Practical, hands-on expertise with at least one major cloud provider (AWS, GCP, Azure).
- Demonstrated experience using a range of monitoring, alerting, and incident management tools.
- Broad expertise in Linux and scripting (e.g., Python or Bash), with growing interest in leveraging AI for system optimization and automation.
- A strong sense of ownership for technical outcomes, a drive for reliability, and commitment to continual improvement.
- Flexibility for operating in a fast-paced, global, and on-call environment.
About You
You are a technical expert who values precision, reliability, and efficiency. You enjoy collaborating with high-caliber teams and are motivated by taking ownership, learning from challenges, and making a sustained, positive impact on large-scale, production environments. This isn’t a customer-facing role but everything you do helps ensure our customers get a fast, stable, and seamless experience.
#LI-BL1 #LI-Hybrid