Senior Cloud Operations Engineer
Oversight
About Oversight
Oversight is the world’s leading provider of AI-based spend management and risk mitigation solutions for large enterprises. Based in Atlanta, GA, Oversight works with many of the world’s most innovative companies and government agencies to digitally transform their spend audit and financial control processes.
Oversight’s AI-powered platform works across our customers’ financial systems to continuously monitor and analyze all spend transactions for fraud, waste, and misuse. With a consolidated, consistent view of risk across their enterprise, customers can prevent financial loss and optimize spend while strengthening the controls that improve compliance. Learn More.
Position Overview: Job Purpose
The Cloud Operations Engineer is responsible for managing and maintaining cloud-based infrastructure and ensuring the seamless operation of cloud environments. The ideal candidate will have extensive experience in cloud infrastructure, automation, monitoring, and ensuring high availability of mission-critical systems. In this role, you will work closely with engineering, product, and operations teams to ensure seamless delivery and operation of our SaaS products, focusing on automation, scalability, and continuous improvement.
Qualifications
- Bachelor degree in computer science program preferred
- 6+ years of hands on experience in Design, deploy, and maintain scalable and secure cloud infrastructure platforms
- Proven experience with cloud platforms such as AWS, Azure, or GCP.
- Proficiency in CI/CD tools such as Jenkins, GitLab CI, or CodePipeline, with a focus on automation and reliability.
- Strong scripting skills (Python, Bash, Java) and experience with version control systems (BitBucket, CodeCommit).
- Proven expertise in cloud-native architectures, containerization (Docker, Kubernetes), and serverless computing.
- Strong experience with infrastructure as code (IaC) tools like Terraform, CloudFormation, or Ansible.
- Experienced monitoring and logging tools like Prometheus, Cloudwatch, Grafana, ELK Stack, etc.
- Deep understanding of networking concepts (VPCs, load balancers, DNS, firewalls).
- Ability to manage multiple projects simultaneously and prioritize tasks in a fast-paced environment.
- Strong leadership skills with a proven ability to mentor and guide technical teams.
Preferred Qualifications
- Experience with multi-cloud SaaS environments
- Knowledge of serverless architectures and microservices.
- Relevant certifications such as AWS Certified Cloud Engineer, Cloud DevOps Engineer, Solutions Architect, or equivalent
- Experience with SaaS-specific compliance standards like SOC 2 or GDPR
Responsibilities
- Oversee the deployment, operation, and scaling of SaaS applications across cloud platforms
- Ensure high availability, reliability, and performance of SaaS applications by implementing best practices in monitoring, alerting, and incident management
- Manage cloud infrastructure as code (IaC) to support multi-tenant SaaS environments
- Design and maintain continuous integration/continuous deployment (CI/CD) pipelines to support rapid and reliable delivery of SaaS features and updates
- Automate deployment processes, reducing time to market and minimizing errors in production environments.
- Integrate security practices into the CI/CD pipeline to ensure compliance with industry standards and regulations
- Ensure SaaS applications meet security and compliance requirements, including data protection, access control, and regulatory compliance.
- Assist in capacity planning and forecasting for cloud resources.
- Collaborate closely with software development, IT operations, and security teams to ensure seamless integration of DevOps practices
- Develop and implement Cloud Operation strategies that align with the organization’s goals and enhance the efficiency and reliability of the development lifecycle
- Advocate for the adoption of new technologies and methodologies to enhance the efficiency and reliability of SaaS operations