Senior Site Reliability Engineer
CCC Intelligent Solutions
This job is no longer accepting applications
Salary range is: $98,240.00 - $150,000.00. This position is bonus/commission eligible.
CCC Intelligent Solutions is a leading technology company helping to improve the insurance claims process for millions of people. Our award-winning SaaS platform connects more than 35,000 businesses, including insurance carriers, repair facilities, automakers, part suppliers, lenders, and others to streamline the process from start to finish.
Our advanced capabilities in AI, IoT, telematics, data, and analytics drive continual innovation across our platform, as we work to advance the multi-trillion-dollar P&C insurance economy’s digital transformation.
At CCC, our mission is to keep people’s lives moving forward when it matters most. Diversity of experience and perspective is key to our pursuit so we can deliver a future of possibilities for our customers.
The Role
The Site Reliability Engineer (SRE) will work closely with the Product Development team and be responsible for the overall reliability and availability of its applications. This person must have a passion for troubleshooting: getting to the root cause of any issue that is identified, resolving that issue, and owning the lifecycle of that feedback within the application teams.
Key Responsibilities:
- Help build an SRE culture by sharing best practices, approaches, documentation, and code with other engineering teams across the organization.
- Document tribal knowledge as you acquire it over time by creating runbooks/playbooks and ensuring critical system information is readily available to those who need it through dashboards.
- Configure and maintain the monitoring tooling for the target application.
- Monitor application/infrastructure and take steps to improve overall system software performance, availability, and reliability by incorporating changes through defined feedback loops within the software delivery lifecycle.
- Apply automation to any tasks/parts of the system that are performed manually.
- Work closely with software developers and testers to ensure the product meets non-functional requirements such as security, performance, and availability.
- Resolve ICC escalations and help prevent incidents from recurring by creating processes and automation.
- Be a key part of our response to high-severity incidents, ensuring we meet all SLAs and SLOs.
- Assist the product development team with managing its error budget.
- Embrace failures and treat incidents as learning opportunities by conducting blameless postmortems.
- Participate in product engineering stand-ups and related design activities.
- Coach other team members to ensure systems are supported by following SRE best practices.
Requirements:
- Enterprise-level experience in DevOps, Software, Infrastructure, or Site Reliability Engineering, with the ability to demonstrate understanding of high-level technical briefs, talks, and ideas.
- Experience leading teams in troubleshooting, issue resolution, or escalations.
- Ability to document solutions, SRE architectural patterns, and best practices to ensure that teams have guidance as needed.
- Proven ability to dig through metrics, logs, and available sources to triage and resolve an incident at any time.
- Experience with the full software delivery lifecycle.
- Solid understanding of microservices and APIs.
- Experience and interest in working in an Agile environment.
- Versed in system management, monitoring and analysis to identify opportunities to improve service health, manageability and reliability.
- Experience writing and modifying SQL queries and generating reports.
- Eager to problem solve and troubleshoot issues that may arise day-to-day.
- Effective communication and interpersonal skills
Tech Stack
- Java, J2EE, RESTful services, JMS, Kafka, SQL, SOAP, Apache ActiveMQ
- JavaScript, Vue.js, jQuery, JSP, Struts
- Oracle, MySQL, Postgres
- JBoss, Kubernetes, EKS, MSK, Lambda, CloudWatch, Signals, AppDynamics
- Jenkins, Spinnaker, CI/CD Pipeline, Git/GitHub
- Python, Ansible
Nice to Have
- Experience working as an SRE, maintaining the reliability of applications and infrastructure.
- Proficient in infrastructure as code practices.
- Experience building CI/CD pipelines from scratch.
- Able to troubleshoot complicated, cross-platform issues spanning OS, networking, database, and applications in cloud-based and on-premises environments.
About the company’s commitment to its employees
CCC Intelligent Solutions employees are part of an inclusive culture that brings together diverse backgrounds and perspectives. Our team is defined by our values: Integrity, Customer-Focus, Innovation, Diversity & Inclusion, and Tenacity. Together, we help our clients and each other achieve new goals.
CCC is committed to providing employees with opportunities to advance their careers and skillsets. CCC team members receive access to training, and education reimbursement is available.
CCC offers competitive compensation and generous benefits. Health insurance, PTO, and 401(k) are just some of the benefits available to team members. For more information about our benefits, please check out our careers site.
Each team member plays an important role in the company’s success and each team member has a voice. CCC employee engagement and job satisfaction ratings consistently exceed industry norms – underscoring the value CCC places on its employees.
To view our corporate profile and meet some of us, please follow the link below.
CCC Intelligent Solutions Jobs and Company Culture (themuse.com)