BENGALURU, KARNATAKA, India
3 days ago
Site Reliability Developer 3

The ideal candidate is going to work in an agile environment on problems that are focused on improving uptime by reducing time to mitigation when issues are surfaced/reported by automated means or through customer incidents, contributing to code for automation of monitoring, patching, and remediation of service anomalies. You will work alongside a software development team within the greater OAC organization where you will support existing features in the cloud as well as new operational processes, automation, and content. You will play a key role in improving the processes supporting the OAC services, so that the service functions more and more autonomously over time.

Key Responsibilities:

Perform DevOps activities to support service reliability with customers, release cycles, and production stability. Participate in a follow-the-sun model for 24x7 support of Oracle Analytics Services Respond to service incidents, troubleshoot, and lead resolution efforts, including root cause analysis. Become an expert in Oracle Analytics services, to prevent/resolve customer issues effectively and prevent regressions and repeats. Document various processes & runbooks as well as update existing processes. Deliver interim patches, hot-fixes, and upgrades with high quality. Partner with development, product, and support teams to resolve service failures/outages. Monitor service metrics, analyse trends, and implement improvements to CI/CD pipelines and operational processes followed by the team. Follow all best practices and procedures as established by the organization. Mentor and guide junior engineers, contribute to a culture of knowledge sharing and technical excellence, and drive continuous improvement.
Por favor confirme su dirección de correo electrónico: Send Email