Senior Manager of Site Reliability Engineering - Data Protection and Recovery
Chase bank
Guide and shape the future of technology at a globally recognized firm, driven by pride in ownership.
As a Senior Manager of Site Reliability Engineering at JPMorgan Chase within the Infrastructure Platforms-Data Protection and Recovery organization, you are the non-functional requirement owner and champion for the applications and infrastructure operations in your remit. You are a key influencer in your team’s strategic planning, driving continual improvement in customer experience, resiliency, security, scalability, monitoring, instrumentation, and automation of the software in your area.
Job responsibilities
Demonstrates expertise in site reliability principles and demonstrates an understanding of the fine balance between features, efficiency, and stability
Effectively negotiates with peers and executive partners to ensure optimal outcomes for all
Drives the adoption of site reliability practices throughout the organization
Ensures your teams demonstrate site reliability best practices with the ability to demonstrate this empirically through stability and reliability metrics
Drives a culture of continual improvement and solicits real-time feedback to improve the customer’s experience and product line services
Ensures your team collaborates with other teams within your group’s specialization and avoids duplication of work where possible
Follows blameless, data-driven, post-mortem strategies and conducts regular team debriefs to enable learning from both successes and mistakes
Provides personalized coaching for entry to mid-level team members
Ensures your team documents and shares their knowledge and innovations via internal forums, communities of practice, guilds, and conferences
Required qualifications, capabilities, and skills
Formal training or certification on infrastructure engineering concepts and 5+ years of applied experience, plus 2+ years of experience leading technologists to manage and solve complex technical items within your domain of expertise
7+ years of experience in Infrastructure Operations, driving site reliability and performance engineering
Advanced proficiency in site reliability culture and principles, with the ability to demonstrate how to implement site reliability across platform teams while avoiding common pitfalls
Experience leading technologists to manage and solve complex technological issues at a firmwide level
Ability to influence the team’s culture by championing innovation and change for success
Experience hiring, developing, and recognizing talent
Proficiency in at least one programming language (e.g., Python)
Demonstrated proficiency in technical processes
Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform)
Experience with containers and container orchestration (e.g., ECS, Kubernetes, Docker)
Experience troubleshooting common compute, storage, and networking technologies and hardware issues
Preferred qualifications, capabilities, and skills
Demonstrated data fluency
3+ years of experience with enterprise data protection products such as Cohesity or Commvault