Senior Manager, Site Reliability
Ping Identity Corporation
As a Ping Identity SRE, you will be involved in every facet of our On-Demand SaaS services and will build, deploy, and maintain the infrastructure of one of the largest identity platforms in the world. We follow a DevOps model: our teams are integrated with development teams, and running continuous deployments daily, and SREs are expected to provide input in the product's design, development, deployment, and operations.
Working within the Cloud Operations team, you'll build automated infrastructure and deployment processes. You'll be the expert on operational excellence and how systems can be built to be; redundant, scalable, and observable.
Responsibilities:
Oversee and maintain our production infrastructure hosted on AWS with a 99.99%+ uptime SLA Leadership and Mentorship of a team of 8-10 SREs. Collaboration with other SRE, Security and Development teams. Define processes for the team to efficiently meet target dates. Drive projects to completion working with multiple Development teams. Capacity analysis and planning. Drive the team with automation standards. Analyze complex system behavior, performance and application issues. Oversee observability and analysis across multiple datacenters.Requirements:
Experience leading a software focused SRE team of 8-10 staff. Experience working in organizations with a global presence. The ability to drive decisions around build vs buy. Develop, maintain and administer modern infrastructure tooling. 3+ years Amazon Web Services (AWS) or other cloud platform experience. Knowledge of scripting and programming standards (Python/Ruby/Bash/Go/etc.) Experience with Docker and container orchestration (Kubernetes). Experience using Git in a large team environment. Experience with Security design principles. Experience in a high-volume or critical production service environment. IP networking; familiarity with the functionality, operating, and failure modes of networks.Nice to Have
Experience with observability tooling such as NewRelic, Splunk, Grafana, and Cloudwatch. Experience with DevOps automation tools such as Jenkins, Artifactory, Spacelift Solid experience with server configuration with Puppet/Chef/Salt.LI-NY1#
Por favor confirme su dirección de correo electrónico: Send Email