There’s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.
As a Site Reliability Engineer III at JPMorgan Chase within the Consumer & Community Banking, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.
Job responsibilities
Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate. Provide guidance and support in developing appropriate design levels and achieving peer consensus. Collaborate with software engineers and teams to design and implement automated continuous integration and continuous delivery pipelines. Work with teams to design, develop, test, and implement solutions for availability, reliability, and scalability in applications. Implement infrastructure, configuration, and network as code for applications and platforms. Collaborate with technical experts, stakeholders, and team members to resolve complex issues. Utilize service level indicators and objectives to proactively address issues before they affect customers. Promote the adoption of site reliability engineering best practices within the team.
Required qualifications, capabilities, and skills
Formal training or certification on software engineering concepts and 3+ years applied experience Proficient in site reliability culture and principles, with the ability to implement them within applications or platforms. Proficiency in at least one programming language such as Python, Java/Spring Boot, or .Net. Strong knowledge of software applications and technical processes within a specific technical discipline (e.g., Cloud, AI, Android). Experience in observability, including monitoring, alerting, and telemetry collection using tools like Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc. Experience with continuous integration and delivery tools like Jenkins, GitLab, or Terraform. Familiarity with container and container orchestration technologies such as ECS, Kubernetes, and Docker. Ability to troubleshoot common networking technologies and issues. Capability to contribute to large teams by presenting information logically and effectively with minimal supervision. Proactive in recognizing roadblocks and eager to learn technologies that drive innovation. Ability to identify new technologies and solutions to meet design constraints.
Preferred qualifications, capabilities, and skills Initiative in implementing ideas to solve business problems.