At American Express, our culture is built on a 175-year history of innovation, shared values and Leadership Behaviors, and an unwavering commitment to back our customers, communities, and colleagues. As part of Team Amex, you'll experience this powerful backing with comprehensive support for your holistic well-being and many opportunities to learn new skills, develop as a leader, and grow your career.
Here, your voice and ideas matter, your work makes an impact, and together, you will help us define the future of American Express.
How will you make an impact in this role?
At American Express, we know that with the right backing, people and businesses have the power to progress in incredible ways. Whether we’re supporting our customers’ financial confidence to move ahead, taking commerce to new heights, or encouraging people to explore the world, our colleagues are constantly redefining what’s possible - and we’re proud to back each other every step of the way. When you join #TeamAmex, you become part of a diverse community of over 60,000 colleagues, all with a common goal to deliver an exceptional customer experience every day.”
We’re looking for a Site Reliability/Application Support Engineers (SRE/AS) responsible for Digital Payments application performance, availability, and reliability. Candidate is responsible to provide consultation and strategic recommendations by quickly assessing and remediating complex platform availability issues. Site Reliability Engineering (SRE) is a continuous engineering discipline that effectively combines software development and systems engineering to build and run scalable, distributed, fault-tolerant systems. This role will ensure that American Express internal and external services have reliability and uptime appropriate to users' needs. We also ensure a continuous improvement, while keeping an ever-watchful eye, automated, on capacity and performance.
This role will drive the SRE/AS mindset which strives to use software engineering to build and run better production systems. You will write software to optimize day to day work through better automation, monitoring, alerting, testing, and deployment. You’ll be expected to work with several Technology partners to identify areas of opportunity within the availability platform and build a solution to automate monitoring solutions for the modernization platform, technology, and constant innovations to drive efficiencies. You will be responsible for implementing tracing, monitoring, tooling solutions to maximize the performance and availability of our Digital Payments applications.
What you will be doing:
Research latest technology, concepts, conceptualize solution and develop proof of concept that will improve resiliency and performance of the production infrastructure Design and implement innovative solution/framework that will improve software engineering velocity, infrastructure resiliency and security, and data availability Develop common framework components (to be leveraged by enterprise applications), define standards for configuration, monitoring, reliability, and performance engineering Work with Technology teams to resolve major incidents Conduct root cause analysis (RCA) for incidents and implement preventive measures. Define and monitor Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets. Continuously improve automated remediation tasks to ensure the highest levels of availability Implement and maintain security best practices for payment systems, ensuring compliance with industry standards.Minimum Qualifications:
BS or MS degree in computer science, computer engineering, or other technical discipline, or equivalent 4-8 years of work experience in DevOps/SRE. Development or support of Java/J2EE/REACT JS applications and Node applications. Good understanding of cloud technologies - Kubernetes, Openshift, Docker etc. Knowledge of Public Cloud technologies GCP, AWS, AZURE etc. would be an advantage. Familiarity with programming languages such as Python, Go, or Java. Experience with version control systems (e.g., Git) and CI/CD tools (e.g., Jenkins, GitLab CI). Strong problem-solving skills and ability to work in a fast-paced environment. Knowledge of payment systems, 3DS protocols, or PCI-DSS compliance is a plus. Knowledge of server-side technologies such as JBose, NodeJS, Jetty Monitoring and analyzing PMI data Hands on experience on enterprise tools set such as Grafana, Dynatrace, AppDynamics,Prometheus etc. Knowledge on Unix shell scripting, PERL or Python programming is preferred Experience in handling DDoS/BOT attack and different security remediations Working experience with Network load balancers, Global Traffic Managers (GTMs), Local Traffic Managers (LTMs) Hands on experience on configuring Splunk, Grafana dashboards, Kibana, Elasta alerts etc. Working experience on network rules creation, load balancer configurations, network packet analysis Analytical knowledge and exposure on root cause identification using analyzer tools like IBM support assistant, Splunk etc. Good understanding of Linux OS internals, performance tools, Core commands, security etc. Experience on supporting three tier architecture which includes exposure to databases such IBM DB2, Couchbase, Mongo, Redis etc. Certificate Management automation - Message signing, SSL, etc. Exposure to ITIL processes is preferred On call / 24*7 support required Exposure to enterprise platform migration from dedicated to cloud environment is preferredWe back you with benefits that support your holistic well-being so you can be and deliver your best. This means caring for you and your loved ones' physical, financial, and mental health, as well as providing the flexibility you need to thrive personally and professionally:
Competitive base salaries Bonus incentives Support for financial-well-being and retirement Comprehensive medical, dental, vision, life insurance, and disability benefits (depending on location) Flexible working model with hybrid, onsite or virtual arrangements depending on role and business need Generous paid parental leave policies (depending on your location) Free access to global on-site wellness centers staffed with nurses and doctors (depending on location) Free and confidential counseling support through our Healthy Minds program Career development and training opportunitiesOffer of employment with American Express is conditioned upon the successful completion of a background verification check, subject to applicable laws and regulations.