Charlotte, NC
5 days ago
Senior Site Reliability Engineer

The Site Reliability Engineering team enables the organization by developing tooling and architectural patterns that leverage the public cloud in a reliable and scalable manner. As a Senior Software Site Reliability Engineer II, you will serve as a technical leader for the team and the wider organization, help evolve our technology through automation and reliable architecture, and help increase velocity by driving adoption of those implementations.

What you’ll do:
Drive solutions and implement systems that propel the organization as we leverage the capabilities of the Cloud to provide a seamless Platform
Design, deploy, and maintain high-throughput Kafka clusters supporting real-time data streaming at scale
Own core infrastructure service architecture and reliability (Kafka, DNS, GCS, BigQuery, Container-Optimized OS, etc.)
Own core infrastructure tools and frameworks (Configuration Management, IAM, CI/CD, Infrastructure as Code, AIOps, Monitoring, HA, etc.)
Work with public cloud platforms (e.g. Google Cloud Platform, AWS) as well as container orchestration systems like Kubernetes
Collaborate across Engineering and Product teams to translate application requirements into infrastructure capabilities
Maintain an automation-centric vision and incorporate SRE methodologies to increase reliability and decrease toil
Take part in technical design and architecture discussions and decisions, and contribute to technical troubleshooting in various parts of the stack

What’s great about the role:
You will have the opportunity to contribute greatly to an extremely engineering-focused organization.
Your contributions will have a noticeable impact on our members as well as your fellow Karmanauts (that’s what we call ourselves).
You will be involved in organizational efforts of continuous improvement to increase and ensure the reliability of Credit Karma.
You will get broad exposure to our full stack, consisting of sought-after and progressive technologies such as Google Cloud Platform, Kafka, Terraform, etc.
You will grow and learn and have fun doing it. It’s part of our culture.
And, of course, all those awesome company perks that you have probably already read about.

What we’re looking for:
5+ years of experience with, and a strong understanding of, Linux systems, networking (TCP/IP, HTTP, DNS, TLS), and containers.
Experience supporting Kafka/Pubsub data infrastructure and working alongside data engineering teams.
Strong understanding of Computer Engineering with a focus on Infrastructure, Platform, and Application (Cloud, Containerization, Container Orchestration, Network, Application Reliability, Database Architecture).
Experience running infrastructure at scale, using Configuration Management and automation to ensure scale and reliability.
Proficiency in scripting with Python, Go, or other high-level object-oriented languages for automation and process optimization.
Ability to communicate effectively vertically and horizontally within the organization, demonstrated through written and verbal communication skills.

What we’d like to see:
Experience operating large Kafka clusters, with exposure to contributing to or updating open source Kafka clients/frameworks.
Experience developing technical design documents, roadmaps, and architectural plans for at-scale infrastructure solutions.
Advanced knowledge of Python, Go, or other higher-level OOP languages (e.g. Ruby, C++, Scala, etc.).
Familiarity with information security principles and best practices in virtual environments.

Benefits at Credit Karma include:
Medical and Dental Coverage
Retirement Plan
Commuter Benefits
Wellness Perks
Paid Time Off (Vacation, Sick, Baby Bonding, Cultural Observance, & More)
Education Perks
Paid Gift Week in December