DevOps Engineer 3
We are seeking a highly skilled and experienced DevOps Engineer to join our dynamic team. This role is ideal for a technical expert who thrives in a fast-paced environment and is passionate about building scalable, secure, and automated infrastructure solutions. You will play a critical role in designing, deploying, and maintaining cloud-native systems across multi-cloud platforms, with a strong focus on AWS (our default platform), and exposure to GCP and Azure as we expand our cloud footprint.
This position requires deep technical knowledge, strong problem-solving skills, and the ability to work independently with minimal supervision. You will collaborate across engineering, development, and operations teams to drive automation, improve system reliability, and support rapid delivery of business-critical applications.
Key Responsibilities
Provide expert-level operational support for cloud platforms, applications, and data servicesLead the design and implementation of infrastructure solutions using Infrastructure as Code (IaC) principlesManage and optimize EKS, serverless architectures, and CI/CD pipelinesImplement and maintain robust monitoring and alerting systems using tools like Prometheus, Grafana, and CloudWatchDrive automation for deployment, scaling, and recovery processes to ensure high availability and performance.Collaborate with cross-functional teams to troubleshoot complex issues and deliver scalable solutionsContribute to the development of secure, efficient, and repeatable software delivery pipelinesParticipate in on-call rotations and respond to production incidents with urgency and precisionDocument technical procedures, create knowledge articles, and mentor junior engineersStay current with emerging technologies, including GenAI, and explore their integration into DevOps workflowsRequired Qualifications
Proven experience in a senior or lead DevOps role, with a strong background in cloud infrastructureDeep expertise in AWS, with working knowledge of GCP or AzureHands-on experience with EKS, Lambda, Fargate, and container orchestrationProficiency in CI/CD automation tools such as GitHub Actions, or ArgoCDStrong programming/scripting skills (e.g., Python, TypeScript) for building scalable IaC and automationSolid understanding of monitoring, logging, and alerting best practicesExperience with Kubernetes architecture, multi-region deployments, and disaster recovery planningFamiliarity with security best practices, secret management, and compliance frameworksExcellent communication skills and the ability to work effectively in a distributed team environment, occasionally presenting to senior audiencesExperience with Agile methodologies and project management tools like JiraExposure to GenAI tools and their applicationExperience leading infrastructure projects or acting as a technical lead