Portland, OR, USA
1 day ago
Principal Site Reliability Engineer
Kforce has a client that is seeking a Principal Site Reliability Engineer in Portland, OR. Summary: We are seeking a Principal Site Reliability Engineer to join our skilled team. In this role, you will manage and maintain our production Cloud environment, ensuring a top-notch SaaS experience. If you enjoy innovation, providing technical vision, and working with a team to build reliable, scalable frameworks, this role is for you. You will analyze and improve our services and processes to enhance reliability, performance, scalability, and cost efficiency. You will also advocate for reliability methodologies and collaborate with various teams to integrate these practices into our platform and products. What You'll Do: * Architect, build, and maintain highly available, fault-tolerant systems using AWS/other services * Use Terraform to define infrastructure as code, enabling scalable, repeatable, and secure deployments * Continuously review and recommend the design, maintenance, development and implementation, including deployment and support, of our SaaS production platform solution using Docker and other modern web technologies * Set up and enforce guardrails for databases, infrastructure, and applications, ensuring consistency and adherence to best practices * Support operationally critical environments using monitoring tools, scripts, and logging * Document designs and implementations * Design and manage secure networking solutions, including AWS VPCs, and firewalls * Partner with SRE and Engineering teams to embed reliability and security best practices into the application life cycle * Collaborate with fellow Engineers, Product Managers, and Quality Assurance Engineers to develop and deliver services that meet or exceed enterprise customer reliability and quality expectations * Participate and be effective at pair/mob programming and code reviews, both giving and receiving feedback
Por favor confirme su dirección de correo electrónico: Send Email