Lead Site Reliability Engineer
Insight Global
Job Description
Insight Global is looking for a Lead Site Reliability Engineer/DevOps Engineer to join our client in the Artificial Intelligence space on a full-time, permanent basis. This is a hybrid role that requires the successful candidate to work on-site in downtown Vancouver one day per week. In this role, you will be responsible for building the best infrastructure and maintaining the health of the internal systems. Ideal candidates have experience working in a SaaS start-up environment in a lead capacity. There is a strong emphasis on monitoring and alerting, as you'll be the person ensuring the health of the systems through actionable alerts. While New Relic is the main monitoring tool used, experience with similar tools is just as valuable. From a cloud perspective, strong prior AWS experience is a must-have. Additionally, strong experience with infrastructure-as-code tools such as Terraform, as well as Docker and containerization, is a must-have. Lastly, the successful candidate should have a solid understanding of cloud security and compliance best practices, including SOC 2 readiness and audit support as it pertains to cost savings.
We are a company committed to creating inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity employer that believes everyone matters. Qualified candidates will receive consideration for employment opportunities without regard to race, religion, sex, age, marital status, national origin, sexual orientation, citizenship status, disability, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to Human Resources Request Form (https://airtable.com/app21VjYyxLDIX0ez/shrOg4IQS1J6dRiMo). The EEOC "Know Your Rights" Poster is available here (https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf).
To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/ .
Skills and Requirements
- 8+ years' experience working as a Site Reliability Engineer or DevOps Engineer, more recently in a lead capacity
- Proven experience improving system health by creating actionable alerts with monitoring tools such as New Relic, Grafana, Prometheus, or PagerDuty
- Strong knowledge and working experience in an AWS environment
- Expert-level experience with Infrastructure as Code in Terraform or similar tools, as well as Docker and containerization
- Strong understanding of cloud security and best practices for SOC 2 readiness and support
- Understanding of scripting and programming languages such as Python and Bash
- Ability to understand backend code written in JavaScript/TypeScript
- Experience working in MongoDB or similar databases
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal employment opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment without regard to race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to HR@insightglobal.com.