Madrid, Spain
25 days ago
DevOps Infrastructure Engineer

At Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections,  where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure everyone has access to healthcare today and for generations to come. Join Roche, where every voice matters.

The Position

As a Cloud DevOps Infrastructure Engineer at Roche, you will be at the forefront of optimizing the reliability, availability, scalability, and performance of our Cloud Platform infrastructure. Your role involves applying software engineering principles across the entire lifecycle, from inception to decommissioning. You will dive into the heart of our cloud infrastructure platforms, leveraging your specialized knowledge to optimize performance, scalability, and reliability. Be the driving force behind continuous improvements, ensuring our platforms meet and exceed the highest standards.

The Cloud platform team globally serves our internal Roche customers and IT partners designing, building and operating modern distributed systems on a Public Cloud Infrastructure globally (Europe, NALA and APAC).

Job responsibilities:

Engages in and improves, with low guidance, the whole lifecycle of cloud platforms and services—from inception and design through deployment, operation and retirement by applying software engineering principles to build and manage large scale  IT infrastructure products and services (both on-premises or in public cloud) optimizing:

services reliability, availability, capacity and performance  and eliminating work through automation

software development and deployment  by abstracting away the complexity of infrastructure providing self-service tools and APIs for developers that allow code and ship software quickly.

Implements and maintains CI/CD pipelines, enabling developers to easily build, test, and deploy their applications. 

Develop self-healing features.

Contribute to Disaster Recovery execution plans.

Ensure IT infrastructure services reach and maintain the agreed service level indicators (SLIs), objectives (SLOs), agreements (SLAs) in compliance with QA requirements.

Contribute to the maintenance of services once they are live by measuring and monitoring availability, latency and overall system health.

Contribute to activities focused on availability, tuning, performance, efficiency, change and configuration management, monitoring, emergency response and capacity planning.

Manages ITSM process(es) and track resolution for reporting and resolving incidents, problems, changes, requests and releases. 

Monitors and resolves incidents/problems (including major ones) with platform operations, suggesting priorities and collaborating in the resolution when required.

Practice sustainable incident response and blameless postmortems. 

Ensures implemented solutions and components comply with Quality/Regulatory standards, as applicable.

Implements cost, compliance and security best practices, ensuring that platforms and services meet the corresponding requirements

Contribute to audit exercises providing the required evidence to the audit teams.

Collaborate with developers, Managed Services suppliers, other teams and vendors to:

continuously improve application development velocity

optimize services reliability, availability and performance

Works closely with development teams to ensure that new features and changes are rolled out with reliability in mind bringing a broader and more strategic understanding of reliability that spans multiple facets of development.

Act as an analyst by transforming the customers and developers needs into specific technical requirements to be implemented by the product team or by other teams.

Maintain in-depth knowledge of current and emerging technologies within their technical, infrastructure area of responsibility to further the objectives of the team or department and ability to tackle complex and interdisciplinary issues. 


 

Job requirements:

Good interpersonal skills and good oral and written communication skills.

English language proficiency is required. 

Proficiency in German, Spanish or Chinese is considered a plus.

Customer and delivery focus mindset.

Proven scripting and automation skills with strong knowledge in delivering and managing infrastructure as code.

Experience working with cloud Infrastructure platforms, their availability, administration, configuration and integration. 

Familiarity with Software Engineering and DevOps principles and automated testing and CI/CD tools. 

Knowledge of agile methodologies and principles.

Ability to continuously research, learn, innovate and share knowledge

Ability to work effectively with team members and virtual teams from different locations and different cultural backgrounds.

Problem-solving and decision-making skills.

Customer orientation, partnership, collaboration and trust.

Technology Skills:  

Proven scripting and automation skills with expertise in delivering and managing infrastructure as code.

Recent experience/exposure designing, implementing, operating infrastructure or designing hybrid multi-cloud solutions in GCP (Google Cloud) Public Cloud Platforms

Creation of high-availability, fault-tolerant and auto-scaled Dev/Test/Stage and Production environments using Infrastructure as Code techniques using  Terraform.

Hands-on technical skills in automation (Phyton and Terraform), infrastructure as code, logging, monitoring and observability (Datadog and Google Cloud native services), infrastructure configuration, scripting languages and applications. Source code management (GitLab, BitBucket, GitHub). 

Experience with Infrastructure Performance Analysis, reporting and Capacity Planning is nice to have.

DevOps Pipeline Automation experience: Gitlab CI/CD.

Experience defining SLOs, SLIs, SLAs, error budgets, toil reduction, blameless postmortems and incident management.


 

Education / Years of Experience: 

Bachelor’s degree in Computer Science/Engineering or equivalent work experience in information technology environment (networking, infrastructure, database).

Certifications on GCP or equivalent working knowledge on Google Cloud.

You will bring +3 years of relevant work experience in one or more multinational work environments (e.g. healthcare industry experience is a plus).

Moderate travel required and ability to work across multiple time zones, including on-call, maintenance and extended hours of work.

Who we are

A healthier future drives us to innovate. Together, more than 100’000 employees across the globe are dedicated to advance science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact.


Let’s build a healthier future, together.

Roche is an Equal Opportunity Employer.

Por favor confirme su dirección de correo electrónico: Send Email