At Iron Mountain we know that work, when done well, makes a positive impact for our customers, our employees, and our planet. That’s why we need smart, committed people to join us. Whether you’re looking to start your career or make a change, talk to us and see how you can elevate the power of your work at Iron Mountain.
We provide expert, sustainable solutions in records and information management, digital transformation services, data centers, asset lifecycle management, and fine art storage, handling, and logistics. We proudly partner every day with our 225,000 customers around the world to preserve their invaluable artifacts, extract more from their inventory, and protect their data privacy in innovative and socially responsible ways.
Are you curious about being part of our growth story while evolving your skills in a culture that will welcome your unique contributions? If so, let's start the conversation.
Location: United States (remote)
Iron Mountain, a trusted global leader in information management, partners with over 95% of the Fortune 1000. We empower organizations to securely store, manage, and extract value from both physical and digital information. Our Enterprise IT group is responsible for the systems and infrastructure that drive our global operations, focusing on innovation, automation, and reliability across our digital services. As we modernize our technology, our EIT DevOps team is essential in provisioning and maintaining scalable platforms, optimizing delivery pipelines, and enabling secure, efficient operations in a global cloud environment.
Job Summary:
Iron Mountain is seeking a Senior DevOps Engineer to join our dynamic EIT DevOps team. This role is responsible for the staging and production infrastructure of Iron Mountain’s Digital Services within the federal sector, and is pivotal in managing environments across Google Cloud Platform (GCP), Amazon Web Services (AWS), and Microsoft Azure.
Core responsibilities include provisioning and maintaining secure, scalable, and robust cloud infrastructure for the InSight DXP Platform. You will apply extensive knowledge of cloud services and DevOps best practices to ensure application efficiency, high availability, and performance.
You will also create and maintain FedRAMP controls and compliance documentation, execute automation pipelines, upgrade infrastructure, troubleshoot complex issues, and contribute to the ongoing enhancement of deployment processes. Close collaboration with development, operations, and other EIT teams is crucial for delivering seamless and reliable solutions.
Your role in our mission:
Cloud Infrastructure & Compliance: Deploy, manage, and maintain secure, compliant cloud infrastructure across AWS, Azure, and/or GCP, specifically ensuring government workload compliance (FedRAMP, NIST).
Automation & CI/CD: Automate infrastructure provisioning using IaC tools (e.g., Terraform, OpenTofu) and streamline CI/CD pipelines (e.g., GitLab) for efficient infrastructure and application delivery. This includes developing scripts for routine operations like patching, scaling, and monitoring.
System Resilience & Optimization: Design and implement self-healing systems, manage backup and disaster recovery, and optimize application and infrastructure performance through monitoring, capacity planning, and bottleneck identification.
Security Posture: Conduct regular security audits and vulnerability patching.
Incident Response & Observability: Lead real-time incident resolution, perform Root Cause Analysis (RCA), participate in an on-call rotation, and drive observability improvements by ensuring comprehensive logging and telemetry. This includes diagnosing complex infrastructure and application problems.
Application Lifecycle & Scaling: Oversee application lifecycle management (version upgrades, security patches, regional rollouts) and support scaling strategies to meet demand while ensuring infrastructure resilience and SLO compliance.
Knowledge Management: Contribute to a shared knowledge base, documenting recurring issues and resolutions.
Valued skills and experience:
Experience: Minimum 5 years leading and supporting enterprise-level applications in production.
Cloud Platforms: Proven experience in cloud infrastructure provisioning and management on GCP, AWS, or Azure. Experience with FedRAMP Authorized platforms is highly desirable.
Technical Stack:
Proficiency in scripting languages (Python, Bash, PowerShell).
Strong understanding of containerization (Docker, Kubernetes, Helm).
Hands-on experience with cloud object storage (S3, GCS, Azure Blob).
Working knowledge of databases (MongoDB, PostgreSQL).
Experience with microservices, RESTful APIs, and IAM/SSO (Okta).
Familiarity with incident management (ServiceNow, Jira) and security tooling (Prisma Cloud, CrowdStrike, XSOAR, Burp Suite).
Problem-Solving: Excellent troubleshooting skills, especially in complex, distributed cloud environments.
Communication: Strong written and verbal communication, with the ability to clearly document procedures and solutions.
Work Ethic: Ability to work independently with minimal supervision in a fast-paced, collaborative, globally distributed team. Motivated, proactive, and committed to secure, reliable systems.
U.S. citizenship and U.S. residency are conditions of employment. You must be eligible and willing to apply for U.S. Government security clearances; an active clearance is a strong plus.
Please note: This role involves participation in an on-call rotation to provide 24/7 support, escalating and coordinating responses to high-severity issues as needed.