Provision, configure, and maintain core AWS infrastructure, including EC2, VPCs, S3, RDS, and other services.
Manage storage, networking, and compute resources to ensure optimal performance and availability.
Oversee access management (IAM), including roles, policies, and user permissions.
2. Monitoring & Performance Optimization:Implement robust monitoring solutions using Amazon CloudWatch and other tools to track system health, performance, and availability.
Perform capacity planning, resource optimization, and billing/tag management to maintain operational efficiency.
Manage logging, inventory, and operational metrics to ensure proactive issue detection.
3. Automation & Deployment:Automate infrastructure provisioning and deployment workflows using tools such as AWS CloudFormation and Systems Manager.
Build and maintain reusable automation scripts and orchestration solutions for large-scale, distributed environments.
Develop and maintain scripts using languages such as Python, Bash, or PowerShell to automate routine tasks and improve operational efficiency.
Support CI/CD pipeline integration for streamlined application deployment.
4. Troubleshooting & Support:Diagnose and resolve issues related to networking, system performance, and AWS service outages.
Provide operational support for cloud-hosted applications and ensure minimal downtime.
5. Platform Engineering & Governance:Develop core platform capabilities that enable self-service deployments, codified infrastructure patterns, and consistent operations.
Integrate operational tools and services across the cloud environment to support enterprise-grade IT processes.