Oracle Health & Analytics is a rapidly growing organization that leverages Oracle's cloud technologies to modernize and automate healthcare. Our mission is to improve the quality of life by delivering better, more secure experiences and easier access to health and research data for patients and providers. As a new line of business, we foster a creative, entrepreneurial environment unencumbered by legacy systems and value expertise that helps us create a world-class engineering center focused on excellence.

Required Qualifications

BS or MS in Computer Science or equivalent domain experience.
4–6 years of relevant SRE or cloud engineering experience, operating independently on senior projects.
Experience deploying and managing large-scale, customer-facing web services in a public cloud infrastructure (e.g., OCI, AWS, Azure).
Expertise in automated deployment and configuration management tools (Terraform, Kubernetes, Ansible, etc.).
Hands-on experience with CI/CD for data workflows, DataOps orchestration, and automated data pipeline management.
Familiarity with observability tools and methodologies: monitoring, alerting, logging, and performance tuning.
Proficient with scripting and programming languages (Python, Bash, etc.) for automation and system integration.
Track record of incident management/troubleshooting and root cause analysis in distributed systems.
Strong written and verbal communication skills, able to clearly present complex technical information to diverse audiences.
US citizenship and eligibility for federal security clearance (if applicable).

Preferred Qualifications

Knowledge of healthcare data management, compliance, and governance.
Experience with data migration, modernization, and control plane architecture.

As a Site Reliability Engineer, you will play a critical role in building and operating the control plane for Oracle Health's modern cloud-based SI platform, with an emphasis on Observability & Scaling.
You will design, implement, and automate processes and systems that ensure mission-critical data workflows are secure, reliable, resilient, and highly available. This role presents an opportunity to solve complex problems involving large-scale distributed systems, data pipeline management, and automation, all in a highly collaborative, agile environment.

Key Responsibilities

Design, implement, and operate the control plane that ensures observability & scaling for data-centric services.
Lead efforts in automated data pipeline management, including CI/CD for data workflows, data migration, and modernization.
Develop and maintain robust monitoring, alerting, and observability tooling to ensure system performance, reliability, and rapid incident response.
Partner with development teams to implement improvements in service architecture, focusing on automation, self-healing, and real-time monitoring.
Build and operate DataOps automation and orchestration platforms, including onboarding & bootstrapping automation for new services and tenants.
Participate in incident management, troubleshooting, and root cause analysis for issues impacting data pipelines, access, or system availability.
Support data access control and governance by designing solutions that meet strict security and compliance requirements.
Define and improve KPIs, SLOs, and metrics for data platforms and services.
Contribute to technology strategy, including data modernization, automation frameworks, and integration of new technologies.
Collaborate in cross-functional teams and communicate complex technical concepts to stakeholders in clear, concise ways.