Seattle, WA, United States
8 hours ago
Consulting Member of Technical Staff

The Oracle Cloud Infrastructure (OCI) Compute organization offers GPU Superclusters, bare metal CPUs, and virtual machines at scale to our customers. With rapid growth in machine learning, the demand for GPUs is exploding, making performance and efficiency of cloud scale services a critical area of investment. 

 

The Core Architecture team partners with teams across the entire Compute organization to identify performance and efficiency constraints within the lifecycle of compute services from forecasting, inventory management, capacity ingestion, placement, repair, and decommissioning. Consulting engineers are responsible for performing deep analysis of critical business problems, identifying bottlenecks and proposing & incubating new architectural constructs that address the needs of some of our largest customers. These solutions could take the shape of new microservices or restructuring of the control plane services and dataflow.

 

You will take the lead in defining the architecture for the brand-new host state management engine that will power the next generation of the Compute Control Plane.  This initiative spans across multiple Compute domains, from GPU validation to repairs, and you will drive engineers from these organizations to build microservice based solutions that will enable Compute to scale for growing customer demands.

We are looking for a hands-on senior principal engineer with technical breadth, proven experience in solving cloud scale problems, distributed systems design & implementation experience to build fault tolerant solutions that will form the foundations of the next generation of Compute offerings. The candidate is expected to have strong written and verbal communications skills, the ability to lead projects across organizational boundaries, and experience representing their work to senior leadership.

 

Career level-IC5

Por favor confirme su dirección de correo electrónico: Send Email