Seattle, WA, USA
9 days ago
Sr Principal Software Engineer, AI Workload Management

Here at OCI, we’re building the world’s largest AI clusters, and we’re the fastest at bringing them to market. The AI Infrastructure organization at OCI is leading this effort by creating a GPU-focused cloud with the latest hardware, providing the best performance, efficiency, reliability, and scalability. This is your chance to be part of the AI revolution by creating systems that allow customers to scale from tens to thousands of GPUs without compromising performance. You will have the opportunity to work with cutting-edge technologies and make a significant impact on our organization's success.

We are looking for a highly skilled and motivated distributed systems engineer who can architect solutions that optimize AI infrastructure components, such as the GPU control plane and GPU data plane, to support a variety of AI workloads. You will provide technical leadership to the team, bring clarity to ambiguous problems, and come up with innovative solutions. You will collaborate with cross-functional teams to enhance our AI infrastructure and deliver an exceptional customer experience and peak performance.
