USA
1 day ago
Senior Software Engineer

The High Performance Computing and Artificial Intelligence (HPC/AI) team is on a mission to build the next-generation distributed AI supercomputer, enabling breakthroughs in artificial intelligence by delivering unmatched computational power, scalability and reliability. We design and develop cutting-edge infrastructure that supports high-performance AI model training at scale, laying the foundation for innovations that redefine what AI can achieve.


We are looking for passionate and innovative software engineers to design and develop the tooling and infrastructure that powers the next generation of large-scale AI and HPC networking systems. In this role, you will build network automation tools, observability frameworks, and performance optimization systems that are critical for achieving ultra-low latency, high throughput, and petabyte-scale efficiency in distributed AI workloads.

As a Senior Software Engineer on the HPC & AI Infrastructure team, you’ll work at the intersection of AI supercomputing and large-scale networking, shaping how advanced AI models are trained and deployed in the cloud. Your contributions will directly impact the reliability and performance of massive distributed clusters, leveraging high-speed fabrics (e.g., InfiniBand, RoCE) and accelerated compute platforms (e.g., NVIDIA, AMD GPUs).

 

This is a unique opportunity to build core software infrastructure—from telemetry and diagnostics tools to orchestration and network configuration systems—that ensures observability, debuggability, and operational excellence at exascale levels. You’ll collaborate across hardware, infrastructure, and ML platform teams to deliver systems that push the boundaries of what's possible in AI training and inference. If you're excited about distributed systems, low-level performance engineering, and software for next-generation AI infrastructure, come help us build the backbone of the AI supercomputers of tomorrow.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Por favor confirme su dirección de correo electrónico: Send Email