Microsoft just launched the world’s first large-scale NVIDIA GB300 NVL72 cluster for OpenAI.
Built on a rack-scale system, each rack has:
→ 18 VMs
→ 72 GPUs + 36 Grace CPUs
→ 37 TB of fast memory
→ 130 TB/s NVLink bandwidth
→ 800 Gb/s per-GPU networking
→ 1,440 PFLOPS of FP4 Tensor performance
Microsoft Azure is now home to the world’s fastest AI cluster.
Another first for our AI fleet... a supercomputing cluster of NVIDIA GB300s with 4600+ GPUs and featuring next gen InfiniBand.
First of many as we scale to hundreds of thousands of GB300s across our DCs, and rethink every layer of the stack across silicon, systems, and software to support next gen AI workloads.