NVIDIA Unveils Vera Rubin AI Accelerators with Unprecedented Performance

News Overview

NVIDIA has announced its next-generation AI accelerators, the Vera Rubin series, featuring groundbreaking performance metrics.
The Vera Rubin GPUs are designed with dual GPU dies on a single substrate, offering 50 PFLOPS of FP4 performance and equipped with 288 GB of HBM4 memory.
The Vera Rubin NVL144 platform will integrate 144 GPUs, delivering 3.6 ExaFLOPS (FP4) and 1.2 ExaFLOPS (FP8) of performance, while the NVL576 platform will feature 576 GPUs, achieving 15 ExaFLOPS (FP4).

Original article: iXBT

In-Depth Analysis

Technical Specifications

GPU Architecture: Dual GPU dies on a single substrate
Performance: 50 PFLOPS in FP4 precision per GPU
Memory: 288 GB HBM4 per GPU
Platform Configurations:
- Vera Rubin NVL144: 144 GPUs, 3.6 ExaFLOPS (FP4), 1.2 ExaFLOPS (FP8)
- Vera Rubin NVL576: 576 GPUs, 15 ExaFLOPS (FP4)

Architectural Features

Vera CPU: An accompanying processor with 88 ARM cores supporting hyper-threading, designed to complement the Vera Rubin GPUs.

Product Lineup and Availability

Release Schedule:
- Vera Rubin: Scheduled for release in 2026
- Vera Rubin Ultra: Expected in 2027, offering enhanced performance over the standard Vera Rubin models

Commentary

NVIDIA’s introduction of the Vera Rubin series marks a significant leap in AI accelerator technology, reflecting the company’s commitment to meeting the escalating computational demands of advanced AI applications. The innovative dual-die design and substantial memory capacity position the Vera Rubin GPUs to handle increasingly complex AI workloads efficiently. However, the immense performance capabilities come with heightened power requirements, necessitating advanced cooling solutions and infrastructure considerations. As NVIDIA continues to push the boundaries of AI hardware, the Vera Rubin series is poised to play a pivotal role in shaping the future landscape of AI research and deployment.