News · Trending

Blackwell Sweeps MLPerf Training 6.0 Across All Benchmarks

Shruti Koparkar · Jun 17, 2026 · about 2 months ago

NVIDIA's Blackwell platform swept MLPerf Training 6.0 benchmarks, achieving the fastest training times across all seven tests, scaling to 8,192 GPUs, and being the only platform with submissions across the entire suite. The results reflect deep co-engineering between NVIDIA and cloud partners like Microsoft Azure and CoreWeave on system architecture, networking, and software optimization for large-scale model training.

TL;DR

Blackwell achieved the fastest training time on all seven MLPerf Training 6.0 benchmarks, including two new mixture-of-experts workloads (DeepSeek-V3 671B and GPT-OSS-20B)
GB300 NVL72 delivered up to 1.6x faster training than GB200 NVL72 at the same scale, driven by higher compute density with NVFP4, expanded memory, and a higher power ceiling
Largest-scale Blackwell submission to date: 8,192 GPUs on DeepSeek-V3 671B using GB200 NVL72 systems, with CoreWeave reaching the quality target in 2.02 minutes
Microsoft Azure trained Llama 3.1 405B on 8,192 GPUs in 7.07 minutes, the fastest time for that benchmark, demonstrating production-ready reliability at scale
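
To put the two 8,192-GPU results on a common footing, the wall-clock times can be converted into aggregate GPU-minutes. The sketch below is illustrative arithmetic only, using the GPU counts and times quoted above; the function name `gpu_minutes` and the dictionary labels are assumptions introduced here, and the two benchmarks train different models to different quality targets, so the totals are not a like-for-like efficiency comparison.

```python
def gpu_minutes(num_gpus: int, wall_clock_minutes: float) -> float:
    """Aggregate GPU time consumed by a run: GPUs multiplied by wall-clock minutes."""
    return num_gpus * wall_clock_minutes


# Figures quoted in the article; labels are illustrative.
RESULTS = {
    "DeepSeek-V3 671B (CoreWeave, GB200 NVL72)": (8192, 2.02),
    "Llama 3.1 405B (Microsoft Azure)": (8192, 7.07),
}

for name, (gpus, minutes) in RESULTS.items():
    total = gpu_minutes(gpus, minutes)
    print(f"{name}: {total:,.0f} GPU-minutes ({total / 60:,.0f} GPU-hours)")
```

Even a run that finishes in a few minutes consumes thousands of GPU-minutes at this scale, which is why time-to-train and cost are so tightly linked.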

Why It Matters

Training infrastructure performance directly determines how quickly AI teams can iterate on models, what scale they can reach, and total cost of ownership. Blackwell's sweep across all benchmarks and demonstrated ability to scale to 8,192 GPUs signals that the platform is becoming the de facto standard for frontier model development, affecting competitive positioning across the AI industry.

Business Impact

For enterprises and cloud providers, Blackwell's performance gains translate to faster time-to-market for AI models and lower training costs per iteration. The co-engineering results with Azure and CoreWeave demonstrate that production-grade reliability at scale is achievable, reducing risk for organizations planning large-scale training deployments.

Key Implications

Blackwell's dominance across all seven benchmarks establishes a clear performance baseline that competitors must match, likely accelerating adoption among model builders and cloud providers
The 1.6x performance improvement of GB300 over GB200 at the same scale creates a performance tier that may justify premium pricing for time-sensitive training workloads
Successful 8,192-GPU training runs demonstrate that production-grade reliability at extreme scale is achievable, reducing perceived risk for enterprises planning multi-month training campaigns
NVFP4 low-precision training methods achieving accuracy targets across different model architectures suggest a path to further cost reduction without sacrificing model quality

What to Watch

Monitor whether competing GPU providers (AMD, Intel) achieve comparable results on MLPerf Training 6.1 and beyond, and whether the performance gap narrows. Watch for adoption patterns among hyperscalers and whether GB300 NVL72 systems become the preferred choice for new frontier model training, which would indicate whether the 1.6x improvement justifies the upgrade cost in practice.
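
Whether the 1.6x improvement justifies the upgrade cost reduces to simple arithmetic: at the same GPU count, a system that is s times faster finishes a run in 1/s of the time, so its cost per run relative to the older system is the price ratio divided by s. The sketch below works through that relationship. The price ratios are hypothetical placeholders, not published pricing, and 1.6x is the article's "up to" figure rather than a guaranteed speedup.

```python
def relative_cost_per_run(speedup: float, price_ratio: float) -> float:
    """Cost per training run on the newer system relative to the older one.

    speedup:     how many times faster the newer system trains at the same scale
    price_ratio: newer system's price per GPU-hour divided by the older system's
    """
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return price_ratio / speedup


SPEEDUP = 1.6  # "up to" figure reported for GB300 NVL72 vs GB200 NVL72

# Hypothetical price ratios, for illustration only.
for price_ratio in (1.0, 1.3, 1.6, 2.0):
    rel = relative_cost_per_run(SPEEDUP, price_ratio)
    verdict = "cheaper" if rel < 1 else "break-even" if rel == 1 else "more expensive"
    print(f"price ratio {price_ratio:.1f}x -> relative cost per run {rel:.2f} ({verdict})")
```

Under these assumptions the break-even point sits at a price ratio equal to the speedup: any premium below 1.6x lowers cost per run, and any premium above it buys speed at a higher cost.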

Data & Training AI Hardware Infrastructure Generative AI

AI Drug Discovery Hits a Data Wall

AI is accelerating drug discovery by enabling predictive design of candidates and hit identification at scale, but the technology is exposing critical gaps in data quality and lab infrastructure. Drug companies are hitting a 'data wall' where publicly available datasets lack the structure and diversity needed to train accurate models, while lab teams struggle to validate the growing volume of AI-generated compounds. Success depends on closing the loop between computational prediction and experimental validation through better data collection and integration.

by MIT Technology Review Insights · 5 days ago · MIT Technology Review

Data & Training · Trending · News

Brain Waves Join Video as Physical AI Training Data

Frontier physical AI models are moving beyond video training data to incorporate multiple camera angles, dense annotation, and brain wave readings as training inputs. The shift reflects growing recognition that traditional video datasets alone are insufficient for training AI systems that interact with the physical world. Brain wave data represents an emerging frontier in multimodal training approaches for robotics and embodied AI.

by Tim Fernholz · 5 days ago · TechCrunch AI

Data & Training · News

Mercor's $614M Revenue Surge Hinges on AI Lab Spending

Mercor, a three-year-old data startup that trains AI models through contractor networks, generated $614 million in gross revenue in the first half of 2026, up 70% from all of 2025. The company's growth is heavily concentrated among AI foundation model makers, with 91% of first-half revenue coming from customers like OpenAI, Anthropic, and Google DeepMind. This revenue concentration reveals both the startup's market traction and its dependency on a narrow customer base.

by Julia Hornstein · 11 days ago · The Information

Data & Training · News

Anthropic's $1.5B settlement approved, but copyright question remains open

A federal court has granted final approval to Anthropic's $1.5 billion copyright settlement, resolving one major legal case against the AI company. The settlement addresses claims over the use of copyrighted works in model training but does not establish precedent for the broader industry question of whether copyrighted material can legally be used to train AI systems. The approval marks a significant moment in AI litigation but leaves the fundamental legal question unresolved.

by Kirsten Korosec · 11 days ago · TechCrunch AI