VFF - The signal in the noise
NewsTrending

Blackwell Sweeps MLPerf Training 6.0 Across All Benchmarks

Read original
Share
Blackwell Sweeps MLPerf Training 6.0 Across All Benchmarks

NVIDIA's Blackwell platform swept MLPerf Training 6.0 benchmarks, achieving the fastest training times across all seven tests, scaling to 8,192 GPUs, and being the only platform with submissions across the entire suite. The results reflect deep co-engineering between NVIDIA and cloud partners like Microsoft Azure and CoreWeave on system architecture, networking, and software optimization for large-scale model training.

  • Blackwell achieved fastest training time on all seven MLPerf Training 6.0 benchmarks, including two new mixture-of-experts workloads (DeepSeek-V3 671B and GPT-OSS-20B)
  • GB300 NVL72 delivered up to 1.6x faster training than GB200 NVL72 at the same scale, driven by higher compute density with NVFP4, expanded memory, and higher power ceiling
  • Largest-scale Blackwell submission to date: 8,192 GPUs on DeepSeek-V3 671B using GB200 NVL72 systems, with CoreWeave reaching quality target in 2.02 minutes
  • Microsoft Azure trained Llama 3.1 405B on 8,192 GPUs in 7.07 minutes, the fastest time for that benchmark, demonstrating production-ready reliability at scale

Training infrastructure performance directly determines how quickly AI teams can iterate on models, what scale they can reach, and total cost of ownership. Blackwell's sweep across all benchmarks and demonstrated ability to scale to 8,192 GPUs signals that the platform is becoming the de facto standard for frontier model development, affecting competitive positioning across the AI industry.

For enterprises and cloud providers, Blackwell's performance gains translate to faster time-to-market for AI models and lower training costs per iteration. The co-engineering results with Azure and CoreWeave demonstrate that production-grade reliability at scale is achievable, reducing risk for organizations planning large-scale training deployments.

  • Blackwell's dominance across all seven benchmarks establishes a clear performance baseline that competitors must match, likely accelerating adoption among model builders and cloud providers
  • The 1.6x performance improvement of GB300 over GB200 at the same scale creates a performance tier that may justify premium pricing for time-sensitive training workloads
  • Successful 8,192-GPU training runs demonstrate that production-grade reliability at extreme scale is achievable, reducing perceived risk for enterprises planning multi-month training campaigns
  • NVFP4 low-precision training methods achieving accuracy targets across different model architectures suggest a path to further cost reduction without sacrificing model quality

Monitor whether competing GPU providers (AMD, Intel) achieve comparable results on MLPerf Training 6.1 and beyond, and whether the performance gap narrows. Watch for adoption patterns among hyperscalers and whether GB300 NVL72 systems become the preferred choice for new frontier model training, which would indicate whether the 1.6x improvement justifies the upgrade cost in practice.

Share

Subscribe to the newsletter

The latest stories and analysis, delivered to your inbox.

Free. No spam. Unsubscribe any time.

Related stories

Meta embeds AI search into Facebook using public posts

Meta embeds AI search into Facebook using public posts

Meta is launching AI Mode, a new search feature on Facebook that generates AI-powered results by pulling from publicly-posted content across its platforms. The feature appears alongside traditional search modes like People and Marketplace, and allows users to ask follow-up questions to AI-generated results. This rollout is part of a broader set of new AI features Meta is introducing, including photo presets for swapping sports jerseys and collage template suggestions.

by Stevie Bonifield· The Verge AI
Warner Music acquires AI attribution startup Sureel AI

Warner Music acquires AI attribution startup Sureel AI

Warner Music Group has acquired Sureel AI, an attribution startup focused on tracking how artists' work is used in AI-generated content and model training. The deal reflects growing industry concern over unauthorized use of copyrighted music in AI systems. WMG aims to gain better visibility and control over its catalog's deployment in AI applications.

by Aisha Malik· TechCrunch AI
NVIDIA and LG Build AI Factory for Robotics and Manufacturing

NVIDIA and LG Build AI Factory for Robotics and Manufacturing

NVIDIA and LG Group are establishing an AI factory to accelerate LG's robotics, autonomous driving, and data center businesses. The partnership integrates NVIDIA's AI infrastructure and digital twin technologies with LG's manufacturing expertise and consumer electronics capabilities. The collaboration will focus on physical AI development, robot simulation and training, and next-generation AI factory infrastructure aligned with NVIDIA's DSX platform.

by Madison Huang· NVIDIA Blog (AI)
UK Forces Google to Let Publishers Opt Out of AI Search

UK Forces Google to Let Publishers Opt Out of AI Search

The UK Competition and Markets Authority has ruled that Google must allow publishers to opt out of AI Search features, including AI Overviews and the use of their content for fine-tuning AI models. This marks the first regulatory requirement globally forcing a search engine to provide publishers with control over content used in generative AI features. The ruling strengthens publishers' negotiating position with Google over content usage and compensation.

by Jess Weatherbed· The Verge AI