NewsTrending

NVIDIA and Microsoft Build Unified Stack for Agentic AI

Dave SalvatorJun 3, 2026 · about 2 months ago

NVIDIA and Microsoft announced a unified AI stack for agentic AI deployment spanning Windows devices, Azure cloud, and local systems. The partnership includes new hardware like RTX Spark laptops and DGX Station for Windows, NVIDIA open models on Microsoft Foundry, GPU acceleration in Microsoft Fabric, and the OpenShell secure runtime. The announcement was made at Microsoft Build with NVIDIA CEO Jensen Huang joining Microsoft CEO Satya Nadella's keynote.

TL;DR

RTX Spark laptops and DGX Station for Windows bring 1 petaflop and 20 petaflops of AI performance respectively, arriving fall 2026 and Q4 2026 from major OEMs
NVIDIA Nemotron 3 Ultra, a new open reasoning model for long-running agents, launches this month on Microsoft Foundry alongside Anthropic Claude models running natively on Blackwell Ultra
NVIDIA GPU acceleration is now built into Microsoft Fabric Data Warehouse for faster SQL execution in enterprise data warehouses
NVIDIA OpenShell provides a secure runtime for autonomous agents, integrated into GitHub Copilot and Windows deployments

Why It Matters

Agentic AI requires not just capable models but also purpose-built hardware, secure runtimes, and fast data access. This partnership addresses all three layers, giving developers a complete stack from consumer laptops to enterprise cloud infrastructure. The timing matters because it signals that both companies view agentic AI as the next major computing paradigm requiring coordinated hardware and software investment.

Business Impact

Enterprises can now deploy autonomous agents across their entire infrastructure without vendor lock-in, using open models like Nemotron alongside proprietary ones. The hardware announcements from RTX Spark to DGX Station create a clear upgrade path for organizations scaling from development to production agentic workloads. Integration with Microsoft Foundry and Fabric means agents can access enterprise data warehouses with GPU acceleration, reducing latency for real-time decision-making.

Key Implications

Windows is being repositioned as a viable platform for AI development and deployment, not just consumption, competing with cloud-only and Linux-based approaches
Open models from NVIDIA are becoming first-class citizens in Microsoft's enterprise AI platform, reducing dependency on closed models from OpenAI and Anthropic alone
The secure runtime focus via OpenShell suggests both companies see governance and safety as table-stakes for agentic AI in enterprise settings
Hardware manufacturers including ASUS, Dell, HP, Lenovo, and MSI now have clear product roadmaps for AI-native devices, likely accelerating consumer and enterprise adoption

What to Watch

Monitor adoption rates of RTX Spark and DGX Station when they ship in fall 2026 and Q4 2026 respectively, as these will indicate whether enterprises are willing to invest in local agentic AI infrastructure. Watch for performance benchmarks comparing Nemotron 3 Ultra to competing reasoning models on Foundry. Track whether OpenShell becomes the de facto security standard for agentic AI or if alternative runtimes emerge.

AI Hardware AI Agents Infrastructure Generative AI

Subscribe to the newsletter

The latest stories and analysis, delivered to your inbox.

Free. No spam. Unsubscribe any time.

NVIDIA, Hugging Face Enable Distributed Fine-Tuning for Diffusion Models

NVIDIA and Hugging Face have integrated NeMo Automodel, an open-source training library, with the Diffusers ecosystem to enable distributed fine-tuning of video and image models at scale. The integration allows users to fine-tune diffusion models like FLUX.1-dev, Wan 2.1, and HunyuanVideo directly from Hugging Face Hub without checkpoint conversion or model rewrites. The collaboration brings production-grade capabilities including memory-efficient sharding, latent caching, and multiresolution bucketing to any Diffusers-format model.

1 day ago· Hugging Face Blog

AI HardwareTrendingNews

Valar Atomics Seeks $1B at $5B Valuation for Nuclear Data Center Power

Valar Atomics, a three-year-old startup developing small nuclear reactors for data centers and industrial facilities, is in fundraising talks for $1 billion at a pre-money valuation around $5 billion. Sequoia Capital is leading the discussions, which could include a mix of debt and equity. The funding round follows the company's achievement of a power milestone.

by Jemima McEvoy1 day ago· The Information

AI HardwareTrendingNews

UK Robotics Firm Humanoid Reaches Unicorn Status

Humanoid, a London-based robotics company, has achieved unicorn status after raising $150 million in the first tranche of a Series A funding round that values the company at $1.2 billion excluding new funds. The funding closed earlier this week, with the company reportedly aiming to raise additional capital. The milestone marks a significant validation for the UK robotics sector.

by Rocket Drew1 day ago· The Information

AI HardwareTrendingNews

China's CXMT Seeks $8.6B in Record Domestic Tech IPO

ChangXin Memory Technologies, China's leading memory-chip maker, filed for a Shanghai IPO seeking to raise at least 57.9 billion yuan ($8.6 billion), according to a regulatory filing on Wednesday. The offering is positioned to be the biggest tech listing in China's domestic market. The move reflects China's push to develop domestic semiconductor capabilities amid geopolitical tensions and supply chain concerns.

by Qianer Liu1 day ago· The Information

TL;DR

Why It Matters

Business Impact

Key Implications

What to Watch

Subscribe to the newsletter

Related stories

NVIDIA, Hugging Face Enable Distributed Fine-Tuning for Diffusion Models

Valar Atomics Seeks $1B at $5B Valuation for Nuclear Data Center Power

UK Robotics Firm Humanoid Reaches Unicorn Status

China's CXMT Seeks $8.6B in Record Domestic Tech IPO