VFF - The signal in the noise
NewsTrending

Google Taps Samsung for Next-Gen AI Chip as TSMC Capacity Tightens

Read original
Share
Google Taps Samsung for Next-Gen AI Chip as TSMC Capacity Tightens

Google is in talks with Samsung Electronics to manufacture part of its next-generation Tensor Processing Unit, code-named Icefish, using Samsung's 2-nanometer production technology. The move reflects broader industry pressure as AI chip demand strains manufacturing capacity at Taiwan Semiconductor Manufacturing Co., the dominant supplier. Icefish is planned as Google's 10th-generation TPU for use in cloud data centers.

  • Google is negotiating with Samsung to produce components of Icefish, its next-generation AI chip, using 2-nanometer technology
  • The chip is planned as Google's 10th-generation TPU for cloud data center deployment
  • Capacity constraints at TSMC are driving major chip firms to diversify manufacturing partners
  • Samsung's involvement signals a shift in how leading tech companies source advanced semiconductor production

Global AI infrastructure depends on a small number of advanced chip manufacturers. As demand for AI chips accelerates, reliance on a single supplier creates bottlenecks. Samsung's entry into Google's supply chain represents a structural shift in how the industry manages capacity constraints and reduces concentration risk.

For Google, securing Samsung as a manufacturing partner reduces dependency on TSMC and helps ensure steady supply of critical AI infrastructure. For Samsung, the deal represents a major validation of its advanced chip manufacturing capabilities and opens a significant revenue stream from a top-tier customer.

  • TSMC's near-monopoly on cutting-edge chip production is eroding as customers seek alternative suppliers
  • Samsung's 2-nanometer technology is now competitive enough for Google's most advanced AI workloads
  • Geopolitical and supply chain diversification are becoming strategic imperatives for major tech companies building AI infrastructure

Monitor whether Samsung secures additional orders from other major AI chip buyers, which would signal broader industry acceptance of its advanced manufacturing. Track Icefish's development timeline and any public announcements from Google or Samsung about the partnership. Watch for similar diversification moves by other hyperscalers facing capacity constraints.

Share

Our Briefing

Weekly signal. No noise. Built for founders, operators, and AI-curious professionals.

No spam. Unsubscribe any time.

Related stories

DeepMind commits $10M to multi-agent AI safety research
TrendingNews

DeepMind commits $10M to multi-agent AI safety research

Google DeepMind and partners have announced a $10M funding call dedicated to multi-agent AI safety research. The initiative aims to address safety challenges that emerge when multiple AI systems interact with each other. This represents a targeted investment in a research area that has received less attention than single-agent safety concerns.

· Google Deepmind
German Court Holds Google Liable for AI Overviews Errors
TrendingNews

German Court Holds Google Liable for AI Overviews Errors

A German court ruled that Google is legally responsible for the accuracy of content generated by its AI Overviews feature, which produces AI-generated answers within Google search results. The ruling treats AI-generated content as Google's own statements rather than neutral search results, establishing potential liability for factual errors. This decision could have broad implications for how AI-generated content is regulated across jurisdictions.

by Martin Peers· The Information
Google releases DiffusionGemma for 4x faster local text generation
TrendingNews

Google releases DiffusionGemma for 4x faster local text generation

Google DeepMind released DiffusionGemma, a 26B Mixture of Experts model that generates text up to 4x faster than autoregressive models by producing entire blocks of text simultaneously rather than token-by-token. The open experimental model, available under Apache 2.0 license, achieves 1000+ tokens per second on NVIDIA H100 GPUs and fits within 18GB VRAM on consumer hardware when quantized. The trade-off is lower output quality compared to standard Gemma 4, positioning it for speed-critical applications like real-time editing and code infilling rather than production use cases demanding maximum quality.

· Google Deepmind
Google DeepMind Releases Gemma 4 12B for Laptop-Based AI
TrendingNews

Google DeepMind Releases Gemma 4 12B for Laptop-Based AI

Google DeepMind introduced Gemma 4 12B, a multimodal AI model designed to run on consumer laptops with 16GB of RAM. The model uses an encoder-free architecture that processes vision and audio inputs directly into the language model backbone, reducing latency and memory overhead. Performance approaches the larger 26B model while maintaining a smaller footprint, and it is released under an Apache 2.0 license.

· Google Deepmind