News

AI Factories: Power and Tokens Drive Enterprise Economics

Jeremy GraybillMay 27, 2026 · about 2 months ago

Jeremy Graybill argues that AI factories function as token factories, converting electrical power into intelligence at scale. As agentic AI and autonomous agents proliferate in enterprise environments, the economics of AI deployment shift from traditional metrics to performance per watt and cost per token. This reframing reflects how infrastructure and efficiency, rather than model capability alone, will determine competitive advantage in AI deployment.

TL;DR

AI factories are fundamentally token factories that convert power into intelligence in real time
Agentic AI and always-on autonomous agents are driving enterprise deployment at scale
Performance per watt and cost per token are becoming the defining economic metrics
Infrastructure efficiency and power consumption will determine competitive advantage

Why It Matters

As AI moves from experimental projects to continuous, autonomous operation in enterprises, the underlying infrastructure economics become critical. Organizations can no longer optimize solely for model accuracy or capability. Instead, the ability to run agents continuously and cost-effectively depends on power efficiency and token economics, making infrastructure decisions as important as algorithmic ones.

Business Impact

For enterprises deploying agentic AI at scale, operational costs will be dominated by power consumption and token throughput rather than upfront model licensing. Companies that optimize for performance per watt and cost per token will have significant competitive advantages. This shifts investment priorities toward infrastructure, chip efficiency, and operational optimization.

Key Implications

Power efficiency and infrastructure design become primary competitive differentiators in AI deployment
Cost structures for AI operations will be dominated by continuous token generation rather than inference licensing
Enterprise AI strategies must prioritize infrastructure planning and power management alongside model selection
Hardware and chip manufacturers gain strategic importance in the AI value chain

What to Watch

Monitor how enterprises measure and optimize AI operational costs, particularly the shift toward per-token and per-watt metrics. Watch for infrastructure investments and partnerships that prioritize power efficiency. Track how chip manufacturers and cloud providers position efficiency improvements as competitive advantages in agentic AI deployment.

AI Hardware AI Agents AI for Business Infrastructure

Subscribe to the newsletter

The latest stories and analysis, delivered to your inbox.

Free. No spam. Unsubscribe any time.

Meta's custom AI chips enter production in September

Meta will begin production of its new custom AI chips in September 2026. The company is adopting a modular design approach to accommodate rapid changes in AI technology and evolving computational needs. This move reflects Meta's strategy to reduce dependence on third-party chip suppliers and control its AI infrastructure costs.

by Ram Iyer1 day ago· TechCrunch AI

AI HardwareTrendingNews

SK Hynix Raises Record $26.5B in U.S. IPO

SK Hynix, a South Korean memory chipmaker already listed in Seoul, raised $26.5 billion in a Nasdaq IPO, the largest ever by a foreign company in the U.S. and surpassing Alibaba's 2014 record of $25 billion. The company plans to deploy proceeds toward unspecified strategic initiatives. The listing marks a significant capital raise for the semiconductor sector amid ongoing global chip demand.

by Henry Siu1 day ago· The Information

AI HardwareNews

Startup Shrinks 27B-Parameter Model to iPhone

PrismML, a Khosla Ventures-backed startup, claims to have compressed Alibaba's Qwen 3.6 large language model, which contains 27 billion parameters, to run on an iPhone 17 Pro. This represents the largest AI model ever deployed on a mobile device, surpassing typical mobile models that operate with only a few billion active parameters. The achievement addresses Apple's broader effort to run powerful AI locally on iPhones to reduce cloud computing costs and improve user privacy.

by Aaron Tilley2 days ago· The Information

AI HardwareTrendingNews

Robotics Startup Bets on Video Game Data for AI Foundation Models

General Intuition is developing foundation models for robotics by training on millions of hours of video game data rather than real-world robot footage. The startup believes this approach can accelerate physical AI development by reducing the need for extensive real-world training data. The strategy mirrors how large language models like ChatGPT transformed AI by scaling training on vast datasets.

by Rebecca Bellan2 days ago· TechCrunch AI