VFF - The signal in the noise
News

DeepSeek-V4 Undercuts Premium AI Models by 85 Percent

carl.franzen@venturebeat.com (Carl Franzen)Read original
Share
DeepSeek-V4 Undercuts Premium AI Models by 85 Percent

DeepSeek released V4, a 1.6-trillion-parameter open source model that matches or exceeds the performance of OpenAI's GPT-5.5 and Anthropic's Claude Opus 4.7 while costing roughly one-sixth to one-seventh as much via API. The model is available free under MIT License on Hugging Face and through DeepSeek's API, with pricing of $5.22 per million input-output tokens compared to $35 for GPT-5.5 and $30 for Claude Opus 4.7. This release represents a major economic shift in frontier AI access and forces enterprises to recalculate the cost-benefit of premium closed models.

  • DeepSeek-V4-Pro costs $5.22 per million input-output tokens, roughly 1/6th the price of Claude Opus 4.7 ($30) and 1/7th the price of GPT-5.5 ($35) on standard pricing
  • The model is a 1.6-trillion-parameter Mixture-of-Experts system available free under MIT License, with performance near or exceeding closed-source frontier models on multiple benchmarks
  • DeepSeek-V4-Flash, the cheaper variant, costs $0.42 per million tokens, making it nearly 1/100th the cost of premium U.S. models while trading off performance
  • The release compresses advanced model economics into a lower price band, making previously uneconomical inference workloads viable for enterprises and developers

DeepSeek's V4 release accelerates the commoditization of frontier-class AI capabilities. The dramatic price compression forces OpenAI and Anthropic to defend their premium pricing and challenges the assumption that closed-source models justify their cost premium. This shift has immediate implications for how enterprises evaluate AI infrastructure spending and which tasks become economically viable to automate.

For operators and founders, V4 materially changes unit economics on inference-heavy workloads. Tasks that were too expensive to automate on GPT-5.5 or Claude Opus 4.7 become viable on DeepSeek-V4-Pro, expanding the addressable market for AI applications. Teams must now evaluate whether premium closed models justify their cost or whether open alternatives meet their performance requirements at a fraction of the price.

  • Price-based competitive differentiation for closed-source models becomes harder to sustain when open alternatives deliver comparable performance at 1/6th to 1/7th the cost
  • Enterprises running large-scale inference workloads face immediate pressure to benchmark DeepSeek-V4 against their current providers and renegotiate contracts or switch
  • The open source model availability under MIT License enables broader deployment without licensing friction, potentially accelerating adoption in regulated or cost-sensitive sectors
  • OpenAI and Anthropic may need to justify premium pricing through superior performance, faster inference, or specialized capabilities rather than general capability alone

Monitor whether OpenAI and Anthropic respond with price cuts or performance claims that differentiate their models. Track enterprise adoption rates of DeepSeek-V4 and whether the model's performance holds up in production workloads at scale. Watch for regulatory or geopolitical responses to DeepSeek's dominance, particularly given its Chinese origins and the current AI competition between the U.S. and China.

Share

Our Briefing

Weekly signal. No noise. Built for founders, operators, and AI-curious professionals.

No spam. Unsubscribe any time.

Related stories

Open-Source Search Agent Outperforms GPT-5.4
TrendingNews

Open-Source Search Agent Outperforms GPT-5.4

Researchers from UIUC, UC Berkeley, and Chroma released Harness-1, a 20-billion parameter open-source search agent that scores 73% on information recall benchmarks, outperforming GPT-5.4 (70.9%) and other proprietary models. The model is available under Apache 2.0 license on Hugging Face. Harness-1 achieves its performance by offloading search session management to a structured software environment rather than relying on expanded context windows, suggesting that model efficiency matters more than raw parameter size for autonomous retrieval tasks.

by carl.franzen@venturebeat.com (Carl Franzen)about 19 hours ago· VentureBeat AI
Microsoft Breaks Free From OpenAI Dependency With Independent AI Push
News

Microsoft Breaks Free From OpenAI Dependency With Independent AI Push

Mustafa Suleyman, CEO of Microsoft AI, discussed the company's restructured approach to AI development in a podcast interview. Microsoft signed a new contract with OpenAI in October that allows the company to pursue superintelligence independently while continuing to license OpenAI models. Suleyman has assembled a dedicated superintelligence team and built infrastructure to train frontier models, announcing seven new models across multiple modalities at Microsoft Build.

by Nilay Patel1 day ago· The Verge AI
Microsoft Breaks Free From OpenAI to Build Its Own Superintelligence
News

Microsoft Breaks Free From OpenAI to Build Its Own Superintelligence

Microsoft AI chief Mustafa Suleyman disclosed that a contractual change with OpenAI six months ago freed the company to independently pursue superintelligence using its own researchers, data, and custom silicon. The company announced seven new in-house AI models branded under the MAI family, including a flagship reasoning model and tools for coding, image generation, transcription, and voice synthesis. This marks a strategic shift where Microsoft is building alongside OpenAI rather than relying solely on it, though the company is not abandoning the partnership.

by michael.nunez@venturebeat.com (Michael Nuñez)2 days ago· VentureBeat AI
ChatGPT adds persistent memory across conversations
News

ChatGPT adds persistent memory across conversations

OpenAI has introduced a new memory system for ChatGPT designed to retain user preferences and maintain context across multiple conversations. The feature allows the AI assistant to remember details about users over time, reducing the need to repeat information in each new chat session. This update aims to make ChatGPT interactions more personalized and efficient for ongoing work.

5 days ago· OpenAI