Topic

LLMs

Large language model releases, benchmarks, and capability research

Featured

LLMs

xAI releases Grok 4.5 as cheaper Opus-class alternative

by Lucas Ropek · 6 days ago · TechCrunch AI

LLMs

MiniMax Plans 2.7-Trillion Parameter Model for Q3 Launch

by Juro Osawa · 7 days ago · The Information

Anthropic

Anthropic wins approval to restore Claude Fable 5 after Trump talks

by Hayden Field · 14 days ago · The Verge AI

All Stories

LLMs · Trending

Musk Directs Tesla Staff to Adopt xAI's Grok Model

Elon Musk sent a memo to Tesla staff directing them to adopt Grok, the AI model developed by xAI, citing lower token…

by Grace Kay · 1 day ago · The Information

LLMs

Startup Shrinks 27B-Parameter Model to iPhone

PrismML, a Khosla Ventures-backed startup, claims to have compressed Alibaba's Qwen 3.6 large language model, which…

by Aaron Tilley · 5 days ago · The Information

Research

OpenAI Researcher: GPT-5.6 Beats Human Interns on Most Tasks

At the International Conference on Machine Learning in Seoul, OpenAI senior researcher Noam Brown stated that GPT-5.6…

by Stephanie Palazzolo · 6 days ago · The Information

LLMs

Nemotron 3 Ultra Matches Closed Models at 10x Lower Cost

NVIDIA's Nemotron 3 Ultra model, tuned through LangChain's Deep Agents harness, achieved benchmark-leading performance…

by Adel El Hallak · 6 days ago · NVIDIA Blog (AI)

Research · Trending

Anthropic finds consciousness-like structure in Claude

Anthropic published research showing that Claude language models have spontaneously developed an internal structure…

by Michael Nuñez · 8 days ago · VentureBeat AI

LLMs · Trending

Tencent's Hy3 removes licensing barrier, but GLM-5.2 keeps coding crown

Tencent released Hy3, a 295-billion-parameter open-weight model under Apache 2.0 license, removing regional…

by Sam Witteveen · 8 days ago · VentureBeat AI

LLMs

Mistral AI raises funding to democratize frontier AI models

Mistral AI, founded in 2023, has secured significant funding to develop open source AI models with the stated goal of…

by Anna Heim · 9 days ago · TechCrunch AI

LLMs · Trending

Z.ai launches ZCode to undercut Cursor and Claude Code

Z.ai, a Beijing-based AI lab, launched ZCode, a free desktop application designed as an agent-first development…

by Michael Nuñez · 12 days ago · VentureBeat AI

Research

Why Every LLM Gives You the Same Answer

Large language models exhibit severe homogeneity in their responses to open-ended questions, converging on predictable…

by Will Douglas Heaven · 13 days ago · MIT Technology Review

Anthropic · Trending

Anthropic Cuts Prices on Claude Sonnet 5 to Challenge Agent Market

Anthropic has launched Claude Sonnet 5, a model positioned as a more affordable alternative to its Opus offering and…

by Rebecca Bellan · 14 days ago · TechCrunch AI

DeepSeek · Trending

DeepSeek Open-Sources DSpark, Cutting LLM Inference Costs by Up to 85%

DeepSeek has open-sourced DSpark, an MIT-licensed framework that accelerates large language model inference by up to…

by Carl Franzen · 15 days ago · VentureBeat AI

LLMs

Open-Source AI Gains as Regulatory Pressure Mounts

Open-source AI models are gaining traction among developers and companies as a response to Trump administration…

by Stephanie Palazzolo · 15 days ago · The Information

Research

New agentic memory cuts token use 27x vs. competitors

Researchers at the National University of Singapore developed MRAgent, a framework that dynamically reconstructs memory…

by Ben Dickson · 16 days ago · VentureBeat AI

AI Security · Trending

Chinese AI Matches U.S. Leader in Cybersecurity Capabilities

Security researchers have found that Z.ai's GLM-2 model matches Anthropic's Mythos in cybersecurity capabilities,…

by Martin Peers · 16 days ago · The Information

LLMs

Cara Builds Domain-Specific AI for Insurance on AWS

Cara, an AI-native platform built on AWS, automates back-office workflows for enterprise insurance brokerages by using…

by Amaan Babul · 18 days ago · AWS Machine Learning Blog

Research

Self-Improving Agents: Shanghai Lab Cuts Manual Tuning

Researchers at Shanghai Artificial Intelligence Laboratory have introduced Self-Harness, a framework that enables…

by Ben Dickson · 22 days ago · VentureBeat AI

LLMs · Trending

Sakana's Fugu sidesteps export controls with multi-model orchestration

Sakana AI launched Fugu, a multi-agent orchestration system that routes queries across a pool of specialized AI models…

by Carl Franzen · 22 days ago · VentureBeat AI

LLMs

Startup Claims Breakthrough in LLM Efficiency, Backed by Third-Party Tests

Miami-based AI startup Subquadratic emerged from stealth claiming it solved a decade-old mathematical bottleneck in…

by Will Douglas Heaven · 25 days ago · MIT Technology Review

LLMs

Z.ai's Open GLM-5.2 Beats GPT-5.5 on Coding, Costs 1/6th as Much

Z.ai released GLM-5.2, a 753-billion parameter open-weights LLM that outperforms OpenAI's GPT-5.5 on multiple…

by Carl Franzen · 28 days ago · VentureBeat AI

Research · Trending

Tencent Backs Alibaba's Former Qwen Researcher in $20M AI Lab Deal

Tencent Holdings has invested $20 million in an AI lab founded by Junyang Lin, the former lead researcher behind…

by Jing Yang · 30 days ago · The Information

Research · Trending

Google's 'Faithful Uncertainty' Lets LLMs Hedge Instead of Hallucinate

Google researchers propose 'faithful uncertainty,' a technique that allows large language models to express qualified…

by Ben Dickson · about 1 month ago · VentureBeat AI

LLMs · Trending

Moonshot's K2.7-Code cuts costs but skips independent benchmarks

Moonshot AI released Kimi K2.7-Code, an open-source coding model claiming 30% lower thinking-token usage and…

about 1 month ago · VentureBeat AI

Research

Context compression reaches production viability with 16x reduction

Researchers from NYU, Columbia, Princeton, University of Maryland, Harvard, and Lawrence Livermore National Laboratory…

about 1 month ago · VentureBeat AI

LLMs · Trending

Apple's Flash-Based Model Architecture Breaks On-Device Memory Ceiling

Apple announced AFM 3, a new foundation model family developed with Google that includes a 20-billion-parameter…

about 1 month ago · VentureBeat AI