
Topic
Voice & Video AI
Text-to-speech, voice cloning, video generation, and audio AI
Featured

All Stories
Apple's Siri Redesign Embraces ChatGPT in iOS 27
Apple is redesigning Siri for iOS 27 with a new chat interface and pill-shaped bubble that emerges from the Dynamic…
Sesame launches iOS app with conversational AI agents
Sesame, a conversational AI startup founded by former Oculus leaders, has launched its iOS app to the public. The app…
YouTube Moves to Auto-Label AI-Generated Videos
YouTube is implementing automatic detection and labeling of videos containing significant photorealistic AI-generated…

Kuaishou's Kling Hits $500M Annualized Revenue
Kuaishou Technology's Kling AI video generation unit reached approximately $500 million in annualized revenue as of…

ElevenLabs adds selective editing to music generation model
ElevenLabs has released a new music-generation model that allows users to regenerate specific sections of a song while…

Twilio's AI Rally Masks Deeper Questions
Twilio's stock has surged 36% in 2026, outperforming a declining SaaS market, driven largely by investor enthusiasm for…
Anker's Liberty 5 Pro Earbuds Bring AI Chip to Consumer Audio
Anker has released the Liberty 5 Pro earbuds, its first to feature the Thus AI audio chip announced last month. The…
AI Video Moves Beyond Clips to Reshape Studio Production
AI video generation is evolving beyond low-quality viral clips toward tools that could reshape how studios produce…

SageMaker and vLLM Enable Real-Time Voice AI Without Custom Infrastructure
Amazon SageMaker AI now supports bidirectional streaming for real-time inference, enabling continuous two-way data flow…

AWS becomes fal's preferred cloud as generative media shifts to infrastructure
fal, a generative media platform serving 2.5 million developers, has selected AWS as its preferred cloud provider…

Specialized AI beats foundation models in healthcare speech recognition
Corti, a Copenhagen-based healthcare AI company, launched Symphony for Speech-to-Text, a clinical-grade speech…
Apple Embeds On-Device AI Into Accessibility Tools Across Platforms
Apple is expanding AI-powered accessibility features across iPhone, Mac, iPad, Apple TV, and Vision Pro, leveraging…

AWS Details Modular Voice Agent Design for Production Scale
Amazon has published a technical guide on building scalable voice agents using Nova Sonic, a speech-to-speech…
Amazon Alexa Plus Adds AI Podcast Generation
Amazon has rolled out podcast generation capabilities for Alexa Plus, its upgraded AI assistant, allowing users to…

OpenAI Acquires Voice-Cloning Startup Weights.GG
OpenAI acquired Weights.GG, a startup behind the AI voice-cloning tool Replay, in January 2026. About half a dozen…

Amazon Consolidates Rufus Into Alexa for Shopping
Amazon is rebranding its Rufus shopping chatbot to Alexa for Shopping, consolidating its AI assistant strategy around…

Perceptron Mk1 undercuts rivals 80-90% on video AI pricing
Perceptron Inc., a two-year-old startup led by former Meta and Microsoft researchers, released Mk1, a video analysis AI…

Google Embeds Gemini Dictation in Gboard, Pressuring Startups
Google is integrating Gemini-powered dictation directly into Gboard, its keyboard app, with an initial rollout to…

Kuaishou Pursues External Funding for Kling AI Video Unit
Kuaishou Technology confirmed via regulatory filing that its board is evaluating a restructuring of Kling, its AI video…

Vapi hits $500M valuation with Amazon Ring win
Vapi, an AI voice platform startup, has reached a $500M valuation after winning Amazon Ring as a customer, beating out…

Kuaishou to Spin Off Kling AI Video Unit at $20B Valuation
Kuaishou Technology, a Chinese social media platform, is planning to spin off its Kling AI video generation unit ahead…

Wispr Flow's Hinglish bet pays off in India voice AI market
Wispr Flow reports accelerated growth in India following its Hinglish language rollout, demonstrating early traction in…

The Voice-First Office: How AI Will Reshape Workplace Design
As voice interaction with AI systems becomes more prevalent in workplace settings, the nature of office environments…

OpenAI Adds Reasoning to Realtime Voice Models
OpenAI has released new realtime voice models available through its API that can reason, translate, and transcribe…
AI Agents Can Now Publish Podcasts Directly to Spotify
A new command-line tool called Save to Spotify enables AI agents like Claude and OpenAI Codex to directly publish…

Uber Deploys OpenAI to Optimize Driver Earnings and Rider Booking
Uber has integrated OpenAI technology to power AI assistants and voice features for both drivers and riders on its…

Parloa brings voice AI agents to enterprise customer service
Parloa has built a platform that uses OpenAI models to power voice-driven AI customer service agents for enterprises.…
Google Upgrades Gemini for Home to Handle Complex Multi-Step Tasks
Google has upgraded Gemini for Home to version 3.1, enabling the smart home assistant to handle more complex,…

Visual AI Now Drives App Growth, But Revenue Lags Downloads
According to Appfigures data, app launches featuring visual AI models are generating 6.5 times more downloads than…