News

AWS Shows How to Build Voice Agents for Healthcare Appointments

Jimin KimJun 25, 2026 · about 24 hours ago

AWS has published a technical guide for building a voice-based healthcare appointment agent using Amazon Nova 2 Sonic and Amazon Bedrock AgentCore. The agent handles patient authentication, appointment confirmation or rescheduling, and health information collection through natural speech conversation. US healthcare no-show rates range from 5-30 percent by specialty, representing significant lost revenue and provider time.

TL;DR

Amazon Nova 2 Sonic processes speech natively end-to-end, preserving vocal context like tone and hesitation instead of losing it in separate transcription steps
The agent authenticates patients by voice, manages appointments (confirm, cancel, reschedule), collects pre-visit health data, and escalates to human staff when needed
Architecture uses Amazon Bedrock AgentCore, Amazon Cognito, Amazon DynamoDB, and Amazon SNS with a React frontend for browser-based testing
Integration with Amazon Connect Customer enables outbound dialing to actual phone lines for production deployment

Why It Matters

Healthcare providers lose significant revenue to no-show rates between 5-30 percent depending on specialty. Traditional appointment reminder systems require manual one-by-one calling and don't scale. A voice agent that preserves vocal cues like tone and hesitation can respond more appropriately to patient anxiety or confusion, potentially improving engagement and reducing no-shows.

Business Impact

Automating appointment reminders and rescheduling at scale reduces labor costs and idle provider time while improving patient communication. The speech-to-speech approach avoids latency and context loss from chaining separate transcription, reasoning, and synthesis services, enabling more natural and responsive interactions.

Key Implications

Healthcare organizations can deploy serverless voice agents without building custom speech pipelines, lowering technical barriers to automation
Preserving vocal context in a single model may improve patient outcomes by allowing the agent to detect and respond to emotional cues rather than just transcribed words
Integration with existing telephony services like Amazon Connect makes production deployment feasible for clinic and hospital networks

What to Watch

Monitor adoption rates among healthcare providers and reported changes in no-show rates or patient satisfaction after deployment. Watch for regulatory or compliance considerations around voice authentication and patient data collection in healthcare settings. Track whether other cloud providers release similar speech-to-speech models and how they compare on latency and accuracy.

Voice & Video AI AI Agents AI for Business Generative AI AWS

Subscribe to the newsletter

The latest stories and analysis, delivered to your inbox.

Free. No spam. Unsubscribe any time.

Loka Cuts Voice AI Latency with Amazon Nova 2 Sonic

Loka built a voice AI agent using Amazon Nova 2 Sonic that processes audio end-to-end rather than converting speech to text and back, reducing response latency from 3-5 seconds to near-real-time while lowering costs. The approach achieved a speech reasoning score of 87.0 on Big Bench Audio, outperforming Google's Gemini 2.5 Flash (71.0) and OpenAI's GPT Realtime (83.0). The solution addresses a core frustration with traditional voice assistants: robotic, slow responses that damage customer experience and increase support costs.

by Bojan Jakimovski1 day ago· AWS Machine Learning Blog

Voice & Video AITrendingNews

ByteDance Upgrades Video AI Model to Seedance 2.5

ByteDance unveiled Seedance 2.5, an upgraded AI video generation model, at a Beijing conference on Tuesday. The new model improves upon Seedance 2.0, which was previously recognized as a significant breakthrough in AI video generation.

by Juro Osawa3 days ago· The Information

Voice & Video AITrendingNews

Fika Jobs raises $4M for AI-powered video hiring platform

Fika Jobs, a Stockholm-based startup, has raised $4 million to develop a video-first hiring platform that uses AI interview agents alongside short-form video candidate profiles. The platform blends elements of LinkedIn and TikTok to streamline recruitment. The funding supports the company's expansion of its AI-driven interview and candidate discovery capabilities.

by Lauren Forristal3 days ago· TechCrunch AI

Voice & Video AITrendingNews

Alibaba's HappyHorse Rises as Sora and Seedance Retreat

Alibaba Cloud released HappyHorse 1.1, an upgraded AI video generation model now ranked No. 2 globally on independent benchmarks. The release capitalizes on market consolidation following OpenAI's discontinuation of Sora and ByteDance's indefinite shelving of Seedance 2.0 due to financial and copyright pressures. HappyHorse is positioned as an enterprise-grade, API-first product backed by Alibaba's infrastructure, targeting integration into corporate content production workflows.

by michael.nunez@venturebeat.com (Michael Nuñez)3 days ago· VentureBeat AI

TL;DR

Why It Matters

Business Impact

Key Implications

What to Watch

Subscribe to the newsletter

Related stories

Loka Cuts Voice AI Latency with Amazon Nova 2 Sonic

ByteDance Upgrades Video AI Model to Seedance 2.5

Fika Jobs raises $4M for AI-powered video hiring platform

Alibaba's HappyHorse Rises as Sora and Seedance Retreat