VFF - The signal in the noise
NewsTrending

Google Launches Gemini Omni for AI-Powered Video Generation and Editing

Read original
Share
Google Launches Gemini Omni for AI-Powered Video Generation and Editing

Google DeepMind has introduced Gemini Omni, a multimodal model that generates and edits video from mixed inputs including images, audio, video, and text. The first model in the family, Gemini Omni Flash, is rolling out to the Gemini app, Google Flow, and YouTube Shorts with the ability to edit videos through natural language conversation while maintaining character consistency and physical coherence across multiple turns. Future versions will support additional output modalities like image and audio generation.

  • Gemini Omni Flash enables video generation and editing from mixed input modalities (text, image, audio, video)
  • Users can edit videos conversationally with natural language, with edits building on previous instructions while maintaining scene consistency
  • Initial rollout targets Gemini app, Google Flow, and YouTube Shorts, with image and audio output modalities planned for future releases
  • The model grounds video generation in Gemini's real-world knowledge and allows users to transform existing footage or create entirely new content

Gemini Omni represents a significant step in multimodal AI capability, moving beyond text-to-image generation into video creation and editing. This consolidates reasoning and creative generation into a single model, which could reshape how creators and enterprises approach video production and editing workflows. The conversational editing interface lowers the technical barrier for complex video manipulation tasks.

For content creators and media companies, this tool could reduce production timelines and costs by enabling rapid iteration on video content through natural language prompts rather than traditional editing software. For Google, this positions Gemini as a competitive alternative to specialized video generation tools and integrates generative capabilities deeper into YouTube and its productivity suite.

  • Video generation and editing may shift from specialized software to conversational AI interfaces, affecting the competitive landscape for traditional video editing tools
  • Multimodal input handling at scale suggests progress toward more general-purpose AI systems that can reason across and generate across multiple content types
  • Integration into YouTube Shorts and Google Flow signals Google's strategy to embed generative capabilities into existing user-facing products rather than launching standalone tools

Monitor adoption rates and user feedback on video quality, consistency, and editing accuracy across multiple turns. Watch for competitive responses from other AI labs and video software vendors, and track whether Google expands output modalities (image, audio) on the timeline promised. Pay attention to any content moderation or authenticity challenges that emerge as video generation becomes more accessible.

Related Video

Share

Subscribe to the newsletter

The latest stories and analysis, delivered to your inbox.

Free. No spam. Unsubscribe any time.

Related stories

Google Launches Fast, Cheap Image Model for Enterprise Workflows
TrendingNews

Google Launches Fast, Cheap Image Model for Enterprise Workflows

Google launched Nano Banana 2 Lite, a lightweight image generation model built on Gemini 3.1 Flash-Lite architecture, capable of generating 1k resolution images in 4 seconds at $0.034 per 1,000 images. The model is available immediately to enterprise developers through Google AI Studio, the Gemini API, and GEAP. It trades resolution flexibility for speed and cost efficiency, targeting high-throughput commercial workflows like programmatic advertising and e-commerce asset generation.

by carl.franzen@venturebeat.com (Carl Franzen)· VentureBeat AI
Google Limits Meta's Gemini Access as AI Capacity Strains Persist

Google Limits Meta's Gemini Access as AI Capacity Strains Persist

Google imposed capacity limits on Meta's use of its Gemini AI models a few months ago, citing inability to meet the social media company's full demand. The restriction was not limited to Meta, as Google also constrained access for other clients. Google has since moved to address capacity issues by signing a deal to rent cloud computing capacity from Elon Musk's infrastructure.

by Martin Peers· The Information
Google Restructures Coding AI Team to Close Anthropic Gap
TrendingNews

Google Restructures Coding AI Team to Close Anthropic Gap

Google is restructuring a months-old strike team focused on AI coding tools, aiming to improve model training and expand capabilities beyond coding into areas like presentation creation. The reorganization reflects competitive pressure from Anthropic and OpenAI, which are also broadening their AI coding tool applications. The changes also formalize what was originally conceived as a short-term group into a more permanent structure.

by Erin Woo· The Information
Google Invests $75M in A24 to Build AI Movie Tools
TrendingNews

Google Invests $75M in A24 to Build AI Movie Tools

Google's DeepMind is investing approximately $75 million in A24, the independent film studio, to develop AI-powered movie production tools. This marks Google's first equity stake in a film studio and will span multiple projects focused on helping filmmakers expand their creative workflows. The non-exclusive collaboration aims to create tools shaped by filmmaker input rather than imposed from outside the industry.

by Jess Weatherbed· The Verge AI