Google DeepMind has unveiled Genie 3, a groundbreaking general-purpose world model that creates immersive, interactive virtual environments from simple text prompts. The new model delivers 720p resolution with 24fps real-time navigation, maintaining visual and physical coherence for several minutes - a significant leap from its predecessor, Genie 2, which supported only 10-20 seconds of low-resolution interaction.
A new feature, "promptable world events," enables post-generation scene modifications such as adding rain, spawning animals, or inserting objects. This transforms Genie 3 into a dynamic, responsive environment well suited to exploration, as demonstrated by the impressively realistic scenarios showcased on DeepMind's blog.
Positioned as foundational technology for embodied AI agents - robots and virtual assistants that interact with their environments - Genie 3 is described by DeepMind research director Shlomi Fruchter as "the first real-time interactive general-purpose world model," suited to training simulated agents on tasks like warehouse navigation or executing complex instructions. It is currently available as a controlled research preview to a select group of academics and creators, a phase that allows DeepMind to evaluate safety protocols, address biases, and refine functionality.
By enabling AI-generated worlds that can be explored, modified, and remembered, Genie 3 represents substantial progress toward true embodied AI capable of reasoning, experimenting, and planning in simulation before real-world deployment. For AGI researchers, it provides a powerful new toolset, while creators, educators, and game designers gain access to unprecedented possibilities. Educators can build immersive learning environments, artists and developers can rapidly prototype game elements, and users can generate personalized virtual spaces - from horseback riding in New Zealand to coastal ocean views - through simple descriptive prompts.
"Today we announce Genie 3 - a general-purpose world model capable of generating more diverse interactive environments than ever before," stated Google in its blog post. "Given a text prompt, Genie 3 can create dynamic worlds navigable in real-time at 24fps while maintaining 720p coherence for multiple minutes. At DeepMind, we've pioneered simulation research for over a decade - from training agents to master real-time strategy games to developing environments for open-ended learning and robotics. This work has driven our development of world models - AI systems that leverage environmental understanding to simulate world aspects, enabling agents to predict environmental evolution and assess action impacts."
DeepMind has already employed Genie 3 to train its SIMA agent (Scalable Instructable Multiworld Agent), which successfully completed multi-step tasks such as navigating virtual warehouses to locate specific objects. While world models don't inherently "know" objectives, SIMA achieved success by planning within self-consistent simulations. Challenges remain, however: the range of agent actions is limited, simulation durations are still constrained to minutes, and modeling multi-agent interactions presents ongoing difficulties. Legible text in generated environments also tends to appear only when explicitly included in the prompt.
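To make the "planning in self-consistent simulations" idea concrete, here is a minimal, purely illustrative sketch - not the Genie 3 or SIMA API, whose interfaces are not public. It reduces the warehouse-navigation example to a toy grid world and uses breadth-first search as the planner: roll the simulated world forward from each state and keep the first action sequence that reaches the target.

```python
from collections import deque

# Toy stand-in for a simulated warehouse: 'S' = agent start, 'T' = target
# object, '#' = shelving the agent cannot pass through. All names and the
# environment itself are hypothetical illustrations.
GRID = [
    "S..#.",
    ".#.#.",
    ".#...",
    "...#T",
]

def find(grid, ch):
    """Locate the (row, col) of a cell marked with ch."""
    for r, row in enumerate(grid):
        for c, cell in enumerate(row):
            if cell == ch:
                return (r, c)
    raise ValueError(f"{ch!r} not found in grid")

def plan_path(grid):
    """Breadth-first search over simulated states: expand each reachable
    state by the four movement actions and return the shortest action
    sequence (as a list of cells) from 'S' to 'T', or None if unreachable."""
    start, goal = find(grid, "S"), find(grid, "T")
    frontier = deque([(start, [start])])
    seen = {start}
    while frontier:
        (r, c), path = frontier.popleft()
        if (r, c) == goal:
            return path
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if (0 <= nr < len(grid) and 0 <= nc < len(grid[0])
                    and grid[nr][nc] != "#" and (nr, nc) not in seen):
                seen.add((nr, nc))
                frontier.append(((nr, nc), path + [(nr, nc)]))
    return None  # target unreachable in this simulated layout

path = plan_path(GRID)
print(f"steps to target: {len(path) - 1}")  # → steps to target: 7
```

A real world-model planner differs in every practical respect - learned dynamics instead of a fixed grid, pixels instead of symbols, approximate search instead of exhaustive BFS - but the core loop is the same: simulate the consequences of candidate actions before committing to them in the real world.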