Alibaba ATH Innovation Division
Happy Oyster
Real-time interactive 3D world model
- Re-explorable 3D output instead of one-shot video
- Two distinct authoring modes (Directing + Wandering)
- Native audio-video co-generation in a world model
Alibaba ATH Innovation Division vs Google DeepMind
Happy Oyster (Real-time interactive 3D world model) compared to Veo (Google's most advanced cinematic AI video generation model.).

Happy Oyster and Veo target adjacent jobs but take different approaches. This page compares them side by side on output paradigm, access, capabilities, and positioning — based on vendor-stated claims as of 2026-04-20 / 2026-04-21.
Alibaba ATH Innovation Division
Real-time interactive 3D world model
Google DeepMind
Google's most advanced cinematic AI video generation model.
| Dimension | Happy Oyster | Veo |
|---|---|---|
| Modality | 3d-world, interactive, audio-video | text-to-video, image-to-video, video-to-video |
| Release status | early-access (2026-04-16) | ga (2024-05-14) |
| Capabilities | Directing Mode · Wandering Mode · Native Audio-Video Co-generation · 3D World Generation | Native Audio Generation · Cinematic Camera Control · Image & Video Animation · Fast & Lite Modes |
| Output type | Interactive 3D world (not pre-rendered video) | — |
| Modes | Directing + Wandering | — |
| Audio | Natively co-generated with visuals | — |
| Access | Limited early access (April 2026) | — |
| API availability | Not publicly documented | — |
| Pricing | Not announced | — |
| Game-engine export | Not confirmed | — |
| Max Resolution | — | 4K (Standard/Pro), 1080p & 720p (Fast/Lite) |
| Aspect Ratios | — | 16:9, 9:16 |
| Frame Rate | — | 24 - 30 fps |
| Base Duration | — | 4 to 8 seconds natively, extendable via API and looping |