Mentat curated
Model

Gemini Omni

A Google DeepMind multimodal model that accepts text, images, audio, and video as input and generates short videos with synced audio, which can be edited across a multi-turn conversation.

3 finds · medium confidence
Access: paid
Maker: Google DeepMind
Released: 2026-05-19
Reviewed: 2026-06-09
Source: blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-omni

The lenses

Scored on its own merits — not rolled up from the finds we publish.

Novelty 4
Impact · breadth 4
Impact · depth 4
Actionable 3
Substance 4
Hype 4
