To better understand the emerging capabilities of GAIA-1, it is helpful to see it in action. We can condition GAIA-1 on our actions using video and/or natural language prompts to generate a plausible future. This method allows us to control the scene, as well as the ego vehicle in the simulation, with action conditioning.
In the videos below, you can see how we used natural language to prompt different futures.
Prompt: “It’s night, and we have turned on our headlights.”


GIPHY App Key not set. Please check settings