
my cat didn't tell me she was starting a cult (aka good lord, google omni editing of irl footage is remarkable)
Amar Singh

@ummerrr
generative media PM at Google Flow (@flowbygoogle), previously google shopping. opinions are my own

Was super impressed by the text handling here with #GeminiOmni inside @FlowbyGoogle. The prompt: "add a serif font at center like a title that says Flow. it appears at 00:01 and fades out at 00:05. Lets add a logo to the tennis balls in dark green that say Omni"


I think people don't realize why Gemini Omni is different than other video AIs. It is fully multimodal, so it can edit video natively, too. I took the famous "train" movie from 1896 & made it a bullet train, LEGO, added a time traveler, a centipede, muppets... (see reflections?)





Now in @FlowByGoogle, you’ll be able to use the full potential of Gemini Omni Flash, our model that can create anything from any input, starting with video. With a simple prompt and style reference, Gemini Omni allows you to transform the environment of an existing scene, add visual effects and other elements, all while preserving the original performance. You can even add new characters with custom voices directly to a scene with the new Character feature. #GoogleIO











I uploaded a screenshot of Google Maps with a route drawn on it to Gemini Omni. Then I prompted it to create a first-person view of someone driving a taxi cab along the route in the reference image. Pretty close to the real thing.


