NFK@nfkmobile
Grok Imagine, right now is in my opinion best and fastest ai video generator for the masses. sure, is not perfect, but Rome wasn't built in a day.
Maybe ppl from xai or Elon Musk would look on our posts and suggestions for future improvements.
What is a must (for advanced users into ai video generation, been doing this game since 2022) ..
1. for longer movies , we need an option to organize like a project style, and to be able to add main prompts like the niche of the current movie, the character description and to be able to select a custom seed so we can have consistency of the characters.
2. we have now 6 seconds generation ( saw Elon promised 15 seconds soon).. BUT when we generate long movies, we end up with lots of scenes... what grok needs for the same project of the movie, would be a First Frame -Last frame scene interpolation between the scenes (take last frame from scene one, and first frame from scene 2 and generate a mid scene that would merge scene 1 with scene 2 .. and continue for the other scenes (this could be very easy implemented with some python lines of code , like before spitting final video, select all scenes.. extract frames etc etc etc etc.. simple af, when u have all scenes + the interpolation scenes combine evrything with ffmpeg ).
3.. list is long... and i dind't finished my coffee yet, so here is a grok TEXT to video short movie (coz lol u hit the limit for today).
Prompts i used for each scene are a little more advanced, so i can see what grok is able to do ..
the prompts used are like this (can;t post all due to X limits ) : { "scene_1": { "global_cinematography": "Ultra-realistic Hollywood cyberpunk thriller in the vein of The Matrix (1999) and Blade Runner 2049 (2017), shot on Arri Alexa LF with anamorphic lenses for widescreen 2.39:1 aspect ratio, 24fps for fluid motion, desaturated palette dominated by cool blues, greens, and high-contrast neon reds piercing perpetual smog-choked night. Consistent VFX pipeline: Procedural green code cascades, photorealistic cybernetic augmentations with subsurface scattering, physics-based rain and particle simulations. Lighting paradigm: Volumetric god rays through haze, practical lens flares from holograms, rim lighting on metallic surfaces for depth. Sound integration: Pulsing industrial synth score with digital glitches, rain patter syncing to code interference, metallic echoes underscoring dialogue. Transitions: Seamless glitch wipes or matrix symbol dissolves ensuring narrative continuity, each scene's final beat priming the next for unbroken tension flow. Continuity directive: Scenes chain via lingering elements—rain droplets from prior shots persisting, Nova's silhouette echoing across cuts, HUD overlays threading flashbacks to present, escalating glitch distortions building to climax rupture—maintaining spatial and temporal cohesion in Neo-Tokyo's underbelly.", "shot": { "composition": "Wide aerial drone shot with 35mm wide-angle anamorphic lens on Arri Alexa LF, high dynamic range capturing smog gradients and rain refraction for immersive dystopian establishment, foreground skyscraper edges framing the descent path", "camera_motion": "Controlled descending tilt-push through layered haze, subtle forward momentum building velocity into street-level convergence, priming alley reveal for Scene 2 silhouette emergence" }, "subject": { "description": "Neo-Tokyo's jagged circuit-board skyscrapers thrusting into smog-veiled void, rain-lashed surfaces mirroring erratic neon pulses; faint pedestrian phantoms below as harbingers of oblivious simulation", "wardrobe": "null" }, "scene": { "location": "Shadowed aerial vantage over Neo-Tokyo underbelly, continuity hook from global haze motif", "time_of_day": "Perpetual neon-twilight under storm overcast, syncing with all scenes' eternal dusk", "environment": "Thick smog banks parting reluctantly, acid rain sheets cascading in synchronized sheets with volumetric depth, holographic billboards stuttering in the distance to echo Scene 7 flicker" }, "visual_details": { "action": "Drone pierces urban canopy, unveiling rain-assaulted sprawl where neon bleeds into puddles like corrupted signals, distant alley haze teasing Nova's imminent step-forward in Scene 2", "props": "Circuit-etched tower facades with embedded LED veins flickering erratically, overflowing industrial gutters spewing iridescent chemical runoff, wind-scattered debris hinting at skirmish aftermath", "action_sequence": [ {"0-1s": "High hover frames smog-piercing spires, rain droplets streak lens in slow-mo refraction"}, {"1-2s": "Descent accelerates, haze thins to reveal neon-veined edges glowing faintly blue"}, {"2-3s": "Tilt reveals grid below, rooftops hammered in static-burst impacts syncing to score pulse"}, {"3-4s": "Forward push threads alley corridors, Mandarin signs initial flicker priming Scene 7"}, {"4-5s": "Pedestrians sharpen as wireframe ghosts, AR visors glinting obliviously"}, {"5-6s": "Level to ground haze, Nova's trench silhouette materializes at frame's vanishing point, coat billow lingering into Scene 2 track"} ] }, "cinematography": { "lighting": "Desaturated neon primaries with volumetric god rays slicing haze for ethereal isolation, rain speculars adding dynamic highlights consistent across wet surfaces", "tone": "Oppressive immersion yielding to rebellious spark—global cyber-noir dread laced with glitch anticipation, flowing seamlessly to Nova's personal emergence" } }, "scene_2": { "global_cinematography": "Ultra-realistic Hollywood cyberpunk thriller in the vein of The Matrix (1999) and Blade Runner 2049 (2017), shot on Arri Alexa LF with anamorphic lenses for widescreen 2.39:1 aspect ratio, 24fps for fluid motion, desaturated palette dominated by cool blues, greens, and high-contrast neon reds piercing perpetual smog-choked night. Consistent VFX pipeline: Procedural green code cascades, photorealistic cybernetic augmentations with subsurface scattering, physics-based rain and particle simulations. Lighting paradigm: Volumetric god rays through haze, practical lens flares from holograms, rim lighting on metallic surfaces for depth. Sound integration: Pulsing industrial synth score with digital glitches, rain patter syncing to code interference, metallic echoes underscoring dialogue. Transitions: Seamless glitch wipes or matrix symbol dissolves ensuring narrative continuity, each scene's final beat priming the next for unbroken tension flow. Continuity directive: Scenes chain via lingering elements—rain droplets from prior shots persisting, Nova's silhouette echoing across cuts, HUD overlays threading flashbacks to present, escalating glitch distortions building to climax rupture—maintaining spatial and temporal cohesion in Neo-Tokyo's underbelly.", "shot": { "composition": "Low-angle tracking push with 50mm anamorphic prime on Arri Alexa LF, heroic distortion compressing background alley into claustrophobic funnel, foreground rain blur veiling initial fog for continuity from Scene 1 descent", "camera_motion": "Fluid forward Steadicam arc from lingering Scene 1 haze, subtle left profile tilt to frame Nova against graffiti wall, pulling back slightly to hold environmental depth into Scene 3 orbit" }, "subject": { "description": "Nova, 30s hybrid rebel with scarred synthetic pallor, cropped black hair rain-matted, holographic irises scanning with latent data flickers; sleek titanium limbs rune-etched in dormant blue", "wardrobe": "Sodden black trench coat with frayed hems from Scene 1 debris scatter, high collar shadowing jawline for motif continuity" }, "scene": { "location": "Graffiti-choked alley continuation from Scene 1 street convergence, Neo-Tokyo underbelly", "time_of_day": "Eternal neon-dusk syncing global palette", "environment": "Fog banks rolling from industrial vents as Scene 1 smog extension, wet cobblestones rippling with residual aerial rain patterns" }, "visual_details": { "action": "Nova materializes from Scene 1's terminal haze, striding assertively into sodium glow with metallic glint, coat hem dragging puddles to splash forward—teasing Scene 3 facial trace", "props": "Luminescent 'GLITCH THE SYSTEM' graffiti echoing from Scene 1 signs, overhead hover-traffic hum persisting from aerial hum", "action_sequence": [ {"0-1s": "Fog swirl from Scene 1 yields Nova's silhouette, boot first impacting puddle"}, {"1-2s": "Full stride forward, coat hem trails iridescent wake linking to blood drip in Scene 9"}, {"2-3s": "Titanium forearm catches neon, runes sequential-pulse awakening blue continuity"}, {"3-4s": "Holographic eyes iris-scan, reflecting alley code fragments priming Scene 4 overlay"}, {"4-5s": "Rain beads contour synthetic skin, parting at seams for Scene 3 macro journey"}, {"5-6s": "Profile lean against wall, vapor breath hangs, posture straightening into Scene 5 OTS"} ] }, "cinematography": { "lighting": "Harsh sodium sidelight rimming form per global motif, cool rune fill softening human remnants, prismatic rain refractions tying to Scene 1 aerial streaks", "tone": "Defiant grace in simulated decay—cyber-noir intimacy building personal stakes, camera arc ensuring spatial flow to close-up revelation" } }, etc etc etc up to scene 16. you got the point