Jukka Seppänen

1.8K posts

Jukka Seppänen

@Kijaidesign

3D modeling/printing artist, AI enthusiast, rookie Python coder.

Finland Katılım Ekim 2017

79 Takip Edilen6.5K Takipçiler

Jukka Seppänen@Kijaidesign·5 Mar

@Chris_Scott_Dev It's available currently in my WanVideoWrapper -nodes.

English

686

Chris Scott@greatscottdev·4 Mar

@Kijaidesign Did you publish a custom node for this ?

English

730

Jukka Seppänen@Kijaidesign·4 Mar

WanVideo2video fun, 129 frames with sliding context windows using the 14B model.

English

178

19.9K

Jukka Seppänen@Kijaidesign·4 Mar

@SlipperyGem I2V is bit more complicated, haven't figured out best way to do that yet.

English

173

Brie Wensleydale🧀🐭@SlipperyGem·4 Mar

@Kijaidesign Amazing! I've tried this on the 480p I2V model, 257 frames took 30 min. However, it completely burned. Will you try it on the i2v model?

English

301

Jukka Seppänen@Kijaidesign·4 Mar

WanVideo sliding context window test on the 480p 14B T2V -model, worked surprisingly well but incredibly slow (50 mins on a 5090, 513 frames at 832x480)

English

122

13.8K

Jukka Seppänen@Kijaidesign·4 Mar

@jqlive Absolutely, I have the rudimentary implementation in my HunyuanVideo wrapper nodes in ComfyUI as well, it needs some updating to be as good though.

English

284

JQ@jqlive·4 Mar

@Kijaidesign Couldn’t sliding context window also work with Hunyuan Video? 🤔

English

276

Jukka Seppänen@Kijaidesign·4 Mar

@XCeciiiiiiiiiil This is with my ComfyUI wrapper nodes yeah.

English

151

Jukka Seppänen@Kijaidesign·4 Mar

@6___0 It's due to the technique itself, each step basically does the whole video in chunks, which include the overlaps, so many many passes though the model required. Actual VRAM use is really low since only the 81 frames are processed at once, as that's what the model does best.

English

297

kfant@6___0·4 Mar

@Kijaidesign is the processsing speed due to low mem?

English

326

Jukka Seppänen@Kijaidesign·2 Mar

@melmassadian @el_mejnun No way, my pride wouldn't take that, I've built every PC I've ever had myself since I was 14!

English

˗ˏˋ⚡️ˎˊ-@melmassadian·2 Mar

@Kijaidesign @el_mejnun As in a pre built pc?

English

Jukka Seppänen@Kijaidesign·2 Mar

WanVideo has been lot of fun, currently playing with applying sliding context windowing similar to AnimateDiff, the 1.3B model is especially suitable for this due to it's speed, here's 1025 frames in one go, under 5GB VRAM used, but took about 10 mins on a 5090 with 30 steps.

English

103

13.8K

Jukka Seppänen@Kijaidesign·2 Mar

As an impatient person, I often get obsessed with optimizations, tried implementing TeaCache for WanVideo, failed, but in the process accidentally (maybe) succeeded? Results look promising at least!

English

187

20K

Jukka Seppänen@Kijaidesign·2 Mar

@zazoum1 Currently only single prompt with sliding context windows, but I plan to test multiple ones if I can figure all that out.

English

zazoum(Αθανάσιος Νταβλούρος)@zazoum1·2 Mar

@Kijaidesign Is it like stitching timed prompts?Is it implemented on the repo, and if yes do you have a workflow in the examples to experiment with?

English

Jukka Seppänen@Kijaidesign·2 Mar

@gjohgj The model is supposed to output 16fps, though people seem to debate that. This video was saved with 16fps.

English

203

GG@gjohgj·2 Mar

@Kijaidesign Wait what? 1025 frames? on what framerate?

English

191

Jukka Seppänen@Kijaidesign·2 Mar

@el_mejnun I actually have both, I couldn't justify upgrading but 2nd rig for all this stuff I couldn't resist, so I splurged. It's considerably faster, around ~40% or so with all optimizations. Slightly difficult to use still as you have to compile many things from source. Runs hot though

English

214

Reverent Elusarca@el_mejnun·2 Mar

@Kijaidesign Thanks for your hard work! Also would love to see every benchmark result with 5090. I believe you had 4090 before so how is the performance so far

English

298

Jukka Seppänen@Kijaidesign·5 Ara

@YolaoDude Any text2video model can do some level of video2video as long as the VAE encoder is available, and it already was. The process is basically same as the usual img2img.

English

736

Yolao@YolaoDude·5 Ara

@Kijaidesign So this now can do Video2Video?... i though it was just Text2Video at the moment?

English

891

Jukka Seppänen@Kijaidesign·5 Ara

HunhuyanVideo -model's vid2vid passes the hippo test! Very promising and versatile model, thanks to the cfg distillation and every possible optimization I could think of this clip of 101 frames at 768x432 took about 2 minutes to sample, fitting slightly under 20GB using my nodes.

English

561

66.5K

Jukka Seppänen@Kijaidesign·5 Ara

Couple more examples, just changing prompt.

English

15.6K

Jukka Seppänen@Kijaidesign·5 Ara

@ZenMatAI oh and available here: huggingface.co/Kijai/HunyuanV…

English

380

Jukka Seppänen@Kijaidesign·5 Ara

@ZenMatAI It's the same model, but at fp8 so half the size.

English

Jukka Seppänen@Kijaidesign·20 Kas

@stilfuchs Oh wow that worked surprisingly good, this is on my list of things to explore as well! Thanks for sharing :D

English

490

Ulf@stilfuchs·20 Kas

@Kijaidesign thanks @Kijaidesign for all the work you do, cant wait to get my fingers on this to make some more test! Here is a quick 3dgs test of your footage without cleaning 😋 This stuff will be so much fun in the next years!

English

2.9K

Jukka Seppänen@Kijaidesign·19 Kas

I have finally pushed a bigger update to my CogVideoX ComfyUI wrapper nodes, cleaning up most of the bloat that has been accumulating with all these different models. One of the discoveries I made during this is that the orbit -LoRAs work with the "Fun" -models as well!

English

199

18.2K

Jukka Seppänen@Kijaidesign·20 Kas

@OneStrangeW The workflow, among with many (much simpler) others are included with the nodes. I'm afraid I can't help with huggingface issues as all the models are hosted there.

English

358

Keşfet

@SlipperyGem @jqlive @6___0 @melmassadian @el_mejnun @zazoum1 @gjohgj @elonmusk