Zippy Retweeted
Zippy
101 posts

Zippy
@AlexanderRedde3
Software engineer working on ML image generation techniques & inference. Obsessed with programming and life.
Here and there · Joined March 2022
120 Following · 566 Followers

@hisham_artz You can use the code in the official trt repo to build a trt engine, and run it from there. Mine is different: I took the Engine class from the repo & modified it to behave like a diffusers unet so I can run it in a diffusers pipeline. It's under /demo/Diffusion/ github.com/NVIDIA/TensorRT
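The "behave like a diffusers unet" trick above is just duck typing: a pipeline only needs a callable with the UNet's signature that returns an object carrying a `.sample` attribute. A minimal sketch of that pattern, using a hypothetical `StubTRTEngine` stand-in (the real Engine class in the NVIDIA/TensorRT demo runs an actual compiled engine; here the stub just echoes the latents so the shape of the wrapper is clear):

```python
from types import SimpleNamespace

class StubTRTEngine:
    """Hypothetical stand-in for a TensorRT engine object.
    A real engine would run inference on the GPU; this one
    just passes the latents straight through."""
    def infer(self, feed_dict):
        return {"latent": feed_dict["sample"]}

class TRTUNetWrapper:
    """Duck-types the parts of a diffusers UNet a pipeline touches:
    a call that takes (sample, timestep, encoder_hidden_states)
    and returns an object with a .sample attribute."""
    def __init__(self, engine):
        self.engine = engine
        # pipelines often read config fields like in_channels; stub them
        self.config = SimpleNamespace(in_channels=4)

    def __call__(self, sample, timestep, encoder_hidden_states, **kwargs):
        out = self.engine.infer({
            "sample": sample,
            "timestep": timestep,
            "encoder_hidden_states": encoder_hidden_states,
        })
        return SimpleNamespace(sample=out["latent"])

unet = TRTUNetWrapper(StubTRTEngine())
latents = [[0.1, 0.2], [0.3, 0.4]]
result = unet(latents, 999, encoder_hidden_states=None)
print(result.sample is latents)  # stub echoes latents back unchanged
```

With a real engine, an instance like this can be assigned over `pipe.unet` so the rest of the diffusers pipeline runs unmodified; the exact input/output tensor names depend on how the engine was exported.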

@AlexanderRedde3 I'm trying to combine SDXL with TensorRT, do you have a github i can follow? I'd like to recreate this

Made an fp8 implementation of Flux which gets ~3.5 it/s 1024x1024 on 4090, >2x faster than other methods. github.com/aredden/flux-f… #flux1
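The core idea behind an fp8 inference path like the one above is scaled casting: map each tensor's absolute maximum onto the fp8 dynamic range, run the matmuls in 8-bit, and carry the scale along for dequantization. A toy per-tensor sketch (real fp8 kernels also round to actual e4m3 bits and fuse the scales into the GEMM; this only models the scaling step, and the function names are illustrative, not from the linked repo):

```python
FP8_E4M3_MAX = 448.0  # largest finite value representable in e4m3

def quantize_per_tensor(values):
    """Map a tensor's absmax onto the fp8 range and remember the
    scale needed to recover the original magnitudes."""
    absmax = max(abs(v) for v in values) or 1.0
    scale = absmax / FP8_E4M3_MAX
    quantized = [v / scale for v in values]  # now within [-448, 448]
    return quantized, scale

def dequantize(quantized, scale):
    """Undo the scaling (lossless here; real fp8 adds rounding error)."""
    return [q * scale for q in quantized]

weights = [0.5, -2.0, 3.5, -0.25]
q, s = quantize_per_tensor(weights)
restored = dequantize(q, s)
```

The speedup on a 4090 comes from Ada's hardware fp8 tensor cores plus halved memory traffic, not from the scaling arithmetic itself.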

@oleg__chomp I think you could probably just compile it normally, following the examples from the tensorrt repo. Most of the speedup from tensorrt is a result of its own internal autotuning. The api for using that autotuning remains the same for most models.

@AlexanderRedde3 great result! maybe you can share some tips on connecting taesd to tensorrt pipeline?

@vibeke_udart Also make sure that the scheduler is using "timestep_spacing": "trailing". Beyond that it depends on the actual trt impl you're using: if you're just using the code in the trt repo as I did, you'll want to make a diffusers-compatible unet wrapper which can be used in a normal diffusers pipeline.
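Why "trailing" matters for distilled models: it places the first sampled timestep at the final training timestep (999 for a 1000-step model), which is where turbo-style models were trained to start, whereas "leading" starts lower. A sketch of the two spacings in the spirit of diffusers' `timestep_spacing` option (simplified; the real schedulers also apply a `steps_offset`, which is omitted here):

```python
def scheduler_timesteps(num_train_timesteps, num_inference_steps, spacing):
    """Compute the timesteps a scheduler would visit, highest first."""
    if spacing == "trailing":
        # start at the last training timestep and step backwards
        ratio = num_train_timesteps / num_inference_steps
        return [round(num_train_timesteps - i * ratio) - 1
                for i in range(num_inference_steps)]
    elif spacing == "leading":
        # evenly spaced from 0 upward, then reversed
        ratio = num_train_timesteps // num_inference_steps
        return [i * ratio for i in range(num_inference_steps)][::-1]
    raise ValueError(spacing)

print(scheduler_timesteps(1000, 4, "trailing"))  # [999, 749, 499, 249]
print(scheduler_timesteps(1000, 4, "leading"))   # [750, 500, 250, 0]
```

At 1 to 4 steps the difference between starting at 999 versus 750 is large, which is why the wrong spacing visibly degrades sdxl-turbo output.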

@vibeke_udart Well sdxl-turbo should use a maximum guidance scale of ~1.5, and it looks like this is using more. I've also found that sometimes the diffusers LCMScheduler works best for details, though it can make the image a bit smudgy. Either LCMScheduler or EulerAncestralDiscreteScheduler.
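The arithmetic behind that guidance-scale recommendation is classifier-free guidance: the final noise prediction extrapolates from the unconditional prediction toward the conditional one, so large scales over-amplify the difference. A minimal sketch with toy two-element "predictions":

```python
def cfg_mix(uncond, cond, guidance_scale):
    """Classifier-free guidance: uncond + g * (cond - uncond).
    g = 1 reduces to the pure conditional prediction; distilled
    turbo models expect g near 1.0-1.5, and the usual SDXL values
    around 7-8 over-drive them (the 'burned' look)."""
    return [u + guidance_scale * (c - u) for u, c in zip(uncond, cond)]

uncond = [0.0, 0.0]
cond = [1.0, -1.0]
print(cfg_mix(uncond, cond, 1.0))  # [1.0, -1.0]: pure conditional
print(cfg_mix(uncond, cond, 7.5))  # [7.5, -7.5]: heavily over-amplified
```

In diffusers, `guidance_scale <= 1` additionally disables the unconditional pass entirely, which is why turbo pipelines are often run at exactly 1.0 (actual values here are a sketch, not from the tweet).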

@rudzinskimaciej Yes I am 😊 I have a serious case of programming addiction haha

@AlexanderRedde3 Then double wow for the effect 😁 thx
Are you doing it for yourself?

Finally figured out how to speed up my #sdxlturbo frontend! It's so fast that the only way to show the actual speed is to delete the prompt, since I can't type fast enough 😆 .. built with a next.js frontend & tensorrt backend.

@koltregaskes Essentially, yes. Though the lack of inter-frame coherency even with the same seed would make it seem very jittery, similar to the video.

@AlexanderRedde3 If you could type fast enough you could create animation/video on the go? 🤔

@rudzinskimaciej It's an amd 7950x cpu + 2x4090 machine, though this demo is only using 1 4090. Also thanks! 😊

@AlexanderRedde3 What's the machine it's doing inference on?
Great work 💪

@ZealotDKD Right now there isn't a way. It's just a web ui / api running on my workstation. 🥹

@AlexanderRedde3 How can I use this?

@vibeke_udart There's a demo on their github github.com/NVIDIA/TensorRT. It uses diffusers, so models can be pulled automatically through huggingface, though getting tensorrt to the state it's in in my demo requires a lot of tweaks. Also, I recommend using docker; tensorrt can be kinda finicky.

@JonathanSolder3 Not at the moment, it's just a little app running on my workstation at home. Not sure when or if it'll ever be a part of a public thing.

Made a neat GUI in react for fun & hooked it up to an optimized sdxl-turbo tensorrt api backend I built for image autocomplete! It generates so fast that my browser can't keep up 🥲 - so fun! #sdxlturbo #sdxl #AIart #stablediffusion

@EMostaque Your prediction of real time image generation came true 😄. It took a little while, but we're here!

@ScottieFoxTTV But my workstation is on the other side of my room. It's too far. 🥴
