eisneim
955 posts

eisneim
@eisneim
Gen AI | Web developer | videographer 视频大拍档特里
Shenzhen, China · Joined April 2017
327 Following · 318 Followers
eisneim retweeted

Yet another amazing-looking IC LoRA for LTX 2.3 lands on the scene.
It's V2V and text-prompted. Does editing, removal, replacement, and restyle.
Personally, I would REALLY like to know if it can handle a first frame as a reference. I'm guessing not, though.
civitai.red/models/2553102…
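For anyone who wants to poke at it: a minimal sketch of loading a LoRA like this into an LTX-style diffusers pipeline. Everything named below is a placeholder, and the tweet doesn't show how the source video is passed for V2V, so treat this as a pattern, not the actual usage.

```python
# Minimal sketch, assuming a diffusers-style LTX pipeline with LoRA
# support. Model ID, LoRA filename, and prompt are placeholders, and
# the V2V conditioning entry point isn't shown in the tweet; check
# the Civitai page for actual usage.
import torch
from diffusers import LTXPipeline  # assumed entry point for LTX checkpoints

pipe = LTXPipeline.from_pretrained(
    "Lightricks/LTX-Video",  # placeholder base checkpoint, not "LTX 2.3"
    torch_dtype=torch.bfloat16,
).to("cuda")
pipe.load_lora_weights("ic_v2v_lora.safetensors")  # the LoRA from Civitai

frames = pipe(
    prompt="remove the car and restyle the street as watercolor",
    num_inference_steps=30,
).frames[0]
```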
eisneim retweeted


github.com/eisneim/LTX-2_…
I created a new repo for faster and better image-to-video generation using LTX 2.3 with triple-stage sampling.
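Here is a toy sketch of what triple-stage sampling could look like: three denoising passes, coarse to fine, with a spatial upsample in between. The stage split, step counts, and the denoise() stub are assumptions for illustration, not the repo's actual API.

```python
# A sketch of one plausible reading of "triple-stage sampling": three
# denoising passes at increasing resolution. All numbers and the
# denoise() stub are illustrative assumptions.
import torch
import torch.nn.functional as F

def denoise(latents: torch.Tensor, steps: int, strength: float) -> torch.Tensor:
    """Stand-in for one sampling pass of a video diffusion model."""
    for _ in range(steps):
        latents = latents - strength * torch.randn_like(latents) / steps
    return latents

latents = torch.randn(1, 16, 8, 32, 32)  # (batch, channels, frames, h, w)

# Stage 1: a few coarse steps to settle global layout and motion.
latents = denoise(latents, steps=8, strength=1.0)

# Stage 2: upsample spatially, then refine structure at the new resolution.
latents = F.interpolate(latents, scale_factor=(1, 2, 2), mode="nearest")
latents = denoise(latents, steps=12, strength=0.5)

# Stage 3: a light final pass for texture and detail.
latents = denoise(latents, steps=6, strength=0.2)
```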

github.com/GAIR-NLP/daVin…
New video model: a 15B-parameter, 40-layer Transformer that jointly processes text, video, and audio via self-attention only. No cross-attention, no multi-stream complexity. Achieves an 80.0% win rate vs. Ovi 1.1 and 60.9% vs. LTX 2.3.
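The single-stream design is easy to picture: put all three token streams into one sequence and let plain self-attention mix them. A toy sketch with illustrative sizes, not the model's real dimensions:

```python
# Toy sketch of the single-stream design described above: text, video,
# and audio tokens share one sequence and one self-attention stack,
# with no cross-attention. Dimensions are illustrative only.
import torch
import torch.nn as nn

d_model = 512
block = nn.TransformerEncoderLayer(
    d_model=d_model, nhead=8, batch_first=True, norm_first=True
)

text  = torch.randn(1, 77,  d_model)   # text tokens
video = torch.randn(1, 256, d_model)   # flattened video patch tokens
audio = torch.randn(1, 128, d_model)   # audio frame tokens

joint = torch.cat([text, video, audio], dim=1)  # one shared sequence
out = block(joint)  # plain self-attention mixes all three modalities
```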
eisneim retweeted

1/2 Qwen3.5 is here. The next frontier of Native Multimodal Agents is open. 🚀
We are thrilled to release Qwen3.5-397B-A17B, our flagship open-weight vision-language model. Built for the future of coding, reasoning, and seamless multimodal interaction.
Key Highlights:
Inference Efficiency: A massive 397B total parameters, but only 17B active—delivering flagship power at a fraction of the cost.
Hybrid Architecture: Innovative Gated Delta Networks (Linear Attention) + Sparse MoE for extreme speed (see the toy sketch after this list).
True Multimodality: Exceptional performance across GUI interaction, video comprehension, and agentic workflows.
Global Scale: Qwen3.5 now supports over 200 languages.
Empowering developers and enterprises to build smarter, faster, and more versatile AI agents.
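As flagged in the highlights, the speed claim rests on the gated delta rule. A toy sketch of that update, simplified from the published DeltaNet line of work; this is not Qwen's actual kernel:

```python
# Toy sketch of the gated delta rule behind "Gated Delta Networks":
# linear attention where the state S is a fast-weight memory that is
# decayed by a forget gate, then corrected by a delta-rule write.
import torch

def gated_delta_step(S, q, k, v, alpha: float, beta: float):
    """One token update: forget-gate decay, delta-rule write, linear read."""
    S = alpha * S                        # forget gate shrinks old memory
    err = S @ k - v                      # what S predicts for key k, vs target v
    S = S - beta * torch.outer(err, k)   # delta-rule correction toward v
    return S, S @ q                      # new state and the output token

d_k, d_v = 64, 64
S = torch.zeros(d_v, d_k)
for _ in range(16):  # recurrent scan over a toy sequence
    q = torch.randn(d_k)
    k = torch.nn.functional.normalize(torch.randn(d_k), dim=0)  # unit-norm key
    v = torch.randn(d_v)
    S, o = gated_delta_step(S, q, k, v, alpha=0.95, beta=0.5)
```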
eisneim retweeted

Capybara? A 14B model for T2V, T2I, TV2V, TI2I.
- based on HunyuanVideo1.5;
- byt5-small, Glyph-SDXL-v2, SigLIP;
- 480p-1080p; 16.7 GB model, 5 GB VAE.
Mostly for video editing.
huggingface.co/xgen-universe/…
eisneim retweeted

BitDance: an AI image-generation model open-sourced by ByteDance on the first day of the Lunar New Year.
Its biggest highlight is speed: it uses a highly compressed visual tokenizer to map images into compact binary token sequences, and each diffusion step predicts 64 tokens in parallel. So even at 14B parameters, it generates images very quickly.
Model: huggingface.co/collections/sh…
GitHub: github.com/shallowdream20…
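The decoding pattern described (binary tokens, 64 filled per step) can be sketched in a few lines, with a stub in place of the real 14B model and an illustrative sequence length:

```python
# Toy sketch of the decoding pattern above: an image is a sequence of
# binary tokens, and each step predicts 64 of them in parallel. The
# predict_bits() stub stands in for the real model.
import torch

seq_len, group = 1024, 64
tokens = torch.full((seq_len,), -1)  # -1 marks positions not yet generated

def predict_bits(context: torch.Tensor, n: int) -> torch.Tensor:
    """Stand-in for the model: returns n binary tokens in one shot."""
    return torch.randint(0, 2, (n,))

for start in range(0, seq_len, group):  # 1024 / 64 = only 16 steps total
    tokens[start:start + group] = predict_bits(tokens[:start], group)
```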

eisneim retweetledi

Self-Refining Video Sampling: an inference-time method that uses a video generator as its own refiner to correct physics and motion.
No retraining needed; scores >70% human preference; validated on Wan2.2 & Cosmos.
agwmon.github.io/self-refine-vi…
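The loop itself is simple to sketch: generate once, then repeatedly re-noise the output and denoise it again with the same model. Noise levels, iteration count, and the sample() stub below are assumptions, not the paper's schedule.

```python
# Sketch of the self-refinement idea as described: the same generator
# re-noises its own output and denoises again, pulling motion and
# physics errors back toward the learned distribution.
import torch

def sample(latents: torch.Tensor) -> torch.Tensor:
    """Stand-in for a full denoising run of the video model."""
    return latents * 0.9

video = sample(torch.randn(1, 16, 8, 32, 32))  # initial generation

for sigma in (0.6, 0.4, 0.2):  # decreasing refinement noise levels
    noisy = video + sigma * torch.randn_like(video)  # partial re-noising
    video = sample(noisy)  # the model refines its own earlier output
```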
eisneim retweeted

Scaling scientific world models requires co-designing architectures, training objectives, and numerics. Today, we share the first posts in our series on low-precision pretraining, starting with NVIDIA's NVFP4 recipe for stable 4-bit training.
Part 1: radicalnumerics.ai/blog/nvfp4-par…
Part 2: radicalnumerics.ai/blog/nvfp4-par…
We cover floating point fundamentals, heuristics, custom CUDA kernels, and stabilization techniques. Future entries will cover custom recipes and results on hybrid architectures.
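To make the format concrete, here is a minimal sketch of NVFP4-style block quantization: FP4 (E2M1) values with one scale per 16-element block. The full recipe also stores block scales in FP8 and adds a per-tensor scale; both are folded out here for clarity, so see the linked posts for the real thing.

```python
# Minimal sketch of NVFP4-style block quantization: snap each element
# to the nearest signed FP4 (E2M1) code point after per-block scaling.
import torch

FP4_POS = torch.tensor([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])
FP4_GRID = torch.cat([-FP4_POS.flip(0), FP4_POS])  # signed E2M1 code points

def quantize_nvfp4(x: torch.Tensor, block: int = 16) -> torch.Tensor:
    x = x.reshape(-1, block)
    scale = x.abs().amax(dim=1, keepdim=True) / 6.0  # 6 = max |E2M1| value
    scale = torch.where(scale == 0, torch.ones_like(scale), scale)
    # snap each scaled element to the nearest FP4 code point
    idx = (x / scale).unsqueeze(-1).sub(FP4_GRID).abs().argmin(dim=-1)
    return FP4_GRID[idx] * scale  # dequantized approximation of x

w = torch.randn(4, 32)
w_q = quantize_nvfp4(w).reshape(4, 32)
print((w - w_q).abs().max())  # worst-case block-quantization error
```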

eisneim retweeted

Wan again. Reward Forcing: real-time streaming video generation, 23 FPS with interactive control;
- infinite generation;
- built on Wan2.1-T2V-1.3B
reward-forcing.github.io
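The streaming pattern implied here, sketched with a stub model: generate frames in small chunks conditioned on the tail of what already exists, so the loop can run indefinitely and react to live control input. The gen_chunk() stub and chunk sizes are assumptions, not the project's actual interface.

```python
# Sketch of chunked streaming generation: each chunk is conditioned on
# the most recent frames, so the loop never has to stop.
import torch

def gen_chunk(context: torch.Tensor, control: str) -> torch.Tensor:
    """Stand-in for the streaming model: emits the next 4 frames."""
    return torch.randn(4, 3, 64, 64)

frames = torch.randn(4, 3, 64, 64)  # warm-up frames
for _ in range(8):  # in practice this loop can run indefinitely
    new = gen_chunk(frames[-4:], control="pan left")  # condition on the tail
    frames = torch.cat([frames, new], dim=0)
```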
eisneim retweeted

💡HunyuanVideo1.5 Update: We are now releasing the 480p I2V step-distilled model, which generates videos in 8 or 12 steps (recommended)! On RTX 4090, end-to-end generation time is reduced by 75%, and a single RTX 4090 can generate videos within 75 seconds. The step-distilled model maintains comparable quality to the original model while achieving significant speedup. For even faster generation, you can also try 4 steps (faster speed with slightly reduced quality).
🔗Check out the GitHub Repo: github.com/Tencent-Hunyua…
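In practice, using a step-distilled checkpoint mostly comes down to picking a step count. A hedged sketch: the pipeline class and model ID below are placeholders (diffusers support for 1.5 may differ), so use the names from the Tencent-Hunyuan repo.

```python
# Hedged sketch of running the step-distilled I2V model: the change at
# inference time is just num_inference_steps. Class and model ID are
# assumptions, not confirmed names.
import torch
from diffusers import HunyuanVideoImageToVideoPipeline  # assumed entry point
from diffusers.utils import load_image

pipe = HunyuanVideoImageToVideoPipeline.from_pretrained(
    "tencent/HunyuanVideo-1.5-480p-distilled",  # placeholder model ID
    torch_dtype=torch.bfloat16,
).to("cuda")

video = pipe(
    image=load_image("first_frame.png"),
    prompt="the scene comes to life",
    num_inference_steps=12,  # 8 or 12 recommended; 4 is faster, lower quality
).frames[0]
```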
eisneim retweeted