Eyal 🏐

290 posts

Eyal 🏐 banner
Eyal 🏐

Eyal 🏐

@_Shaamallow

PhD Student @IAatMeta, ex @Polytechnique, @Clipdropapp by @heyjasperai

Online Katılım Aralık 2015
746 Takip Edilen151 Takipçiler
Sabitlenmiş Tweet
Eyal 🏐
Eyal 🏐@_Shaamallow·
🚀 Excited to launch Texture Diffusion, an open-source Blender add-on that brings the power of diffusion models to texture generation! 🎨 ✨ Features: 🎯 Inpainting for precise texture edits 🔗 Support for LoRAs & IP-Adapters 🤝 Seamless integration into Blender’s workflow github.com/Shaamallow/tex… Inspired by Stable ProjectorZ, this is a small project I’m happy to share with the community! Feedback & suggestions are welcome! The add-on is making use of the popular @ComfyUI so that you don't have to install yet another Diffusion Backend, and @cubiq IP-Adapters integration into #ComfyUI. ⭐ Check it out, star it on GitHub, and share it with friends if you like it! #blender #ai #diffusion #opensource
English
4
27
155
8.2K
Eyal 🏐 retweetledi
Basile Terver
Basile Terver@BasileTerv987·
My first PhD paper is out! 🎓 "What Drives Success in Physical Planning with Joint-Embedding Predictive World Models?" tl:dr: JEPA-WMs for robotics: learn dynamics on top of visual encoders, optimize actions towards goal 👇 w/ @JimmyTYYang1, Jean Ponce, @AdrienBardes, @ylecun
English
13
110
918
79.3K
Eyal 🏐 retweetledi
Pierre Fernandez
Pierre Fernandez@pierrefdz·
We're thrilled to share the open source release of Meta Seal, a comprehensive, SOTA, and MIT-licensed suite of AI watermarking research, models, & training code. Learn more in the 🧵 below and explore the artifacts here: facebookresearch.github.io/meta-seal
English
13
29
105
28.1K
Eyal 🏐 retweetledi
Gradium
Gradium@GradiumAI·
Gradium is out of stealth to solve voice. We raised $70M and after only 3 months we’re releasing our transcription and synthesis products to power the next generation of voice AI.
English
77
159
1.1K
423.3K
sway
sway@SwayStar123·
Third paper to do this now lol "LATENT DIFFUSION MODEL WITHOUT VARIATIONAL AUTOENCODER" Using dino features and a residual connection to make a stronger decoder, and diffuse in dino feature space
sway tweet media
English
9
25
293
94.9K
Eyal 🏐 retweetledi
Marc Szafraniec
Marc Szafraniec@MarcSzafraniec·
Proud to have contributed to the ground-breaking DINOv3 by reaching the SOTA on COCO Object Detection, for the first time with a frozen SSL backbone, and a lightweight head ! For me, the debate is closed: SSL is the way!
Marc Szafraniec tweet media
AI at Meta@AIatMeta

Introducing DINOv3: a state-of-the-art computer vision model trained with self-supervised learning (SSL) that produces powerful, high-resolution image features. For the first time, a single frozen vision backbone outperforms specialized solutions on multiple long-standing dense prediction tasks. Learn more about DINOv3 here: ai.meta.com/blog/dinov3-se…

English
4
7
61
5.8K
Eyal 🏐 retweetledi
Max Seitzer
Max Seitzer@maxseitzer·
Introducing DINOv3 🦕🦕🦕 A SotA-enabling vision foundation model, trained with pure self-supervised learning (SSL) at scale. High quality dense features, combining unprecedented semantic and geometric scene understanding. Three reasons why this matters…
Max Seitzer tweet media
English
12
137
1K
134.8K
Eyal 🏐 retweetledi
Federico Baldassarre
Federico Baldassarre@BaldassarreFe·
Say hello to DINOv3 🦖🦖🦖 A major release that raises the bar of self-supervised vision foundation models. With stunning high-resolution dense features, it’s a game-changer for vision tasks! We scaled model size and training data, but here's what makes it special 👇
Federico Baldassarre tweet mediaFederico Baldassarre tweet mediaFederico Baldassarre tweet mediaFederico Baldassarre tweet media
English
40
253
1.9K
223.8K
Eyal 🏐 retweetledi
AI at Meta
AI at Meta@AIatMeta·
Introducing DINOv3: a state-of-the-art computer vision model trained with self-supervised learning (SSL) that produces powerful, high-resolution image features. For the first time, a single frozen vision backbone outperforms specialized solutions on multiple long-standing dense prediction tasks. Learn more about DINOv3 here: ai.meta.com/blog/dinov3-se…
English
346
746
4.5K
896.4K
Eyal 🏐 retweetledi
Grant Sanderson
Grant Sanderson@3blue1brown·
New video on the details of diffusion models: youtu.be/iv-5mZ_9CPY Produced by @welchlabs, this is the first in a small series of 3b1b this summer. I enjoyed providing editorial feedback throughout the last several months, and couldn't be happier with the result.
YouTube video
YouTube
English
40
372
2.8K
379.7K
Eyal 🏐 retweetledi
Nikola Jovanović
Nikola Jovanović@ni_jovanovic·
There's a lot of work now on LLM watermarking. But can we extend this to transformers trained for autoregressive image generation? Yes, but it's not straightforward 🧵(1/10)
GIF
English
5
52
316
48.5K
Eyal 🏐 retweetledi
Federico Baldassarre
Federico Baldassarre@BaldassarreFe·
DINOv2 meets text at #CVPR 2025! Why choose between high-quality DINO features and CLIP-style vision-language alignment? Pick both with dino.txt 🦖📖 We align frozen DINOv2 features with text captions, obtaining both image-level and patch-level alignment at a minimal cost. [1/N]
Federico Baldassarre tweet media
English
4
99
676
54.2K
Eyal 🏐 retweetledi
Neil Zeghidour
Neil Zeghidour@neilzegh·
Thanks @GoogleAI 🙏, I'm proud to see concepts introduced in this paper (RVQ-VAE, quantizer dropout) being still as relevant four years later, and in particular how the RVQ turned out to be a perfect fit for audio language models.
Google AI@GoogleAI

Congratulations to Neil Zeghidour, Alejandro Luebs, Ahmed Omran, Jan Skoglund, and Marco Tagliasacchi for winning the IEEE Best Paper Award for "SoundStream: An End-to-End Natural Audio Codec"! arxiv.org/abs/2107.03312 #SPSAwards #IEEEAwards

English
3
12
184
12.7K
Eyal 🏐
Eyal 🏐@_Shaamallow·
@linoy_tsaban Any ideas about the number of samples for the training ?
English
0
0
0
974
Linoy Tsaban
Linoy Tsaban@linoy_tsaban·
OminiControl Art just dropped🔥 OminiControl Art builds on OminiControl framework and distills the artistic style of GPT-4o into FLUX OminiControl deserves way more attention, it's one of my favorite works to come out lately- it's elgant, straightforward & it just works
Linoy Tsaban tweet media
English
13
88
725
68.1K
EXO Labs
EXO Labs@exolabs·
LLM running on Windows 98 PC 26 year old hardware with Intel Pentium II CPU and 128MB RAM. Uses llama98.c, our custom pure C inference engine based on @karpathy llama2.c Code and DIY guide 👇
English
63
216
1.4K
482.4K
Xenova
Xenova@xenovacom·
Introducing Moonshine Web: real-time speech recognition running 100% locally in your browser! 🚀 Faster and more accurate than Whisper 🔒 Privacy-focused (no data leaves your device) ⚡️ WebGPU accelerated (w/ WASM fallback) 🔥 Powered by ONNX Runtime Web and Transformers.js
English
12
81
471
27.9K
Eyal 🏐 retweetledi
Onur Tasar
Onur Tasar@onurxtasar·
We (@heyjasperai / @clipdropapp research team) are excited to announce the release of our latest research project on fast and controllable shadow generation. Our 1-step diffusion model can create realistic shadows for object images in under a second ⚡️, while giving you precise control over shadow direction, softness, and intensity 🔨. - Project page: gojasper.github.io/controllable-s… - Research paper: arxiv.org/abs/2412.11972 - Demo : shadow-generation-demo.jasper.ai - Public test set on @huggingface : huggingface.co/datasets/jaspe… @CChadebec @benjamin_aubin_ @dh7net
English
12
39
185
14.3K