CubanHodLer🇨🇺

10.6K posts

CubanHodLer🇨🇺

@CubanBTC

If you rewind yourself walking backwards you Go Forward. ⏪⏯⏩

Katılım Temmuz 2013

4.8K Takip Edilen227 Takipçiler

Sabitlenmiş Tweet

CubanHodLer🇨🇺@CubanBTC·2 Nis

This guy blocked me and I don’t really know why, must be my frozen model’s weights 🥶

English

CubanHodLer🇨🇺 retweetledi

Vaibhav (VB) Srivastav@reach_vb·21 Tem

Probably the craziest week in Open Source AI (yet): 1. Mistral (in collaboration with Nvidia) dropped Apache 2.0 licensed NeMo 12B LLM, better than L3 8B and Gemma 2 9B. Models are multilingual with 128K context and a highly efficient tokenizer - tekken. 2. Apple released DCLM 7B - truly open source LLM, based on OpenELM, trained on 2.5T tokens with 63.72 MMLU (better than Mistral 7B) 3. HF shared SmolLM - 135M, 360M, & 1.7B Smol LMs capable of running directly in the browser; they beat Qwen 1.5B, Phi 1.5B and more. Trained on just 650B tokens. 4. Groq put out Llama 3 8B & 70B tool use & function calling model checkpoints - achieves 90.76% accuracy on Berkely Function Calling Leaderboard (BFCL). Excels at API usage & structured data manipulation! 5. Salesforce released xLAM 1.35B & 7B Large Action Models along with 60K instruction fine-tuning dataset. The 7B model scores 88.24% on BFCL & 2B 78.94% 6. Deepseek changed the game with v2 chat 0628 - The best open LLM on LYMSYS arena right now - 236B parameter model with 21B active parameters. It also excels at coding (rank #3) and arena hard problems (rank #3) There's a lot more; Arcee (mergekit) released a series of LLMs, each better than the other, and Numina and HF Numina 72B (based on Qwen 2) and Math datasets, Mixbread with embedding models (english + german) and a lot more! It's fun to see so many releases next week with L3 405B (?) and companions; we might see a shift in the Open LLM landscape! See you next week! What else did I miss? 🤗

English

135

723

107.9K

CubanHodLer🇨🇺 retweetledi

𝚐𝔪𝟾𝚡𝚡𝟾@gm8xx8·10 Mar

COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability arxiv.org/abs/2402.08679

English

367

634.5K

CubanHodLer🇨🇺 retweetledi

SkalskiP@skalskip92·12 Tem

that's all the code you need

English

5.3K

CubanHodLer🇨🇺 retweetledi

SkalskiP@skalskip92·12 Tem

supervision-0.22.0 is coming out today one of the things we release is Mediapipe integration along with default visualizers for face and body pose keypoints link: github.com/roboflow/super…

English

252

147.9K

CubanHodLer🇨🇺 retweetledi

Emil Eifrem@emileifrem·11 Tem

My friend and Neo4j CTO @prathle has written an outstanding blog post that summarizes the recent buzz around GraphRAG, what we've learned from a year of helping users build systems with Knowledge Graphs + LLMs and where we believe the space is going. Thread below. 👇🧵

English

192

1.1K

245.4K

CubanHodLer🇨🇺 retweetledi

Gradio@Gradio·11 Tem

Gradio Multimodal Demo for LLaVA-NeXT-Interleave😍 : semantic-sam.xyzou.net:6123 Models and Datasets are on 🤗 Hub: huggingface.co/collections/lm…

English

7.2K

CubanHodLer🇨🇺 retweetledi

Gradio@Gradio·11 Tem

LLaVA-NeXT-Interleave🔥 - Interleave data format unifies different tasks. - New datasets on 🤗Hub: 1️⃣M4-Instruct, high-quality dataset, 1.1M samples from domains: multi-image, video, 3D & single-image 2️⃣LLaVA-Interleave Bench - Set of tasks to evaluate multi-image capabilities

English

5.8K

CubanHodLer🇨🇺 retweetledi

Gradio@Gradio·11 Tem

🚀Introducing LLaVA-NeXT Interleave: Now AI can understand and reason with multiple images at once - This opens up multi-image scenarios like multi-frame videos, multi-view 3D, and multiple inter-leaved images. - An all round LMM that can understand videos, images, and 3D More⬇️

English

147

27.6K

CubanHodLer🇨🇺 retweetledi

SkalskiP@skalskip92·1 Tem

Florence-2 fine-tuning YouTube tutorial is finally out! (sorry it took me so long) - running the pre-trained model with different vision tasks - configuring LoRA - training and benchmarking - Florence-2 vs. top vision model link: youtube.com/watch?v=i3KjYg… ↓ key takeaways

YouTube

English

120

885

54.5K

CubanHodLer🇨🇺 retweetledi

Gradio@Gradio·29 Haz

🤯DiffIR2VR-Zero: Zero-shot video restoration to high-resolution using pre-trained image restoration diffusion models. - Framework can handle video denoising and up to 8x super-resolution - outperforms trained models in generalizing across diverse datasets and extreme degradations - Compatible with every 2D restoration model

English

179

20.3K

CubanHodLer🇨🇺 retweetledi

MrNeRF@janusch_patas·18 Haz

Code dropped: github.com/ubc-vision/3dg… I've been waiting for far too long to try this!

Andrea Tagliasacchi 🇨🇦@taiyasaki

📢📢📢 Today I'll be giving a talk at the #CVPR2024 workshop on "Learning 3D with Multi-View Supervision" See you at 16:20 in Summit 331 abdullahamdi.com/3dmv2024 Get a preview of the upcoming "3D Gaussian Splatting as Markov Chain Monte Carlo" ubc-vision.github.io/3dgs-mcmc

English

145

40.3K

CubanHodLer🇨🇺 retweetledi

rsasaki0109@rsasaki0109·30 Haz

OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition github.com/SCNU-RISLAB/Ov…

English

159

9.7K

CubanHodLer🇨🇺 retweetledi

rsasaki0109@rsasaki0109·30 Haz

MeshVPR: Citywide Visual Place Recognition Using 3D Meshes github.com/gmberton/MeshV…

GIF

English

128

7.7K

CubanHodLer🇨🇺 retweetledi

Ethan Mollick@emollick·25 Haz

Weirdly, the ways things are going, only the LLMs will remember what we wrote, imperfectly.

English

230

16K

CubanHodLer🇨🇺 retweetledi

Ethan Mollick@emollick·25 Haz

The Internet is rotting. The destruction of MTV news is the latest. Over 25% of the links embedded in New York Tunes articles just seven years ago & 60% of older links, are now broken. It isn’t good that the only people preserving decades of digital data is the Internet Archive.

English

367

1.4K

106.4K

CubanHodLer🇨🇺 retweetledi

elvis@omarsar0·25 Haz

Nice survey on LLM-based synthetic data generation, curation, and evaluation. If you are working with LLMs, a lot of effort is going into these areas so it's important to get familiar with concepts. This survey is a good starting point.

English

112

478

38.3K

CubanHodLer🇨🇺 retweetledi

Gradio@Gradio·25 Haz

💪Demo by @AnnioDance: huggingface.co/spaces/vilarin… Model on 🤗 ExVideo: huggingface.co/ECNU-CILab/ExV… Diffutoon: huggingface.co/camenduru/Diff… 🎥Create engaging experiences with Gradio's Video Component. Stream, analyze, and interact with video data seamlessly🌟 Visit: Gradio.dev

English

12.4K

CubanHodLer🇨🇺 retweetledi

Gradio@Gradio·25 Haz

🚨Epic News: Demos for ExVideo and Diffutoon are out! 🤯 Play with the advanced parameters for greater control -- this might be the best open model and demo for text2video/image2video out there!

English

121

17.4K

CubanHodLer🇨🇺@CubanBTC·27 Haz

@tradernewsai @emollick The “financial press” manipulates on their own 😂

English

tradernews.ai@tradernewsai·26 Haz

@emollick I was thinking the other day, and I’m shocked we don’t see manipulation of the financial press with this stuff. A voice-cloned phone call to a WSJ reporter after you buy or short a stock, etc. It feels like there are so many unknown unknowns out there.

English

703

Ethan Mollick@emollick·26 Haz

Not enough IT security people know how much AI has compromised traditional approaches to security. They are perfect spear phishing machines, solid autonomous hacking systems, and great at voice & video cloning with just a few seconds of audio. And this is just the public stuff.

English

157

618

54.2K

CubanHodLer🇨🇺 retweetledi

Xenova@xenovacom·26 Haz

Florence-2, the new vision foundation model by Microsoft, can now run 100% locally in your browser on WebGPU, thanks to Transformers.js! 🤗🤯 It supports tasks like image captioning, optical character recognition, object detection, and many more! 😍 WOW! Demo (+ source code) 👇

English

181

952

88.7K

Keşfet

@prathle @AnnioDance @tradernewsai @emollick @elonmusk @BarackObama @taylorswift13 @cristiano