Ann Huang

43 posts

@AnnInTweetD

I like food, and some other things too. Making ML magic 🪄 at @huggingface.

Seattle · Joined March 2023
75 Following · 112 Followers
Ann Huang @AnnInTweetD
@huggingface In our benchmarks, we found that using content-defined chunking (CDC) to store iterative model and dataset versions led to transfer speedups of ~2x. We'd love to learn about more real-world examples to see how we perform!
Replies: 1 · Reposts: 0 · Likes: 8 · Views: 274
Ann Huang @AnnInTweetD
We're turning @huggingface Hub's files into content-defined chunks to speed up your workflows! ⚡️ This means:
- 🧠 We store your file as deduplicated chunks
- ⏩ You only upload changed chunks when iterating!
- 🚀 Pulling changes? Only download changed chunks!
Replies: 3 · Reposts: 16 · Likes: 53 · Views: 17.3K
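Editor's sketch: the chunking idea in the post above can be illustrated with a few lines of Python. This is a minimal sketch, not the Hub's actual implementation. The gear-style rolling hash, the size limits, and the names `chunk`, `chunk_ids`, and `chunks_to_upload` are all assumptions made for illustration.

```python
import hashlib
import random

# Hypothetical parameters, chosen only for illustration.
MIN_SIZE = 256            # never cut a chunk shorter than this
MAX_SIZE = 8192           # always cut once a chunk reaches this size
MASK = (1 << 11) - 1      # cut when the low 11 bits of the hash are zero

# Fixed pseudo-random table mapping each byte value to a 32-bit number.
_rng = random.Random(0)
GEAR = [_rng.getrandbits(32) for _ in range(256)]


def chunk(data: bytes) -> list:
    """Split data at boundaries chosen by content, not by fixed offsets."""
    chunks, start, h = [], 0, 0
    for i, byte in enumerate(data):
        # Rolling hash: older bytes shift out of the low bits over time.
        h = ((h << 1) + GEAR[byte]) & 0xFFFFFFFF
        size = i - start + 1
        if (size >= MIN_SIZE and (h & MASK) == 0) or size >= MAX_SIZE:
            chunks.append(data[start:i + 1])
            start, h = i + 1, 0
    if start < len(data):
        chunks.append(data[start:])  # trailing partial chunk
    return chunks


def chunk_ids(data: bytes) -> list:
    """Identify each chunk by the SHA-256 of its bytes (enables dedup)."""
    return [hashlib.sha256(c).hexdigest() for c in chunk(data)]


def chunks_to_upload(old: bytes, new: bytes) -> list:
    """Return ids of chunks in `new` that are not already stored for `old`."""
    known = set(chunk_ids(old))
    return [cid for cid in chunk_ids(new) if cid not in known]


if __name__ == "__main__":
    rng = random.Random(1)
    v1 = bytes(rng.getrandbits(8) for _ in range(64 * 1024))
    v2 = v1[:60000] + b"a small edit" + v1[60000:]
    print(len(chunks_to_upload(v1, v2)), "of", len(chunk_ids(v2)),
          "chunks need uploading")
```

Because boundaries depend on the bytes themselves, an insertion only disturbs the chunks near the edit; chunks before it keep the same ids and do not need to be sent again. Fixed-size blocks would instead shift every boundary after the insertion point.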
Ann Huang retweeted
Cyril Zakka, MD @cyrilzakka
Super excited to introduce Halo: A beginner's guide to DIY health tracking with wearables! 🤗✨ Using an $11 smart ring, I'll show you how to build your own private health monitoring app. From basic metrics to advanced features like:
- Activity tracking
- Heart rate monitoring
- Sleep analysis
- and more!
[image]
Replies: 28 · Reposts: 75 · Likes: 636 · Views: 88.5K
Ann Huang retweeted
Vaibhav (VB) Srivastav @reach_vb
If you were to fix one thing on the Hugging Face Hub, what would it be? What's your biggest gripe with the Hub? Let's fix it; the more, the merrier.
Replies: 42 · Reposts: 9 · Likes: 60 · Views: 16.2K
Ann Huang retweeted
Caleb @calebfahlgren
The @huggingface SQL Console now has Embeds!
🔗 Nice URL to Share / Save your Query Results
🖼️ Embed Results into Web Pages via IFrame
In this example, I use handy DuckDB regex functions to find the Code Feedback conversations with the most markdown code blocks
Replies: 2 · Reposts: 7 · Likes: 20 · Views: 3.4K
Ann Huang retweeted
Daniel van Strien @vanstriendaniel
How do you release an impactful dataset on the @huggingface Hub? We're enhancing how we track dataset downloads on the Hub, so I wanted to share some common themes I've noticed for datasets with high downloads. 🧵
Replies: 1 · Reposts: 9 · Likes: 25 · Views: 9.7K
Ann Huang retweeted
Yacine Jernite @YJernite
Glad to see the @OpenSourceOrg release their OSAI definition process after an extensive collaborative process, and especially happy to see the role of training data enshrined! Head over to the OSI HF org page if you want to discuss the definition on @huggingface 🤗 1/2🧵
[image]
Replies: 1 · Reposts: 7 · Likes: 22 · Views: 7.4K
Eugene Vinitsky 🦋 @EugeneVinitsky
We have 100GB of data that we need to make publicly accessible. What do folks use? S3 seems wildly expensive per download
Replies: 308 · Reposts: 40 · Likes: 2K · Views: 448.9K
Ann Huang retweeted
Gradio @Gradio
🆕 𝚜𝚊𝚏𝚎𝚑𝚝𝚝𝚙𝚡: a new open-source library from the Gradio team 🆕 This library is a product of our collaboration with @TrailOfBits and allows you to make asynchronous GET requests while avoiding Server Side Request Forgery. A 🧵 on why this is important!
[image]
Replies: 2 · Reposts: 8 · Likes: 41 · Views: 19.7K
Ann Huang retweeted
Cyril Zakka, MD @cyrilzakka
The source code for HFChat macOS 🤗 is now fully open source and accepting PRs! Looking forward to seeing what folks will build. You'll also find some hidden features that never made it to the release: github.com/huggingface/ch…
[image]
Replies: 12 · Reposts: 40 · Likes: 214 · Views: 69K
Ann Huang retweeted
Martin Görner @martin_gorner
Did you know that you can load the newest checkpoints (like Llama 3.2) into Keras directly from the original HuggingFace release (safetensors)? I tried - and lived to tell the tale: huggingface.co/blog/keras-lla…
[image]
Replies: 5 · Reposts: 18 · Likes: 93 · Views: 19.2K
Ann Huang retweeted
Vaibhav (VB) Srivastav @reach_vb
What a great day for Open Science! @AIatMeta released models, datasets, and code for many of its research artefacts! 🔥
> Meta Segment Anything Model 2.1: An updated checkpoint with improved results on visually similar objects, small objects and occlusion handling. A new developer suite will be added to make it easier for developers to build with SAM 2. Model checkpoints: huggingface.co/collections/re…
> Layer Skip: Inference code and fine-tuned checkpoints demonstrating a new method for enhancing LLM performance. Model checkpoints: huggingface.co/collections/fa…
> SALSA: New code enables researchers to benchmark AI-based attacks to validate security for post-quantum cryptography. Repo: github.com/facebookresear…
> Meta Lingua: A lightweight and self-contained codebase designed to train language models at scale. Repo: github.com/facebookresear…
> Meta Open Materials: New open source models and the largest dataset to accelerate AI-driven discovery of new inorganic materials. Model checkpoints: huggingface.co/fairchem/OMAT24
> MEXMA: A new research paper and code for our novel pre-trained cross-lingual sentence encoder covering 80 languages. Model checkpoint: huggingface.co/facebook/MEXMA
> Self-Taught Evaluator: a new method for generating synthetic preference data to train reward models without relying on human annotations. Model checkpoint: huggingface.co/facebook/Self-…
> Meta Spirit LM: An open-source language model for seamless speech and text integration. Repo: github.com/facebookresear…
[image]
Replies: 1 · Reposts: 36 · Likes: 167 · Views: 18.7K
Ann Huang retweeted
clem 🤗 @ClementDelangue
👀👀👀
[image]
Replies: 6 · Reposts: 6 · Likes: 83 · Views: 16.2K
Ann Huang @AnnInTweetD
@huggingface Views like this help us understand real-world access patterns so we can architect a more efficient, geo-distributed system for the Hub's storage backend. What else should we be looking at?
Replies: 1 · Reposts: 0 · Likes: 3 · Views: 124
Ann Huang @AnnInTweetD
Sweet visualization of S3 PUT requests to @HuggingFace Hub over a 24-hour period, showing upload density across the globe. 🌏
[GIF]
Replies: 1 · Reposts: 10 · Likes: 35 · Views: 5.7K
Ann Huang @AnnInTweetD
Did you know that @Huggingface Hub holds over 29 PB of Git LFS files across datasets, models, and spaces? 📈 That's the equivalent of 64 @CommonCrawl downloads - and it's growing every day. So what's inside? 🧵
[image]
Replies: 1 · Reposts: 1 · Likes: 7 · Views: 472