XetHub

62 posts

XetHub banner
XetHub

XetHub

@xetdata

XetHub enables ML teams to collaborate effectively on massive datasets. Now part of @HuggingFace 🤗!

加入时间 Mart 2022
13 关注422 粉丝
XetHub 已转推
Ann Huang
Ann Huang@AnnInTweetD·
We're turning @huggingface Hub's files into content-defined chunks to speed up your workflows!⚡️ This means: - 🧠We store your file as deduplicated chunks - ⏩ You only upload changed chunks when iterating! - 🚀 Pulling changes? Only download changed chunks!
English
3
16
53
17.3K
XetHub 已转推
Ann Huang
Ann Huang@AnnInTweetD·
Sweet visualization of S3 PUT requests to @HuggingFace Hub over a 24-hour period, showing upload density across the globe. 🌏
GIF
English
1
10
35
5.7K
XetHub 已转推
Ann Huang
Ann Huang@AnnInTweetD·
Did you know that @Huggingface Hub holds over 29 PB of Git LFS files across datasets, models, and spaces? 📈 That's the equivalent of 64 @CommonCrawl downloads - and it's growing every day. So what's inside? 🧵
Ann Huang tweet media
English
1
1
7
472
XetHub 已转推
Gradio
Gradio@Gradio·
Welcome, Gradio 5 👋 We’ve been hard at work over the past few months, and we are excited to announce today the stable release of Gradio 5! With more than 2 million users every month (and >470,000 apps on Hugging Face Spaces), Gradio has become the default way to build, share, and use machine learning applications. Our goal with Gradio 5 was to address the most common pain points that we’ve heard from Gradio developers about taking these apps to production. Here are 5 new things in Gradio 5 (including a new way to build Gradio apps without writing any code!)
Gradio tweet media
English
23
106
558
211.6K
XetHub 已转推
Ann Huang
Ann Huang@AnnInTweetD·
Deduplicating evolving datasets is a no-brainer - store differences instead of full versions of each one. But format matters! Here's how appends, modifications, and deletes on @ApacheParquet files (~20% of what's stored on @huggingface Hub) deduplicate. 🧵
Ann Huang tweet media
English
2
12
31
12.9K
XetHub 已转推
Ann Huang
Ann Huang@AnnInTweetD·
First demo of a @xetdata-backed roundtrip to/from @huggingface servers = first steps to a faster, more scalable HF Hub! ⚡️🤗
English
1
3
17
9.3K
XetHub
XetHub@xetdata·
@huggingface 🤔 What features do you want us to bring to Hugging Face Hub? We'd love to hear your ideas!
English
0
0
0
81
XetHub
XetHub@xetdata·
Almost three weeks into our @huggingface experience and loving it! Digging into a POC of replacing Git LFS and exploring developer experiences = our happy place 🤗
XetHub tweet media
English
1
0
3
224
XetHub
XetHub@xetdata·
✨ Now you can, with XetHub custom views. Custom views lets you create interactive views that live alongside the files in your repository, bringing instant context to opaque binary files whether you're browsing or comparing files.
English
1
0
1
788
XetHub
XetHub@xetdata·
🧞‍♂️ Ever wished you could share an interactive view of your work with a collaborator, instead of static screenshots and long descriptions? 🤓 Wouldn't it be nice to quickly review differences on binary files in the browser like you do with code?
English
1
0
1
1K
XetHub
XetHub@xetdata·
So excited to be in Toronto for a wonderful @TMLS_TO weekend! Come join us!
XetHub tweet media
English
1
0
2
397
XetHub
XetHub@xetdata·
- 50% savings in average upload times vs nearest competitor - Better average download times vs competitors - 50% savings in final storage used vs nearest competitor
English
1
0
0
205
XetHub
XetHub@xetdata·
🤔 How does your versioning tool stack up to the competition? We benchmarked the iterative development experience of three real-world modern workflows (🎮 game development, 🧬 biotech, and 🧑‍🔬 research) on S3, DVC, Git LFS, and XetHub. The results?
English
1
0
0
254