Wauplin

323 posts

@Wauplin

Doing things at Hugging Face. Maintainer of 🤗/huggingface_hub.

Joined August 2015
0 Following, 1.2K Followers
Wauplin
Wauplin@Wauplin·
Storage Buckets gave the Hub a mutable storage layer. HF-MOUNT makes that storage feel local. Mount models, datasets, and Buckets as a filesystem and work with them through normal paths. Buckets are read-write; models and datasets are read-only.
Wauplin
Wauplin@Wauplin·
@sudoingX My previous comment is still valid, but you were right there was a bug in hf-xet that could lead to deadlocks while streaming large files from the Hub. It is now resolved and shipped: x.com/Wauplin/status…
Wauplin@Wauplin

If you transfer large files from the @huggingface Hub, upgrade now.
huggingface_hub v1.7.0 / hf-xet 1.4.2 fixes:
- deadlocks/stalls on large file downloads
- faster uploads by cutting redundant SHA-256 computation
- extended timeout for shard uploads (was silently failing)

Sudo su
Sudo su@sudoingX·
hey if you're downloading large GGUFs from huggingface and the download keeps crashing with "File Reconstruction Error: Background writer channel closed" it's not your connection. it's HuggingFace's new Xet storage backend. it can't reliably reconstruct files over ~15GB during download. known bug, still open. the disable flag doesn't work either. HF_HUB_DISABLE_XET=1 is supposed to force legacy downloads but it doesn't. confirmed broken (github issue #3266). the actual fix: pip uninstall hf-xet removes the Xet client entirely. huggingface cli falls back to legacy HTTP. downloads complete. no crashes. found this trying to download a 48GB model on a production. burned 45 minutes before tracing it. save yourself the time.
Wauplin
Wauplin@Wauplin·
If you transfer large files from the @huggingface Hub, upgrade now.
huggingface_hub v1.7.0 / hf-xet 1.4.2 fixes:
- deadlocks/stalls on large file downloads
- faster uploads by cutting redundant SHA-256 computation
- extended timeout for shard uploads (was silently failing)
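To check whether an environment already meets the versions named in the post, a minimal standard-library sketch could look like the following. The helper names (`parse_version`, `needs_upgrade`, `report`) are hypothetical. The parsing is deliberately simple: it assumes dotted numeric versions and treats a pre-release such as `1.7.0rc1` as equal to `1.7.0`.

```python
import re
from importlib.metadata import PackageNotFoundError, version

# Minimum versions taken from the post above.
MINIMUM = {"huggingface_hub": "1.7.0", "hf-xet": "1.4.2"}


def parse_version(text: str) -> tuple[int, ...]:
    """Turn '1.7.0' into (1, 7, 0), keeping only leading digits of each part."""
    parts = []
    for part in text.split("."):
        match = re.match(r"\d+", part)
        if match is None:  # e.g. a 'dev0' suffix: stop at the first non-numeric part
            break
        parts.append(int(match.group()))
    return tuple(parts)


def needs_upgrade(installed: str, minimum: str) -> bool:
    """Return True when the installed version is older than the minimum."""
    return parse_version(installed) < parse_version(minimum)


def report() -> dict[str, str]:
    """Map each package to 'ok', 'upgrade', or 'not installed'."""
    status = {}
    for name, minimum in MINIMUM.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            status[name] = "not installed"
            continue
        status[name] = "upgrade" if needs_upgrade(installed, minimum) else "ok"
    return status


if __name__ == "__main__":
    for name, state in report().items():
        print(f"{name}: {state}")
```

Comparing integer tuples rather than raw strings matters here: as strings, `"1.10.0"` would sort before `"1.7.0"`.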
mrfakename
mrfakename@realmrfakename·
About to try to upload 100TB to try out HF Buckets... 👀
Wauplin retweeted
Julien Chaumond
Julien Chaumond@julien_c·
get PRO on @huggingface and instantly 10x your storage to 1 TB private + 10 TB public ...for $9 a month 😮 a deal this good should be illegal
Wauplin retweeted
célina
célina@hanouticelina·
It's 2026. Build. For. Agents.
We just launched Hugging Face Storage Buckets, a faster storage layer for AI workflows, 3x cheaper than S3 with no Git overhead.
Buckets are basically built for workflows that generate artifacts: checkpoints, logs, traces, and other intermediate outputs produced by training jobs and pipelines.
They come with a CLI designed with coding agents in mind. Agents can easily explore commands, examples, and documentation via --help (there's a Skill for that as well), then sync directories, inspect bucket contents, move files around, and script storage operations.
The CLI interface makes Buckets easy for agents to use: they can chain commands, automate storage tasks, and integrate Buckets directly into scripts, pipelines, and agent workflows.
Wauplin
Wauplin@Wauplin·
@sudoingX I'd encourage you to try this out. We've made a lot of patches in the past weeks to ensure downloads are fast and reliable, and even more for large downloads. If you encounter any new issue let me know!
Wauplin
Wauplin@Wauplin·
@sudoingX Hey @sudoingX just as a heads-up:
- HF_HUB_DISABLE_XET=1 does work. You just have to set it before launching the script (as explained in the issue you've linked)
- huggingface-cli is pretty outdated. Much better to update to latest version and use `hf` CLI huggingface.co/docs/huggingfa…
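The first point is about timing: the variable has to be in the process environment before the script starts. One way to guarantee that from Python is to launch the download script as a child process with the variable already set, sketched below under stated assumptions. The function name `run_with_xet_disabled` and the stand-in child command are hypothetical; only `HF_HUB_DISABLE_XET=1` comes from the post.

```python
import os
import subprocess
import sys


def run_with_xet_disabled(command: list[str]) -> subprocess.CompletedProcess:
    """Run `command` with HF_HUB_DISABLE_XET=1 set before the child starts."""
    env = dict(os.environ)           # copy, so the parent environment is untouched
    env["HF_HUB_DISABLE_XET"] = "1"  # flag name taken from the post above
    return subprocess.run(
        command, env=env, capture_output=True, text=True, check=True
    )


if __name__ == "__main__":
    # Stand-in for a real download script: the child only prints what it sees.
    child = [
        sys.executable,
        "-c",
        "import os; print(os.environ.get('HF_HUB_DISABLE_XET'))",
    ]
    print(run_with_xet_disabled(child).stdout.strip())
```

From a shell, the equivalent is to prefix the command, for example `HF_HUB_DISABLE_XET=1 python download.py` (script name hypothetical).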
Chain Alpha
Chain Alpha@Chain_AlphaX·
@Wauplin Hmm, so like… storage for AI stuff? Interesting.
Wauplin
Wauplin@Wauplin·
1/6 Today we’re introducing Storage Buckets on the Hugging Face Hub. They’re built for mutable, non-versioned ML artifacts: checkpoints, optimizer states, processed shards, logs, traces, eval outputs, and agent-generated files.
Wauplin
Wauplin@Wauplin·
6/6 Mental model:
Buckets = mutable working data
Repos = finalized, versioned deliverables
That gives ML teams a cleaner separation between storage and publication, while staying in one Hub-native workflow. huggingface.co/blog/storage-b…
Wauplin
Wauplin@Wauplin·
5/6 Typical use cases:
- continuously writing checkpoints during training
- storing intermediate outputs in data pipelines
- keeping agent traces, memory, and shared artifacts remote
- separating working data from published assets
Wauplin retweeted
Caleb
Caleb@calebfahlgren·
NEW in huggingface_hub v1.6.0
2 new datasets commands in the hf CLI:
📁 `parquet` - list & fetch parquet files for any dataset
🔍 `sql` - run SQL queries on datasets with DuckDB
Let your agent query 900k+ datasets from the CLI
Wauplin
Wauplin@Wauplin·
More from huggingface_hub v1.5.0 (beyond Buckets 🪣): Spaces now supports hot-reload (live edit of Gradio files). And the CLI got an extension system à la `gh extension`, plus native support for AI agents (@opencode, @cursor_ai). Full notes: github.com/huggingface/hu…
Wauplin@Wauplin

huggingface_hub v1.5.0 just dropped! The highlight: Buckets. Think S3, but native to the Hub. No git history. Just fast, chunk-deduplicated object storage.
hf buckets sync ./outputs hf://buckets/me/my-checkpoints
And that's it. Currently in beta preview. DM me if interested!
