Sylvain Lesage

18 posts

Sylvain Lesage

@severo_dev

Dataviz freelance developer. Part-time 🤗 @huggingface (dataset viewer) GitHub : https://t.co/IBRvyeYaGI

شامل ہوئے Kasım 2013

43 فالونگ61 فالوورز

پن کیا گیا ٹویٹ

Sylvain Lesage@severo_dev·21 Nis

As 📷 xet-team infrastructure begins backing hundreds of repositories on the Hugging Face Hub, we’re getting to put on our researcher hats and peer into the bytes. 👀 🤓 IMO, one of the most interesting ideas Xet storage introduces is a globally shared store of data.

English

1.4K

Sylvain Lesage@severo_dev·22 Nis

Give me some time 🤗🤗

English

505

Sylvain Lesage@severo_dev·22 Nis

Received a lot of information, decided to continue building practical AI programs Dev 80% supply has been locked app.streamflow.finance/contract/solan…

Sylvain Lesage@severo_dev

Yo, I just dropped an experimental memecoin on Pump. Should I keep building an AI app for it?🤗🤗 pump.fun/coin/2HjpyavGa…

English

1.4K

Sylvain Lesage@severo_dev·22 Nis

Yo, I just dropped an experimental memecoin on Pump. Should I keep building an AI app for it?🤗🤗 pump.fun/coin/2HjpyavGa…

English

5.1K

Sylvain Lesage@severo_dev·22 Nis

I 🙏 at the altar of stamina. "(Stamina is) the ability to chip away at goals despite a lack of visible progress. To hold focus and presence in a world incentivized to distract you. To stay patient. To be on time. To push through difficult material. To follow instructions or proceed without them."

English

935

Sylvain Lesage@severo_dev·22 Nis

See that purple banner on the Llama 4 models? It's Xet storage, and this is actually huge for anyone building with AI models. Real numbers: ~25% deduplication on Llama 4 models, hitting ~40% for finetunes

English

535

Sylvain Lesage@severo_dev·22 Nis

The Lovelace 2.0 Test of Artificial Creativity and Intelligence “tell a story in which a boy falls in love with a girl, aliens abduct the boy, and the girl saves the world with the help of a talking cat.” Change that 🐱 to a 🐶 and I'd read that story any day. arxiv.org/abs/1410.6142

English

300

Sylvain Lesage@severo_dev·22 Nis

Online demo of hyparquet: a parser for apache parquet files. Uses hightable for high performance windowed table viewing.huggingface.co/spaces/severo/…

English

232

Sylvain Lesage ری ٹویٹ کیا

Ihor Stepanov@ihor_step·17 Nis

I just finished my small experiments comparing different encoder models on retrieval tasks. The goal was to check whether MLM is better than RTD for these tasks. I compared Electra's small models, both generator and discriminator, that have the same size. Additionally, it was tested DeBERTa v1, which was pre-trained with MLM and DeBERTa v3, which was pre-trained with RTD on a two times larger dataset. As a baseline ModernBERT was evaluated as well. Models were fine-tuned on 500k examples from the MS-MARCO dataset (huggingface.co/datasets/sente…). For benchmarking, the NanoBEIR evaluator was used. You can see the average ndcg@10 plotted below. It's clearer that MLM-trained models produce better discriminative features, however, more detailed experiments are needed for more accurate conclusions. @antoine_chaffin @tomaarsen

English

14.7K

Sylvain Lesage@severo_dev·22 Nis

@ClementDelangue Absolutely, the transformer architecture has become a cornerstone for model definition

English

103

Sylvain Lesage ری ٹویٹ کیا

clem 🤗@ClementDelangue·22 Nis

So cool to see transformers becoming the source of truth for model definition & collaborating with wonderful partners like vLLM to have these models run everywhere the fastest! As a model builder, it means that you integrate with Hugging Face & instantly get hundreds of integrations out of the box. Time to accelerate AI, one integration at a time!

English

154

20.6K

Sylvain Lesage ری ٹویٹ کیا

jade@jadechoghari·21 Nis

Robotics simulation is how we teach robots to act smart—before they ever touch the real world. It’s physical AI: simulating Newtonian physics so robots can grip, move, and learn. A new simulator is coming to @huggingface @LeRobotHF 🤗. Stay tuned 👀😉

English

180

35.3K

Sylvain Lesage@severo_dev·21 Nis

Come find the many BERT islands. Or see how datasets relate in practice, not just in theory. See how libraries or tasks can tie repositories together. You can play around with node size using storage/likes/downloads too. The result is a super fun visualization from @saba9 and @znation that I’ve already lost way too much time to. I'm excited to see how the networks grow as we add more repositories! xet-team/repo-graph

English

138

Sylvain Lesage@severo_dev·21 Nis

Because of this, different repositories can share bytes we store. That opens up something cool - we can draw a graph of which repos actually share data at the chunk level, where: - Nodes = repositories - Edges = shared chunks - Edge thickness = how much they overlap

English

175

Sylvain Lesage@severo_dev·21 Nis

English

1.4K

Sylvain Lesage ری ٹویٹ کیا

AshutoshShrivastava@ai_for_success·21 Nis

🚨 SkyReels just launched! The world’s first open-source video generation platform supporting unlimited duration 🔥 All-in-one creator toolkit: - Consistent high-quality video (LoRA ready) - Fast gen, amazing output. - Amazing facial expressions . Plus text-to-film agent handles everything: script, character, storyboards, full AV gen, auto-edit . it's Wild! Step by step tutorial 👇

English

119

695

81K

Sylvain Lesage@severo_dev·21 Nis

Need to convert CSV to Parquet? Use chatdb.ai/tools/csv-to-p…. It does the job instantly. @cfahlgren1 provides many other tools on his website. Approved and bookmarked!

English

116

دریافت کریں

@antoine_chaffin @tomaarsen @ClementDelangue @huggingface @LeRobotHF @saba9 @znation @elonmusk