پن کیا گیا ٹویٹ
Sylvain Lesage
18 posts

Sylvain Lesage
@severo_dev
Dataviz freelance developer. Part-time 🤗 @huggingface (dataset viewer) GitHub : https://t.co/IBRvyeYaGI
شامل ہوئے Kasım 2013
43 فالونگ61 فالوورز

Received a lot of information, decided to continue building practical AI programs
Dev 80% supply has been locked
app.streamflow.finance/contract/solan…
Sylvain Lesage@severo_dev
Yo, I just dropped an experimental memecoin on Pump. Should I keep building an AI app for it?🤗🤗 pump.fun/coin/2HjpyavGa…
English

Yo, I just dropped an experimental memecoin on Pump. Should I keep building an AI app for it?🤗🤗
pump.fun/coin/2HjpyavGa…
English

I 🙏 at the altar of stamina. "(Stamina is) the ability to chip away at goals despite a lack of visible progress. To hold focus and presence in a world incentivized to distract you. To stay patient. To be on time. To push through difficult material. To follow instructions or proceed without them."
English

The Lovelace 2.0 Test of Artificial Creativity and Intelligence “tell a story in which a boy falls in love with a girl, aliens abduct the boy, and the girl saves the world with the help of a talking cat.” Change that 🐱 to a 🐶 and I'd read that story any day.
arxiv.org/abs/1410.6142
English

Online demo of hyparquet: a parser for apache parquet files.
Uses hightable for high performance windowed table viewing.huggingface.co/spaces/severo/…
English
Sylvain Lesage ری ٹویٹ کیا

I just finished my small experiments comparing different encoder models on retrieval tasks.
The goal was to check whether MLM is better than RTD for these tasks.
I compared Electra's small models, both generator and discriminator, that have the same size. Additionally, it was tested DeBERTa v1, which was pre-trained with MLM and DeBERTa v3, which was pre-trained with RTD on a two times larger dataset. As a baseline ModernBERT was evaluated as well.
Models were fine-tuned on 500k examples from the MS-MARCO dataset (huggingface.co/datasets/sente…).
For benchmarking, the NanoBEIR evaluator was used. You can see the average ndcg@10 plotted below.
It's clearer that MLM-trained models produce better discriminative features, however, more detailed experiments are needed for more accurate conclusions.
@antoine_chaffin @tomaarsen

English

@ClementDelangue Absolutely, the transformer architecture has become a cornerstone for model definition
English
Sylvain Lesage ری ٹویٹ کیا

So cool to see transformers becoming the source of truth for model definition & collaborating with wonderful partners like vLLM to have these models run everywhere the fastest!
As a model builder, it means that you integrate with Hugging Face & instantly get hundreds of integrations out of the box.
Time to accelerate AI, one integration at a time!

English
Sylvain Lesage ری ٹویٹ کیا

Robotics simulation is how we teach robots to act smart—before they ever touch the real world. It’s physical AI: simulating Newtonian physics so robots can grip, move, and learn.
A new simulator is coming to @huggingface @LeRobotHF 🤗. Stay tuned 👀😉
English

Come find the many BERT islands. Or see how datasets relate in practice, not just in theory. See how libraries or tasks can tie repositories together. You can play around with node size using storage/likes/downloads too.
The result is a super fun visualization from @saba9 and @znation that I’ve already lost way too much time to. I'm excited to see how the networks grow as we add more repositories!
xet-team/repo-graph
English
Sylvain Lesage ری ٹویٹ کیا

🚨 SkyReels just launched! The world’s first open-source video generation platform supporting unlimited duration 🔥
All-in-one creator toolkit:
- Consistent high-quality video (LoRA ready)
- Fast gen, amazing output.
- Amazing facial expressions .
Plus text-to-film agent handles everything: script, character, storyboards, full AV gen, auto-edit . it's Wild!
Step by step tutorial 👇
English

Need to convert CSV to Parquet?
Use chatdb.ai/tools/csv-to-p…. It does the job instantly.
@cfahlgren1 provides many other tools on his website. Approved and bookmarked!

English



