Stéphane Clinchant

88 posts

Stéphane Clinchant

Stéphane Clinchant

@sclincha

Katılım Kasım 2014
209 Takip Edilen120 Takipçiler
Stéphane Clinchant retweetledi
Thibault Formal
Thibault Formal@thibault_formal·
New sparse retrieval model: introducing SPLARE, which extends SPLADE by replacing the vocabulary head with pretrained SAEs! paper: arxiv.org/abs/2603.13277 (ICLR'26) also how we won the WSDM'26 Cup on multilingual retrieval: arxiv.org/abs/2602.20986 (model weights coming soon!)
English
3
8
44
4K
Stéphane Clinchant retweetledi
Xin Eric Wang
Xin Eric Wang@xwang_lk·
𝘏𝘶𝘮𝘢𝘯𝘴 𝘵𝘩𝘪𝘯𝘬 𝘧𝘭𝘶𝘪𝘥𝘭𝘺—𝘯𝘢𝘷𝘪𝘨𝘢𝘵𝘪𝘯𝘨 𝘢𝘣𝘴𝘵𝘳𝘢𝘤𝘵 𝘤𝘰𝘯𝘤𝘦𝘱𝘵𝘴 𝘦𝘧𝘧𝘰𝘳𝘵𝘭𝘦𝘴𝘴𝘭𝘺, 𝘧𝘳𝘦𝘦 𝘧𝘳𝘰𝘮 𝘳𝘪𝘨𝘪𝘥 𝘭𝘪𝘯𝘨𝘶𝘪𝘴𝘵𝘪𝘤 𝘣𝘰𝘶𝘯𝘥𝘢𝘳𝘪𝘦𝘴. But current reasoning models remain constrained by discrete tokens, limiting their full potential. Introducing 𝐒𝐨𝐟𝐭 𝐓𝐡𝐢𝐧𝐤𝐢𝐧𝐠: a training-free method that mimics human-like “soft” reasoning by generating continuous, abstract concept tokens. These tokens smoothly blend multiple meanings through probability-weighted mixtures of embeddings, enabling richer representations and seamless exploration of diverse reasoning paths. 𝐓𝐡𝐞 𝐢𝐦𝐩𝐚𝐜𝐭? ✅ Improved accuracy on math & code benchmarks by up to 2.48% (pass@1). ✅ Reduced token usage by up to 22.4%, making reasoning models both smarter and more efficient.
Xin Eric Wang tweet media
English
26
134
917
119.4K
Stéphane Clinchant retweetledi
Vaibhav (VB) Srivastav
Vaibhav (VB) Srivastav@reach_vb·
AllenAI COOKED, Llama 3.1 Tulu 405B beats DeepSeek V3 - all whilst being 40% SMALLER! 🔥 Fully open model weights, data and training pipeline 🤗
Vaibhav (VB) Srivastav tweet media
English
17
56
422
45.7K
Stéphane Clinchant
Stéphane Clinchant@sclincha·
We welcome contributors to add datasets, metrics, and other tasks to BERGEN. Join us! 🤝 #RAG #LLMs
English
0
0
1
99
Stéphane Clinchant
Stéphane Clinchant@sclincha·
What’s a good baseline for RAG? 🤔 The literature shows consistent differences in experimental setups, retrievers, datasets, and metrics. So, we built the BERGEN library github.com/naver/bergen to enhance reproducibility and identify strong baselines : 🧵 @naverlabseurope
English
1
7
21
1.4K
Jerry Liu
Jerry Liu@jerryjliu0·
RankZephyr is a nice 7B model by @rpradeep42 et al. that is optimized for list-wise zero-shot reranking. Since it is much smaller than proprietary models, it is a big step towards practically using LLMs as part of production retrieval systems (as opposed to simply combining LLM with a retrieval system in a typical RAG setup). Excited to present an integration with @llama_index through the RankLLM library, which contains a collection of open-source models specifically for reranking. Check out our full @llama_index RankLLM guide: docs.llamaindex.ai/en/stable/exam… RankZephyr paper: arxiv.org/pdf/2312.02724…
Jerry Liu tweet media
LlamaIndex 🦙@llama_index

Everyone building advanced RAG should carefully consider the reranker they want to use. Hint: Use an LLM (specifically RankZephyr) 💡 We’re excited to feature RankLLM by @rpradeep42 et al. - an awesome collection of open-source LLMs finetuned for reranking 💫, achieving state-of-the-art results and beating rerankers based on GPT-4 in performance. ✅ RankVicuna ✅ RankZephyr Huge shoutout to Ryan Nguyen (xpbowler on Github) for contributing the @llama_index integration! 🔥 Check out our full notebook below: docs.llamaindex.ai/en/latest/exam… RankLLM repo: github.com/castorini/rank… RankZephyr paper: arxiv.org/abs/2312.02724

English
10
18
122
24K
Stéphane Clinchant
Stéphane Clinchant@sclincha·
... especially when reviewers said ‘dense retrieval on its own has shown to surpass sparse retrieval considerably ‘ and that our ‘approach is quite incremental’ 2/2
English
0
0
0
175
Stéphane Clinchant
Stéphane Clinchant@sclincha·
It feels good when someone from a big company shares that they saw ‘ pretty promising results in terms of quality and space savings [for SPLADE] compared to dense embedding models’ ... 1/2
English
1
0
7
387
Stéphane Clinchant retweetledi
Laure Soulier
Laure Soulier@LaureSoulier·
What a great pleasure and honor to share this session about generative AI, ethics, bias, and politics with 3 passionate speakers @plimantour, Andrew Wyckoff, and Juha Heikkilä. Thanks @AI2S2Symposium for the invitation. See you in Geneva on Monday!
AI2S2 Symposium@AI2S2Symposium

⭐️ We're thrilled to unveil the complete lineup of keynote speakers at #AI2S2 in Geneva, Switzerland next week! 🗣️ Get ready for a knowledge-packed experience. Check out the full agenda here: ai2s2.org/2023/event ❗️ Act fast – free registration ends tonight!

English
0
2
14
924
Stéphane Clinchant retweetledi
Yeskendir 🇰🇿
Yeskendir 🇰🇿@yeskendir_k·
Excited to share our #ACL2023 paper "Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model". arxiv:arxiv.org/abs/2212.09811 Joint work w/ Alexandre Berard, Vassilina Nikoulina during my internship @naverlabseurope 1/6
Yeskendir 🇰🇿 tweet media
English
2
3
15
1.4K
Stéphane Clinchant retweetledi
AToMiC@TREC2023
AToMiC@TREC2023@TREC_AToMiC·
📢 REMINDER for TREC-AToMiC participants! News: Test topics are out! 🎉 Check them here: trec-atomic.github.io/annoucements/t…. We've carefully selected 200 sections from vital Wikipedia articles. Get ready for some fascinating exploration! Happy searching! 🚀
English
0
2
1
151
Sasha Rush
Sasha Rush@srush_nlp·
Would love critiques/missed cites. Had to drop a couple of sections on diagonal S4 just because it is hard to explain. However it is critical in the development of these models.
English
5
0
25
7.4K
Sasha Rush
Sasha Rush@srush_nlp·
Do we need Attention? (v0 github.com/srush/do-we-ne…): Slides for a survey talk summarizing recent Linear RNN models with a focus on NLP. Tries to cover a lot of different S4-related models (as well as RWKV/MEGA) in a digestible way.
Sasha Rush tweet mediaSasha Rush tweet mediaSasha Rush tweet mediaSasha Rush tweet media
English
10
172
796
195.6K
Stéphane Clinchant retweetledi
NAVER LABS Europe
NAVER LABS Europe@naverlabseurope·
Before signing off for the weekend sign up for next week's 📢 Open virtual seminar w @alireza_mshi Ph.D.@EPFL and @idiap_ch-@uzh_en ! Reference-Free Metric for Evaluating Question Generation by Answering the Question 📅Tue 16th May 9.30am CEST Register: tinyurl.com/uxzvaddr
NAVER LABS Europe tweet media
English
0
3
8
1.9K