Rohan Jha
@Robro612
385 posts

CS PhD Student @jhuclsp Previously: Intern @JinaAI_, MS CS @UTAustin, BS AI @carnegiemellon Interested in Information Retrieval and NLP

Baltimore, MD · Joined June 2015
429 Following · 281 Followers
Rohan Jha @Robro612 ·
A Claude Code skill that one-shots adding a dataset to ir_datasets.
Rohan Jha retweeted
Sumit @_reachsumit ·
AMES: Approximate Multi-modal Enterprise Search via Late Interaction Retrieval Apple presents a multimodal late interaction retrieval method deployed in Apache Solr, combining parallel token-level ANN candidate generation with Exact MaxSim re-ranking. 📝 arxiv.org/abs/2603.13537
Haocheng Xi @HaochengXiUCB ·
K-means is simple. Making it fast on GPUs isn't. That's why we built Flash-KMeans, an IO-aware implementation of exact k-means that rethinks the algorithm around modern GPU bottlenecks. By attacking the memory bottlenecks directly, Flash-KMeans achieves a 30x speedup over cuML and a 200x speedup over FAISS, with the same exact algorithm, just engineered for today's hardware. At the million scale, Flash-KMeans can complete a k-means iteration in milliseconds. A classic algorithm, redesigned for modern GPUs. Paper: arxiv.org/abs/2603.09229 Code: github.com/svg-project/fl…
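The speedups claimed above come from IO-aware GPU engineering; the algorithm itself is the textbook exact (Lloyd) iteration. For reference, a plain NumPy sketch of one such iteration (my baseline illustration, not Flash-KMeans' implementation):

```python
import numpy as np

def kmeans_iteration(X, centroids):
    # One exact Lloyd step: assign each point to its nearest centroid,
    # then recompute each centroid as the mean of its assigned points.
    # Squared Euclidean distance expanded as ||x||^2 - 2 x.c + ||c||^2.
    d2 = (X**2).sum(1, keepdims=True) - 2 * X @ centroids.T + (centroids**2).sum(1)
    assign = d2.argmin(axis=1)
    new_centroids = np.vstack([
        X[assign == k].mean(axis=0) if np.any(assign == k) else centroids[k]
        for k in range(len(centroids))
    ])
    return new_centroids, assign
```

The `X @ centroids.T` term is exactly the kind of memory-bound matmul the tweet says dominates on GPUs at the million scale.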
Rohan Jha retweeted
Pau @hugemensa ·
WARP just got faster and more memory efficient ⚡️ New 0.2 version focuses on CPU upgrades, increasing QPS by almost twofold and slashing memory usage during search by 60-80%, all without a change in the metrics ⚙️ Technical details below
Rohan Jha retweeted
Ben Clavié @bclavie ·
I'm so excited to introduce this! We've worked on a million different moving parts to produce this. I'm fairly confident it's the best multimodal model that exists, period -- and it's not too shabby at pushing back the LIMITs of retrieval either...
Mixedbread @mixedbreadai

Introducing Mixedbread Wholembed v3, our new SOTA retrieval model across all modalities and 100+ languages. Wholembed v3 brings best-in-class search to text, audio, images, PDFs, videos... You can now get the best retrieval performance on your data, no matter its format.

Rohan Jha retweeted
Sumit @_reachsumit ·
Does Reasoning Make Search More Fair? Comparing Fairness in Reasoning and Non-Reasoning Rerankers @itssaronsamuel et al. present a systematic comparison of fairness between reasoning and non-reasoning rerankers. 📝 arxiv.org/abs/2603.10332
Antoine Chaffin @antoine_chaffin ·
let's call it a vocabulary
Aamir @aaxsh18 ·
Seems like BM25 works beyond text. Single dense vector retrieval is dead.
Rohan Jha retweeted
Sumit @_reachsumit ·
Beyond Relevance: On the Relationship Between Retrieval and RAG Information Coverage @itssaronsamuel et al. investigate whether upstream retrieval metrics can predict downstream RAG information coverage. 📝 arxiv.org/abs/2603.08819
Rohan Jha @Robro612 ·
@antoine_chaffin @raphaelsrty IMO you need to hammer harder on the point in this tweet of yours. People saying "grep is all you need" don't realize that you can be more targeted, and that this has tangible cost benefits at the agent-task level, not just on search problems. x.com/antoine_chaffi…
Antoine Chaffin @antoine_chaffin

Tokens aren't free. On our 135-question bench, we saved around $32. As a rule of thumb, this means $243/1k questions. It starts to add up pretty quickly, especially given large-team usage.

Antoine Chaffin @antoine_chaffin ·
We have received a lot of very positive feedback on ColGrep and LateOn-Code from people trying them out. We've also read a lot about how they should be more popular given their power. Any ideas on how we could spread the word? Should we do a collab with CC/Codex somehow?
Alex @santangelx

I shared this with friends and colleagues and now all the smart ones are using it. Someone even built it into their product with Postgres pgvector. Ironic that a search product is hard to find 😅

Sumit @_reachsumit ·
Visual Words Meet BM25: Sparse Auto-Encoder Visual Word Scoring for Image Retrieval Introduces BM25-V, a sparse image retrieval method that applies Okapi BM25 scoring to Sparse Auto-Encoder visual words from ViT patch features. 📝 arxiv.org/abs/2603.05781
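BM25-V swaps text tokens for sparse auto-encoder visual words but keeps the scoring function itself. As a reference point, a minimal sketch of textbook Okapi BM25 over bags of tokens (my illustration; the paper's exact variant, smoothing, and parameters may differ):

```python
import math
from collections import Counter

def bm25_scores(query_terms, docs, k1=1.5, b=0.75):
    # Textbook Okapi BM25; BM25-V treats sparse auto-encoder visual words
    # as the "tokens" of an image, so each doc here is a bag of visual words.
    N = len(docs)
    avgdl = sum(len(d) for d in docs) / N
    tfs = [Counter(d) for d in docs]
    df = Counter(t for tf in tfs for t in tf)       # document frequency
    scores = []
    for tf, d in zip(tfs, docs):
        s = 0.0
        for t in query_terms:
            if t not in tf:
                continue
            idf = math.log((N - df[t] + 0.5) / (df[t] + 0.5) + 1)
            # tf saturation (k1) and length normalization (b)
            s += idf * tf[t] * (k1 + 1) / (
                tf[t] + k1 * (1 - b + b * len(d) / avgdl))
        scores.append(s)
    return scores
```

With ViT patch features, each image's patches would first be encoded by the sparse auto-encoder into discrete visual-word activations before this scoring applies.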
Rohan Jha retweeted
Leo Boytsov @srchvrs ·
🧵For the last seven years, I kept re-implementing the same pattern: A parallel map loop that divides the work among several processes or threads. My very first attempts were built on Python’s standard tools, e.g., multiprocessing.map... ↩️
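The pattern Leo describes, fanning a map over a pool of workers, has a one-liner stdlib baseline; his thread is about why that baseline falls short in practice. A minimal sketch using threads for portability (for CPU-bound work you'd want processes instead, which is exactly where the standard tools get fiddly; `parallel_map` is my name for it):

```python
from concurrent.futures import ThreadPoolExecutor

def parallel_map(fn, items, max_workers=8):
    # Divide the work among pool workers; results come back in input order.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(fn, items))
```

Swapping in `ProcessPoolExecutor` (or `multiprocessing.Pool.map`, as in the tweet) adds pickling constraints on `fn` and its arguments, one of the recurring pain points with the standard tools.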
Rohan Jha retweeted
Jeremy Howard @jeremyphoward ·
According to OpenAI, their contract with the US DoW locks in current law, "even if those laws or policies change in the future". Our legal analysis, with Virgil Law CEO @LukeVerswey, shows that this is almost certainly incorrect. answer.ai/posts/2026-03-…
Rohan Jha retweeted
Neha Verma @n_verma1 ·
Introducing "DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging" ! We introduce an optimal transport framework for Transformer width compression that redistributes signal across neurons rather than eliminating them 🚚⚖️ 🧵 1/6