Aarush

89 posts

Aarush banner
Aarush

Aarush

@Aarush1003

Searching for things to retrieve @LightOnIo Masters @DIKU_Institut

Katılım Temmuz 2024
397 Takip Edilen31 Takipçiler
Aarush
Aarush@Aarush1003·
Going to be joining @LightOnIO for the summer!! Will be working on some really cool search tools 🕵️‍♂️🌐 And thanks @AmelieTabatta & @raphaelsrty for the opportunity :) Search Models that will be trained by me⬇️⬇️
GIF
English
6
1
19
1.1K
Aarush retweetledi
Omar Khattab
Omar Khattab@lateinteraction·
in case you missed it, OBLIQ-Bench is now on arXiv: arxiv.org/pdf/2605.06235 my hope is that this reduces the frequency of IR or search agents papers that I discard immediately as a reader because in 2026 they’re still evaluating on long-expired MS MARCO, NQ, HotPotQA, BEIR, etc
Diane@dianetc_

We set out to build a better retriever, so we looked for the hardest IR benchmarks. For each, we asked how much headroom remained by running oracle reranking with a frontier LLM. Most had little room left! So we built OBLIQ-Bench to study much harder search queries than before.

English
4
22
108
12.1K
Antoine Chaffin
Antoine Chaffin@antoine_chaffin·
Scaling laws of multi-vector models seem rather different from dense ones
English
4
3
34
3.9K
Aarush
Aarush@Aarush1003·
I am also looking for visiting research positions for the summer, if anyone has any leads please reach out :)
English
0
0
0
39
Aarush retweetledi
Guanya Shi
Guanya Shi@GuanyaShi·
I’m so tired of writing rebuttals to this kind of “lack of novelty” review: “This paper trivially combines A, B, and C, so the algorithmic novelty is limited.” Technically, most (if not all) robotics papers are convex combinations of existing ideas. I still deeply appreciate A+B+C papers—especially when they deliver: - New capabilities: the “trivial combination” unlocks behaviors we simply couldn’t achieve before - Sensible & organic design: A+B+C is clearly the right composition—not some arbitrary A′+B+C′ - Nontrivial interactions: careful analysis of the dynamics, coupling, or failure modes between A, B, C - Rehabilitating old ideas: A was dismissed for years, but paired with modern B/C, it suddenly works—and teaches us why - System-level & "interface" insight: the contribution is not any single piece, but how the pieces talk to each other - Scaling laws or regimes: identifying when/why A+B+C works (and when it doesn’t) - Engineering clarity: making something actually work robustly in the real world is not “trivial” - New problem formulations: sometimes the real novelty is in the reformulation—only under this view does A+B+C make sense. Maybe worth keeping these in mind when reviewing the next A+B+C paper : )
English
29
122
980
113.9K
neural nets.
neural nets.@cneuralnetwork·
happy to announce that our paper from AI4Bharat has been accepted to the icbinb workshop at ICLR 2026 🎊 work done with @prajdabre @AdishPandya
neural nets. tweet media
English
22
3
440
13.5K
Aarush
Aarush@Aarush1003·
Combining LLM generated hard-negatives with that of BM25 or cross-encoders improves performance but still can not outperform the baselines. More importantly we find that Phi4 generates data that results in a better retriever than Qwen3-30B a model twice the size of Phi4.
English
1
0
0
43
Aarush
Aarush@Aarush1003·
New Pre-Print !! LLMs are not good dataset generators for retrieval tasks...
Aarush tweet media
English
1
0
0
489