
Rohan Jha
474 posts

Rohan Jha
@Robro612
CS PhD Student @jhuclsp Previously: Intern @JinaAI_, MS CS @UTAustin, BS AI @carnegiemellon Interested in Information Retrieval and NLP










New 📄: we replicate XTR, a multi-vector retrieval method that makes ColBERT faster by avoiding its expensive step of gathering full document embeddings XTR is not a free lunch over ColBERT, but its training objective is useful for modern efficient engines like PLAID and WARP 👇🏼

Today, with the merging into PyLate, you can train SOTA XTR models just by changing the score definition in your existing boilerplates Yes, this is as easy as this Engineering done right



XTR allows to perform multi-vector retrieval faster But there is not much models and tooling around it, hindering its adoption @Robro612 did a very interesting replication study and we took the opportunity to merge XTR into PyLate, alongside the awesome XTR-WARP of @hugemensa

New 📄: we replicate XTR, a multi-vector retrieval method that makes ColBERT faster by avoiding its expensive step of gathering full document embeddings XTR is not a free lunch over ColBERT, but its training objective is useful for modern efficient engines like PLAID and WARP 👇🏼






