Rohan Jha
385 posts

@Robro612
CS PhD Student @jhuclsp. Previously: Intern @JinaAI_, MS CS @UTAustin, BS AI @carnegiemellon. Interested in Information Retrieval and NLP.

BrowseComp-Plus, perhaps the hardest popular deep research task, is now solved at nearly 90%... and all it took was a 150M model ✨ Thrilled to announce that Reason-ModernColBERT did it again and outperforms all models (including models 54× bigger) on all metrics.

another day, another win for late interaction models github.com/matospiso/late…

Baltimore has the right spirit… at least

Introducing Mixedbread Wholembed v3, our new SOTA retrieval model across all modalities and 100+ languages. Wholembed v3 brings best-in-class search to text, audio, images, PDFs, videos... You can now get the best retrieval performance on your data, no matter its format.




Tokens aren’t free. On our 135-question bench, we saved around $32. As a rule of thumb, this means $243 per 1k questions. It starts to add up pretty quickly, especially given large team usage.

I shared this with friends and colleagues, and now all the smart ones are using it. Someone even built it into their product with Postgres pgvector. Ironic that a search product is hard to find 😅










