
Saiful Haq
120 posts

Saiful Haq
@RetrieveRerank
Founder (Stealth) Prev. Director of AI and Staff Research Engineer @Hyperbots_Inc IIT Bombay 5th Year CS PhD @cfiltnlp @iitbombay Building in stealth 🚀


On Strengths and Limitations of Single-Vector Embeddings Microsoft shows that dimensionality alone cannot explain poor retrieval performance of single-vector embeddings, identifying domain shift and the "drowning in documents" paradox as key factors. 📝 arxiv.org/abs/2603.29519

For Agentic tasks, Oracle-level performance is the maximum performance a system can achieve, assuming it is able to retrieve all relevant documents perfectly, every time. We're proud to show that Mixedbread Search approaches the Oracle on multiple knowledge intensive benchmarks.


almost everything omar and his team build looks early when it’s released, and then quietly becomes foundational later. seen the same with colbert and dspy, was not a hit immediately after release but over time the industry caught up, understood the framing, and they became widely adopted i feel it will be similar with rlms too. the industry will circle back in few months and adopt the idea of rlms as a standard way to manage computation, context, and recursion in long-running systems.

Can RLMs eliminate RAG or am I hallucinating 🤔 @a1zhang @lateinteraction






🚨 New open dataset for real medical AI. ⚕️ 200K step-by-step clinical reasoning chats (537M+ tokens) 🤖 Generated with OpenAI’s gpt-oss-120B, reasoning effort set to “high” 🏥 Built for training medical reasoning LLMs Available on @huggingface 🧵👇

🚀 New short course with @qdrant_engine: Multi-vector Image Retrieval. Taught by @LukawskiKacper, Senior Developer Advocate at Qdrant, the course shows how multi-vector techniques outperform single-vector methods by matching text tokens to image patches directly. You’ll implement ColBERT to understand multi-vector search, apply ColPali for patch-level image retrieval, reduce memory with quantization and pooling, and use MUVERA to enable fast HNSW search. The course concludes with a full multi-modal RAG pipeline built on ColPali and MUVERA. Learn more and enroll now: hubs.la/Q03XCQZ10

Thrilled to note that we are keeping the tradition of the awesome AI residency program alive in a new avatar: pre-doc researcher program at GDM-Blr -- with some amazing work done by our recent predocs including @gautham_ga_ @pranamyapk @puranjay1412 @sahilgo6801 @swaroopnath6 If you want to join this program, please apply here: google.com/about/careers/…




