Omar Khattab

Josh McGrath@j_mcgraph·8h

yes yes everyone should start saying this more, and probably tagging my boss

it's underappreciated how excellent OpenAI models are at web search

Omar Khattab@lateinteraction·5h

tied with x.com/mixedbreadai/s…

Mixedbread@mixedbreadai

Introducing Mixedbread Wholembed v3, our new SOTA retrieval model across all modalities and 100+ languages. Wholembed v3 brings best-in-class search to text, audio, images, PDFs, videos... You can now get the best retrieval performance on your data, no matter its format.

Omar Khattab@lateinteraction·8h

biggest AI news in 2 weeks ICYMI

BrowseComp-Plus, perhaps the hardest popular deep research task, is now solved at nearly 90%... and all it took was a 150M model ✨ Thrilled to announce that Reason-ModernColBERT did it again and outperforms all models (including models 54× bigger) on all metrics

Omar Khattab@lateinteraction·5h

@nileshgupta2797 @LightOnIO @mixedbreadai Yup I don’t believe late interaction should fundamentally revolve around MaxSim. It’s just a lot better than single vector. It is possible to design richer pruning-friendly set-level / multi-vector scoring functions I think!

Nilesh Gupta@nileshgupta2797·5h

@lateinteraction @LightOnIO @mixedbreadai I agree logical composition is definitely where single vectors fail hard: Euclidean space by design doesn't allow modeling logical relevance. Though I've been wondering what the limits of MaxSim are: is it limited to AND-like queries, or does it go beyond (e.g. (A or B) and (not C))?
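
A toy sketch of the AND-like case discussed in this exchange, not anyone's actual model. It assumes MaxSim in its usual form (for each query token vector, take the best dot product against any document token vector, then sum) and uses mean pooling as a crude stand-in for a single-vector encoder. The vectors `A`, `B`, `OTHER` and both documents are made up for illustration, and the sketch says nothing about the OR / NOT part of the question.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def mean_pool(vecs):
    # Collapse a set of token vectors into one vector (assumed stand-in
    # for a single-vector encoder; real encoders are learned, not averaged).
    n = len(vecs)
    return [sum(col) / n for col in zip(*vecs)]

def single_vector_score(query_vecs, doc_vecs):
    return dot(mean_pool(query_vecs), mean_pool(doc_vecs))

def maxsim_score(query_vecs, doc_vecs):
    # For each query token, keep its best match in the document, then sum.
    return sum(max(dot(q, d) for d in doc_vecs) for q in query_vecs)

# Hypothetical orthogonal "concept" vectors.
A, B, OTHER = [1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]

query = [A, B]                    # "find X that satisfies A and B"
doc_both = [A, B, OTHER, OTHER]   # satisfies both constraints, plus other content
doc_only_a = [A, A]               # satisfies only A, but very "purely"

pooled_both = single_vector_score(query, doc_both)
pooled_only_a = single_vector_score(query, doc_only_a)
maxsim_both = maxsim_score(query, doc_both)
maxsim_only_a = maxsim_score(query, doc_only_a)
```

In this toy setup the pooled score prefers `doc_only_a`, because averaging dilutes the document that mentions both constraints alongside other content, while MaxSim prefers `doc_both`, because each query token is matched separately.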

Omar Khattab@lateinteraction·14h

late interaction model (150M) beats the 54x larger Qwen3-8B-Embedding by... hmm, looks like up to 34% relative increase :D also really funny that the entire top section of the BC+ leaderboard, sorted by Recall, is just late interaction models by @LightOnIO and @mixedbreadai

BrowseComp-Plus, perhaps the hardest popular deep research task, is now solved at nearly 90%... and all it took was a 150M model ✨ Thrilled to announce that Reason-ModernColBERT did it again and outperforms all models (including models 54× bigger) on all metrics

Omar Khattab@lateinteraction·5h

@nileshgupta2797 @LightOnIO @mixedbreadai The difference is huge on every distribution not yet overfit by the single vector models. This was true back in the day on all of BEIR, all of LoTTE, and all the OpenQA datasets, etc. Single vector is uncompositional. It advanced only by moving distributions in domain.

Nilesh Gupta@nileshgupta2797·5h

@lateinteraction @LightOnIO @mixedbreadai Do you think such a big diff is primarily because of the logical AND-like, multi-constraint nature of queries in BrowseComp, e.g. find X that satisfies A and B and C and D...?

Omar Khattab retweeted

Joel Dierkes@joeldierkes·7h

Mixedbread just made 115h of videos accessible to my agent. With the new @mixedbreadai v3 release, you can upload any video to your Mixedbread store and make its content accessible to your agent.

Omar Khattab retweeted

Pamela Fox@pamelafox·8h

@SQLGene @dbreunig indeed, SLMs came up so much last night: lots of reports of great performance after a bit of DSPy to correct malformed JSON and tool calling.

Omar Khattab retweeted

Josh Clemm@joshclemm·8h

@pamelafox @dbreunig The link to the slides from the Dropbox Dash team is at the bottom of our eng blog post dropbox.tech/machine-learni…

Omar Khattab retweeted

Zijian Chen@zijian42chen·10h

Late to BrowseComp-Plus...but good interactions 🙂

BrowseComp-Plus, perhaps the hardest popular deep research task, is now solved at nearly 90%... and all it took was a 150M model ✨ Thrilled to announce that Reason-ModernColBERT did it again and outperforms all models (including models 54× bigger) on all metrics

Omar Khattab retweeted

paul@pteiletche·11h

this guy just never stops winning!

BrowseComp-Plus, perhaps the hardest popular deep research task, is now solved at nearly 90%... and all it took was a 150M model ✨ Thrilled to announce that Reason-ModernColBERT did it again and outperforms all models (including models 54× bigger) on all metrics

Omar Khattab retweeted

Aamir@aaxsh18·9h

Mixedbread stores can handle videos now with 48h+ runtime

Omar Khattab retweeted

Raymond Weitekamp@raw_works·10h

currently re-embedding my entire machine, thank you very much! LateOn-Code-edge for code search and Reason-ModernColBERT for prose/docs search.

BrowseComp-Plus, perhaps the hardest popular deep research task, is now solved at nearly 90%... and all it took was a 150M model ✨ Thrilled to announce that Reason-ModernColBERT did it again and outperforms all models (including models 54× bigger) on all metrics

Omar Khattab@lateinteraction·9h

what do people mean by ex-founder? how did you edit history to un-founder yourself

Omar Khattab@lateinteraction·9h

to do really well on OOLONG

Omar Khattab@lateinteraction·9h

someone should try having RLMs, optimized with GEPA, write REPL code primarily using DSPy to do autoresearch for ColBERTv3

alex zhang@a1zhang

someone should try having RLMs write REPL code primarily using DSPy

Omar Khattab retweeted

alex zhang@a1zhang·11h

someone should try having RLMs write REPL code primarily using DSPy

Omar Khattab retweeted

Igor Carron@IgorCarron·12h

Omar is shy about this late interaction approach changing everything. As an outsider and seeing @LightOnIO's crew delivering exceptional results day after day, I prefer to frame this type of result as being one of those rare AlexNet moments. There is simply no turning back.

imo, these kinds of regular amazing results don't quite mean that late interaction is extremely strong per se as much as they mean that dense single-vector retrievers are a permanent bottleneck on your quality and generalization. they're so bad!

Omar Khattab@lateinteraction·12h

@HansiZeng @raphaelsrty @LightOnIO @mixedbreadai I definitely agree it’s not trivial to implement it right. But why does one need to implement it? It’s already implemented and optimized. We don’t usually implement our own attention kernels or HNSW index or anything like that. Rerankers are also far more expensive. Need GPU.

Hansi Zeng@HansiZeng·12h

@lateinteraction @raphaelsrty @LightOnIO @mixedbreadai But what about a dense retriever plus a reranker? I like ColBERT(v2), but it is not trivial to implement it right... dense retrieval + a reranker is much easier to implement.
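
For readers unfamiliar with the "dense retriever plus reranker" setup mentioned here, a minimal sketch of the two-stage shape, under stated assumptions: the document vectors, the query vector, and the `toy_reranker_scores` table are all invented for illustration, and the table merely stands in for a costlier scorer such as a cross-encoder. It is not any particular library's API.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def retrieve(query_vec, doc_vecs, k):
    # First stage: one dot product per document, keep the top-k ids.
    ranked = sorted(doc_vecs, key=lambda doc_id: dot(query_vec, doc_vecs[doc_id]), reverse=True)
    return ranked[:k]

def rerank(candidates, score_fn):
    # Second stage: re-sort only the shortlisted ids with a costlier scorer.
    return sorted(candidates, key=score_fn, reverse=True)

# Hypothetical single-vector embeddings.
doc_vecs = {"d1": [0.9, 0.1], "d2": [0.8, 0.3], "d3": [0.1, 0.9]}
query_vec = [1.0, 0.0]

# Hypothetical reranker judgments (stand-in for a cross-encoder).
toy_reranker_scores = {"d1": 0.2, "d2": 0.7, "d3": 0.9}

shortlist = retrieve(query_vec, doc_vecs, k=2)
final = rerank(shortlist, lambda doc_id: toy_reranker_scores[doc_id])
```

Here `d3` is the document the toy reranker would score highest, but it never reaches the second stage because the first stage did not shortlist it. The reranker can only reorder what the retriever returns.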

Omar Khattab retweeted

Anthony Ronning@anthonyronning·1d

okay i like half understand RLMs now and it's sick
