Mixedbread

112 posts

Mixedbread banner
Mixedbread

Mixedbread

@mixedbreadai

Your fav. AI bakers! We're hiring!

San Francisco, CA Katılım Mart 2024
11 Takip Edilen3.3K Takipçiler
Sabitlenmiş Tweet
Mixedbread
Mixedbread@mixedbreadai·
Introducing Mixedbread Wholembed v3, our new SOTA retrieval model across all modalities and 100+ languages. Wholembed v3 brings best-in-class search to text, audio, images, PDFs, videos... You can now get the best retrieval performance on your data, no matter its format.
Mixedbread tweet media
English
35
121
951
198.2K
Mixedbread
Mixedbread@mixedbreadai·
Introducing mxbai-rerank-v3-listwise: reranking that goes beyond binary relevance. It reads the whole candidate set, resolves conflicts, and ranks by directives like recency, source priority, and multi-step rules. +11% NDCG@10 on average across multiple domains, modalities, and languages in runs with Wholembed v3. Available today in preview in Mixedbread.
Mixedbread tweet media
English
5
18
137
24.2K
Mixedbread
Mixedbread@mixedbreadai·
Mixedbread search's ultimate aim is to power all workflows, no matter their modality or language. Try it for your own knowledge-intensive tasks today: mixedbread.com
English
0
1
11
2.1K
Mixedbread
Mixedbread@mixedbreadai·
You can read more about this in our blog post, where we present more detailed benchmark results and elaborate on the nature of the three benchmarks, and why we're very proud to be topping all three of them. mixedbread.com/blog/closing-g…
English
1
2
17
2.9K
Mixedbread
Mixedbread@mixedbreadai·
So what is the Oracle gap? Optimising agentic systems is complicated. There are many individual components you need to get just right. Retrieval is one of those components, and its impact is best measured by the Oracle gap: the difference between the performance of the same system between an imperfect retriever and perfect, fully-relevant results that would be provided by a so-called Oracle.
English
1
2
12
2.5K
Mixedbread
Mixedbread@mixedbreadai·
Agents are increasingly performing knowledge work: Deep Research, generating financial reports, reasoning across historical knowledgebases... Many high-quality benchmarks now focus on evaluating such tasks, among which BrowseComp-Plus, @databricks's OfficeQA, or @Snowflake's MADQA, released just last week.
English
1
1
22
2.9K
Mixedbread
Mixedbread@mixedbreadai·
For Agentic tasks, Oracle-level performance is the maximum performance a system can achieve, assuming it is able to retrieve all relevant documents perfectly, every time. We're proud to show that Mixedbread Search approaches the Oracle on multiple knowledge intensive benchmarks.
Mixedbread tweet media
English
4
22
148
80K
Mixedbread retweetledi
Omar Khattab
Omar Khattab@lateinteraction·
I've been eagerly awaiting this release from the @mixedbreadai folks. They're world-leading experts in late interaction retrieval. And today they remind us that late interaction done well makes all your favorite embedding models look like they don't work.
Omar Khattab tweet media
Mixedbread@mixedbreadai

Introducing Mixedbread Wholembed v3, our new SOTA retrieval model across all modalities and 100+ languages. Wholembed v3 brings best-in-class search to text, audio, images, PDFs, videos... You can now get the best retrieval performance on your data, no matter its format.

English
8
23
199
22.3K
Mixedbread
Mixedbread@mixedbreadai·
Wholembed v3 is available immediately through Mixedbread Search. You can try it on our platform now, for free: New users get 2M free tokens to get started. Startups can receive much more through our partnered accelerator programs with Vercel and TinyFish. mixedbread.com
English
1
0
49
6.6K
Mixedbread
Mixedbread@mixedbreadai·
Introducing Mixedbread Wholembed v3, our new SOTA retrieval model across all modalities and 100+ languages. Wholembed v3 brings best-in-class search to text, audio, images, PDFs, videos... You can now get the best retrieval performance on your data, no matter its format.
Mixedbread tweet media
English
35
121
951
198.2K
Mixedbread retweetledi
Ben Clavié
Ben Clavié@bclavie·
The SIGIR deadline has now passed. You know what hasn't passed? The Late Interaction Workshop deadline! Which we are extending to February 20th AOE. We care about everything around multi-vector retrieval: indexing, field reports, model training, multimodal, you name it.
Ben Clavié tweet media
English
4
11
35
18.6K
Mixedbread
Mixedbread@mixedbreadai·
We're building Mixedbread to close the gap between Search that is possible today and what the users of tomorrow will demand. You can read more about it here: mixedbread.com/blog/multimoda…
English
3
4
55
4.8K
Mixedbread
Mixedbread@mixedbreadai·
Retrieval has always been, and continues to be the natural interface to information. Traditionally, it was how you found useful websites and relevant snippets. Nowadays, it's how agents find exactly the pieces of context they need to answer a user's query. But retrieval has been stagnant for too long, and it has become clear that the once-omnipresent single-vector text representation is not meeting the needs of this new generation of users. Agents need the model to be able to understand long, reasoning-intensive queries. They need the ability to retrieve documents where they live, be they text, images, pdfs or even videos.
English
1
0
22
3.2K
Mixedbread
Mixedbread@mixedbreadai·
We build the first production ready multi-vector and multimodal search. Now we are serving over 1 billion documents in under 50ms latency (p50). We are sharing how we build it.
Mixedbread tweet media
English
14
43
335
65.5K