Saahil Ognawala

3.3K posts

Saahil Ognawala

@saahil

Head of Product @JinaAI_ . Tweets about AI, ML, software, security and everything that is not those.

Munich; Udaipur Katılım Temmuz 2008

857 Takip Edilen491 Takipçiler

Saahil Ognawala retweetledi

Jina AI@JinaAI_·7 Mar

4K. That's the max depth you can hide text to and embedding models can still retrieve it. Beyond that, it's gone. Using jina-embeddings-v3, we reproduced the new Needle-in-a-Haystack from recent NoLiMA paper - by removing literal matches between queries and the relevant information (the needle) hidden in the haystack. We ask ourselves: - How do embedding models perform retrieval across long-context? - Can query expansion mitigate this performance gap?

English

315

26.6K

Saahil Ognawala retweetledi

Elastic@elastic·20 Şub

Elasticsearch Open Inference API now supports @JinaAI_! Developers can now build search and RAG applications using the latest Jina AI embedding and reranking models without additional integration or costs. Learn more: go.es.io/4i8sNOs

English

Saahil Ognawala retweetledi

Jina AI@JinaAI_·1 Şub

For every model we release, there are 3 ways of using it at scale: Jina API, CSP (e.g. SageMaker), self-hosted K8s. Here we evaluate latency, throughput, neighborhood problem, cost/token across three options to help you decide which one best suits you.👇jina.ai/news/a-practic…

English

3.9K

Saahil Ognawala retweetledi

Bo@bo_wangbo·1 Şub

My colleague @saahil wrote a detailed review of model deployment options, api vs cloud providers vs self-hosting, full of numbers and insights for serious developers:

Jina AI@JinaAI_

English

480

Saahil Ognawala retweetledi

Jina AI@JinaAI_·21 Kas

Jina-CLIP-v2: a 0.9B multilingual multimodal embedding model that supports 89 languages, 512x512 image resolution, 8192 token-length, and Matryoshka representations down to 64-dim for both images and text. jina.ai/news/jina-clip… With of course strong performance on retrieval & classification tasks. Like Jina-CLIP v1, the text encoder of Jina-CLIP v2 can function as a standalone dense retriever, giving performance comparable to jina-embeddings-v3, which is currently the best multilingual embedding model under 1B parameters.

English

347

42.3K

Saahil Ognawala@saahil·19 Kas

@seointrovert @JinaAI_ Can you please DM me the email address you used to buy tokens on our homepage?

English

SEO Introvert@seointrovert·19 Kas

@JinaAI_ . Do you have a channel/way to get support if after purchasing credits, the API does not work. I've tried to contact you directly through the contact form on jina.ai but got no reply after 5 days.

English

Saahil Ognawala retweetledi

Jina AI@JinaAI_·5 Kas

At #EMNLP2024 Miami next week? Join us on November 14, 2024, from 10:30 AM to 12:00 PM (Miami Time) for a BoF session on Embeddings, Rerankers, and Small LMs for better search. This 1.5-hour in-person session will bring together researchers in embeddings, rerankers & information retrieval; offering an excellent opportunity to explore recent advances in search foundation models, share your work with a specialized audience, and discuss emerging trends in search models in 2024. All EMNLP on-site participants are welcome to attend. Topics of interest: - Multimodal, multilingual, cross-lingual, and cross-modal embeddings and rerankers - Late-interaction models (e.g., ColBERT, ColPali) and late chunking - Long-context embedding models - Instruction-tuning for embedding and reranker models - LLM-based embedding and reranker models - Small language models for document reading - Efficient and lightweight embedding architectures, attention mechanisms - Zero/few-shot retrieval and adaptation methods - Contrastive learning approaches for retriever - Matryoshka representation learning, embedding compression and quantization - Hybrid sparse-dense retrieval systems - Task-LoRA, domain adaptation, OOD, fine-tuning for embedding models - MTEB, LongMTEB, RAG evaluation metrics and benchmarks - Embedding models for code and structured data - Privacy-preserving embedding techniques

English

4.8K

Saahil Ognawala retweetledi

Jina AI@JinaAI_·30 Eki

curl docs.jina.ai This is our Meta-Prompt. It allows LLMs to understand our Reader, Embeddings, Reranker, and Classifier APIs for improved codegen. Using the meta-prompt is straightforward. Just copy the prompt into your preferred LLM interface like ChatGPT, Claude, or whatever works for you, add your instructions, and you're set. In this example, we copied the entire prompt into Anthropic Claude and asked it to grab every sentence from Hacker News front page and visualize them using UMAP with matplotlib. This task is nontrivial as it combines multiple APIs from our Search Foundation, like Reader and Embedding where Claude may not have knowledge of. So if you asked Claude directly, it probably wouldn't give an optimal answer. But with the meta-prompt, Claude now has good knowledge about our APIs and can generate much better code! We can copy paste the code directly to Google Colab and with minimum modification, the code just works!

English

662

209.1K

Saahil Ognawala retweetledi

Jina AI@JinaAI_·22 Eki

Classification is the #1 downstream task for embeddings. It's recently popular for routing queries to LLMs too. We're excited to launch new Classifier API jina.ai/classifier/ - supports both zero-shot and few-shot online classification for text & image, powered by our latest jina-embeddings-v3 and jina-clip-v1.

English

108

11.6K

Saahil Ognawala retweetledi

Jina AI@JinaAI_·10 Eki

"It's a multilingual embedding model"—okay, but what's that mean? Does it mean you can search German docs with a German query, French docs with a French one, all within the same model? Or can you also search French docs using Japanese queries? A lot of these multilingual models are pretty vague about this: is it just about handling multiple languages separately, or can it do true cross-lingual retrieval? In fact, most models can only manage the former because of the 'language gap'—where similar phrases in different languages don’t align as closely as they should. This makes them less useful for cross-lingual search, which would be much more valuable for global businesses. While training jina-embeddings-v3, we realized that bridging the language gap isn't straightforward. The two figures below illustrate this. We pre-trained jina-roberta-xlm (the backbone of v3) to see how well it could learn cross-language equivalencies through masked language pre-training. We used UMAP to plot 2D sentence representations for a set of English sentences and their translations into German, Dutch, Simplified Chinese, and Japanese. And it's bad. Comparing this with the final v3 model, it’s clear that the language gap has been significantly reduced. The embeddings in v3 show minimal language-specific clustering, and semantically similar texts produce close embeddings, no matter the language.

English

191

12.7K

Saahil Ognawala retweetledi

Michael Günther@michael_g_u·4 Eki

We extended our priprint about late chunking, a novel method to make embeddings of chunks context-aware. We added: - Algorithm for long documents - Training method to make late chunking more effective - Comparison to Anthropic's contextual embedding arxiv.org/abs/2409.04701

English

Saahil Ognawala retweetledi

Jina AI@JinaAI_·23 Eyl

jina-embeddings-v3, reader-lm-0.5b, and reader-lm-1.5b models are now available on AWS SageMaker and Azure Marketplace. Deploy these frontier models within your company’s cloud infrastructure to maintain compliance and full data ownership. Learn more at the link below: AWS v3: aws.amazon.com/marketplace/pp… reader-lm-1.5b: aws.amazon.com/marketplace/pp… reader-lm-0.5b: aws.amazon.com/marketplace/pp… Azure: v3: azuremarketplace.microsoft.com/en-us/marketpl… reader-lm-1.5b: azuremarketplace.microsoft.com/en-us/marketpl… reader-lm-0.5b: azuremarketplace.microsoft.com/en-us/marketpl…

English

125

9.8K

Saahil Ognawala retweetledi

Bo@bo_wangbo·17 Eyl

We will use the latest multilingual text embedding model from @JinaAI_ to encode over 100 million entries of human knowledge and make it easier to reach:

Tech.eu@tech_eu

Wikimedia, DataStax, and Jina AI launch semantic search for non-profit AI developers buff.ly/3TwC49g

English

2.7K

Saahil Ognawala retweetledi

Simon Willison@simonw·20 Eyl

Neat, looks like @JinaAI_ have a CLIP-style model available via their API

sankalp@dejavucoder

@simonw hi simon, you may look into jina clip v1 multimodal embeddings here jina.ai/embeddings/ and this git repo as well github.com/jina-ai/clip-a…

English

9.3K

Saahil Ognawala retweetledi

Omar Sanseviero@osanseviero·19 Eyl

What a day! Today we got 🎉Qwen Party of models (base, code, math,...) 💬 Kyutai Moshi on-device speech-to-speech 📹 CogVideoX image-to-video 🤏 Jina releasing one of the best open embedding models

English

163

9.7K

Saahil Ognawala retweetledi

Bo@bo_wangbo·18 Eyl

my personal favourite about jina-embeddings-v3 (beyond fancy features) is, we manually checked the common failures made by different text embedding models, created failure taxonomy, and try to fix them one by one. This involves a lot of painful, manual work:

Jina AI@JinaAI_

Finally, jina-embeddings-v3 is here! A frontier multilingual embedding model with 570M parameters, 8192-token length, achieving SOTA performance on multilingual and long-context retrieval tasks. It outperforms the latest proprietary models from OpenAI and Cohere, and outperforms multilingual-e5-large-instruct across all multilingual tasks. In fact, as of today, jina-embeddings-v3 is the best multilingual model and ranks 2nd on the MTEB English leaderboard for models < 1B parameters.

English

141

11.3K

Saahil Ognawala retweetledi

Victoria Slocum@victorialslocum·18 Eyl

Optimizing your chunking techniques is one of the top places to improve performance in your RAG pipelines, but what’s the best one? @JinaAI_ just released a new method called late chunking that takes the same amount of storage space as naive chunking, but solves the problem of lost context similarly to ColBERT. You can implement it super easily with just a few extra lines in your embedding step! Blog: weaviate.io/blog/late-chun… Notebook: github.com/weaviate/recip… Thanks so much to @DanielW966 again for the awesome collaboration 💚 📄 Papers Late Chunking: arxiv.org/pdf/2409.04701 ColBERT: arxiv.org/pdf/2004.12832

English

411

32.3K

Saahil Ognawala retweetledi

Pinecone@pinecone·18 Eyl

@jinaai_ just launched jina-embeddings-v3 — featuring LoRA adapters, cost-efficient performance, and superior results compared to models 12x its size.

Jina AI@JinaAI_

English

1.2K

Saahil Ognawala retweetledi

Tech.eu@tech_eu·17 Eyl

Wikimedia, DataStax, and Jina AI launch semantic search for non-profit AI developers buff.ly/3TwC49g

Română

4.4K

Saahil Ognawala retweetledi

Felix@felix1987_·13 Eyl

Thanks to @NVIDIAAI for including our 𝕁𝕚𝕟𝕒-𝕣𝕖𝕣𝕒𝕟𝕜𝕖𝕣-𝕧𝟚-𝕞𝕦𝕝𝕥𝕚𝕝𝕚𝕟𝕘𝕦𝕒𝕝 in their recent benchmark on text retrieval for Q&A! It’s great to see more focus on reranker models in this benchmark, as there's still a lot to explore in this area.

Sumit@_reachsumit

Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG NVIDIA benchmarks ranking models for text retrieval in QA tasks introducing a new sota model NV-RerankQA-Mistral-4B-v3 📝arxiv.org/abs/2409.07691 👨🏽‍💻#nv-rerankqa-mistral-4b-v3" target="_blank" rel="nofollow noopener">build.nvidia.com/explore/retrie…

English

179

Keşfet

@JinaAI_ @seointrovert @jinaai_ @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates