

Saahil Ognawala
3.3K posts

@saahil
Head of Product @JinaAI_ . Tweets about AI, ML, software, security and everything that is not those.





For every model we release, there are 3 ways of using it at scale: Jina API, CSP (e.g. SageMaker), self-hosted K8s. Here we evaluate latency, throughput, neighborhood problem, cost/token across three options to help you decide which one best suits you.👇jina.ai/news/a-practic…













Wikimedia, DataStax, and Jina AI launch semantic search for non-profit AI developers buff.ly/3TwC49g

Finally, jina-embeddings-v3 is here! A frontier multilingual embedding model with 570M parameters, 8192-token length, achieving SOTA performance on multilingual and long-context retrieval tasks. It outperforms the latest proprietary models from OpenAI and Cohere, and outperforms multilingual-e5-large-instruct across all multilingual tasks. In fact, as of today, jina-embeddings-v3 is the best multilingual model and ranks 2nd on the MTEB English leaderboard for models < 1B parameters.


Finally, jina-embeddings-v3 is here! A frontier multilingual embedding model with 570M parameters, 8192-token length, achieving SOTA performance on multilingual and long-context retrieval tasks. It outperforms the latest proprietary models from OpenAI and Cohere, and outperforms multilingual-e5-large-instruct across all multilingual tasks. In fact, as of today, jina-embeddings-v3 is the best multilingual model and ranks 2nd on the MTEB English leaderboard for models < 1B parameters.



Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG NVIDIA benchmarks ranking models for text retrieval in QA tasks introducing a new sota model NV-RerankQA-Mistral-4B-v3 📝arxiv.org/abs/2409.07691 👨🏽💻#nv-rerankqa-mistral-4b-v3" target="_blank" rel="nofollow noopener">build.nvidia.com/explore/retrie…