Haystack

634 posts

Haystack

@Haystack_AI

Open-source AI orchestration framework by @deepset_ai. Build context-engineered agents & RAG systems in Python. Discord for support → https://t.co/19wuHcilYP

Bergabung Ağustos 2023

49 Mengikuti2K Pengikut

Tweet Disematkan

Haystack@Haystack_AI·19 Ara

One Name. One Product Family. One Look 💙 We’re unifying the Haystack ecosystem at @deepset_ai under one name and a new logo, reflecting its role as a framework, a community, and the foundation of our enterprise platform. 👉 Read the announcement: haystack.deepset.ai/blog/announcin…

English

927

Haystack@Haystack_AI·2d

Full release notes 👇 haystack.deepset.ai/release-notes/…

English

Haystack@Haystack_AI·2d

🚀 Haystack 2.26 is out! Here are a few highlights: 🤖 More flexible Agents with dynamic system prompts You can now use Jinja2 templating in `system_prompt` to inject runtime variables and dynamically adapt behavior, such as changing language or tone, or adding conditional logic, without redefining prompts. 🔍 LLMRanker for smarter reranking LLMRanker uses an LLM to reason about relevance rather than rely on similarity scores (like cross-encoders). This leads to better results for complex queries and helps provide higher-quality context for RAG and agent workflows while keeping context windows lean. 🧠 Discover supported models programmatically Chat generators now expose SUPPORTED_MODELS variable, so you can easily see which models are supported. 💙 Big thanks to our contributors to this release!

English

213

Haystack me-retweet

Stefano Fiorucci@theanakin87·4d

Just published a notebook on @GeminiApp Cookbook w/ @bilgeycl 📓 Cross-modal Retrieval with Gemini Embedding 2 and @Haystack_AI: github.com/google-gemini/… 🔎 Several examples of search: text→text text→any (PDF included) image→text search Thx @patloeber! @googleaidevs

English

431

Haystack me-retweet

Bilge@bilgeycl·11 Mar

📚 It had been a while since we published a technical blog post, and Gemini Embedding 2 from @GoogleDeepMind team felt like a good reason to write one together with @theanakin87 💙 It's Google's first fully multimodal embedding model, meaning you can embed text, images, audio, video, and PDFs into a single shared vector space. What’s exciting about multimodal embedding models is that they enable cross-modal retrieval. For example, you can search a product catalog with a text query like “grey running shoes with white soles” 👟, or upload a photo of a monument to retrieve relevant articles describing it 🗽 In the post, we show how to use Gemini Embedding 2 with @Haystack_AI for both text and multimodal data, supported from day 0 🎉 Blog: haystack.deepset.ai/blog/multimoda…

English

2.3K

Haystack me-retweet

Google AI Developers@googleaidevs·10 Mar

Start building with Gemini Embedding 2, our most capable and first fully multimodal embedding model built on the Gemini architecture. Now available in preview via the Gemini API and in Vertex AI.

English

343

2.7K

401.5K

Haystack@Haystack_AI·10 Mar

🚀 @GoogleDeepMind just launched Gemini Embedding 2 today and Haystack supports it from Day 0. Embed text, images, video, audio, and PDFs into one vector space and build multimodal search or cross-modal retrieval systems. Learn how in our latest blog post 👇 haystack.deepset.ai/blog/multimoda…

English

737

Haystack@Haystack_AI·5 Mar

When @AIatMeta (Facebook AI back then) released it in 2017, most people hadn't even considered that you could search across millions of vectors in milliseconds. FAISS made it real, and the wave of vector databases that followed owes a lot to what it proved was possible. Years later, it is still standing. When your project doesn't need a full-blown vector database - think local development, lightweight deployments, or small to medium datasets - FAISS is often the smartest choice. Fast, dependency-free, and battle-hardened by nearly a decade of production use. We've now integrated it into Haystack as a Document Store. Drop in FAISSDocumentStore for in-memory or persistent vector search, and pair it with FAISSEmbeddingRetriever to plug directly into your pipelines and agents. GPU support included. 🐍 pip install faiss-haystack 🔗 Documentation: haystack.deepset.ai/integrations/f…

English

169

Haystack@Haystack_AI·3 Mar

Duplicates and near-duplicates are common in Retrieval Augmented Generation. Even if you try not to repeat yourself, your documents will often contain paraphrases of the same information, which adds no value for the LLM and only clutters your context. This increases token usage costs and can push out useful results that did not rank as highly. Diversification reranks results to keep the most unique ones while preserving relevance. We didn’t want to reinvent the wheel, so we integrated directly with pyversity by @thomas_v_dongen, a small yet comprehensive Python library that provides many diversification algorithms out of the box, including MMR and DPP. It is now integrated into Haystack as a ranker component that fits into your existing pipelines and agents! 🐍 pip install pyversity-haystack 🔗 Documentation: haystack.deepset.ai/integrations/p…

English

171

Haystack me-retweet

Kacper Łukawski@LukawskiKacper·3 Mar

Didn't have to go far to find a great solution - @thomas_v_dongen's pyversity covers an impressive range of algorithms out of the box. Huge props for building something that thorough! Now fully integrated with @Haystack_AI haystack.deepset.ai/integrations/p…

English

192

Haystack@Haystack_AI·2 Mar

@YaoWu91252427 @deepset_ai Thank you for your contributions to Haystack! Keep them coming 🙌

English

Barbara Wu@YaoWu91252427·1 Mar

5 PRs merged into @deepset_ai last month — from Anthropic thinking content support to hyperlink extraction in PPTX/XLSX converters. Love this community.

English

Haystack@Haystack_AI·2 Mar

haystack.deepset.ai/release-notes/…

ZXX

124

Haystack@Haystack_AI·2 Mar

🚀 Haystack 2.25 is here! This release focuses on improving the user experience of the Agent component and making tool selection more scalable and efficient. 🛠️ Dynamic Tool Discovery with SearchableToolset With SearchableToolset, agents dynamically discover relevant tools using BM25 keyword search instead of exposing hundreds of tools to the LLM upfront. This approach: – Reduces LLM context usage (and token costs) – Improves tool selection accuracy – Works especially well when connecting to MCP servers Agents load only the tools they actually need, making large tool ecosystems practical without overwhelming the model. 📝 Reusable Jinja2 Prompt Templates for Agents Agents now natively support templated` user_prompt` and `required_variables`, simplifying how you structure and invoke agents. With this update, you can define prompts once and pass variables at runtime. This means: – Less boilerplate – Cleaner agent invocation – More reusable and maintainable agent workflows 💙 Big thanks to our contributors to this release! 👇 Full release notes in the next post Explore the release and let us know what you build with it 💬

English

552

Haystack@Haystack_AI·17 Şub

🧪 Haystack & @mem0ai integration We've added @mem0ai support to haystack-experimental, bringing persistent memory capabilities to your pipelines. Mem0 allows your Haystack applications to maintain memory across sessions - storing user preferences, and extracted facts that persist beyond individual pipeline runs. This enables truly stateful AI systems that learn and adapt over time. 🔍 Key capabilities: - Cross-session memory persistence for agents and chatbots - User and session-level context management - Seamless integration with Haystack's agents This integration is experimental - we're looking for early adopters to test it and share feedback! 📖 Cookbook: haystack.deepset.ai/cookbook/memor…

English

309

Haystack@Haystack_AI·12 Şub

haystack.deepset.ai/release-notes/…

ZXX

106

Haystack@Haystack_AI·12 Şub

Haystack 2.24 is live 🚀 Here are the highlights: 🔌 Simpler Pipelines New improvements in the Pipeline logic simplify the component connections, eliminating the need for OutputAdapter and joiners in many common setups, such as query rewriting and hybrid retrieval. This results in cleaner and shorter pipelines even for complex use cases. 🏁 Rankers handles duplicate documents Rankers now deduplicate documents on top of ranking, enabling cleaner hybrid retrieval pipelines with less components. 📃 File Support in Chat Generators You can now attach files (e.g., PDFs) to ChatMessage using the new FileContent dataclass when working with OpenAI and Azure chat generators (Responses API). This enables smoother document-based prompting in deterministic flows and agentic settings. 💙 Big thanks to our contributors to this releaseH 👇 Full release notes in the comments

English

445

Haystack@Haystack_AI·10 Şub

📃@PaddlePaddle released a new VLM: PaddleOCR-VL-1.5 (0.9B) and you can already use it in your document-heavy Haystack pipelines. PaddleOCR-VL-1.5 goes beyond classic OCR: it understands document layout and structure, extracting tables, formulas, charts, and key elements from messy real-world PDFs and images. This makes it a powerful building block for reliable RAG pipelines and document-centric AI applications. For teams building AI systems for reasoning over complex documents, this enables more accurate retrieval, grounding, and reasoning across document structure. 🔍 Why it’s exciting: - 94.5% accuracy on OmniDocBench v1.5 - Irregular-shaped localization for real-world documents (skew, warp, photos) - Strong improvements in table, formula, and text spotting - Multilingual support, including rare scripts and complex layouts 🔗 Model: huggingface.co/PaddlePaddle/P… 🔗 Docs: haystack.deepset.ai/integrations/p…

English

1.2K

Haystack@Haystack_AI·28 Oca

🚀 Haystack 2.23 is out! This release focuses on making agents safer, more visual, and components easier to customize with fewer custom hooks and less boilerplate. Here are the highlights: 🔄 Human-in-the-Loop for Agents Agents can now pause for human confirmation before executing tools. You can define confirmation behavior per tool (always ask, ask once, or never ask), ideal for workflows with sensitive operations. 🖼️ Image Support for Tool Results Tools can now return images alongside text. Agents can retrieve and reason over images, unlocking multimodal workflows like visual search, image inspection, and richer tool outputs. 🧩 Simpler Serialization for Custom Components Custom components now serialize and deserialize automatically in most cases, even when they include complex objects as parameters. This means less boilerplate, easier pipeline snapshots for recovery, and simpler custom component API. 💙 Big thanks to our contributors for the release 👉 Full release notes: haystack.deepset.ai/release-notes/…

English

480

Haystack@Haystack_AI·16 Oca

@bilgeycl @qdrant_engine 🚀🚀

QME

Bilge@bilgeycl·16 Oca

Oh wow, this is literally what I built with @Haystack_AI and @qdrant_engine 😄 A movie recommendation agent that understands intent and creates the right search queries + metadata

omo@sebiomo_

We assumed users would search for movie titles. We were wrong. The gap between how we store data and how users search for it is often where products fail. Our database was structured for exact title matches, but our FilmFlux users searched with intent. They typed "Mercy Johnson movies" or "2024 romantic comedy." Our SQL pattern matching returned zero results. The content existed, but the logic couldn't find it. To fix this, we architected a hybrid search system using Supabase Edge Functions. We found that pure semantic search fails on specific keywords, while pure keyword search fails on context. We needed both. The implementation combines four layers: - Semantic Understanding: OpenAI embeddings to match "vibe" and plot. - Intent Detection: Logic to parse natural language filters (years, genres). - Fuzzy Matching: pg_trgm to handle user typos. - Recency Weighting: Custom ranking to surface new hits, not obscure back-catalog titles. The result is a search bar that handles typos, understands context, and respects popularity. Don't force users to speak the database's language. Build the infrastructure to understand theirs. #SoftwareEngineering #SystemDesign #Supabase #SearchArchitecture

English

1.6K

Haystack@Haystack_AI·12 Oca

🚀 Haystack v2.22 is out! This release brings smarter doc chunking, improved component behavior, and easier tool-to-LLM integration with fewer manual steps. Highlights ✂️ New `EmbeddingBasedDocumentSplitter` for smarter document chunking 🛠️ `outputs_to_string` lets tools return multiple strings with ease 🔥 Components now auto-trigger `warm_up()` on first run 🐍 Python 3.10+ is now required Big thanks to our contributors 💙 Full release notes👇 🔗 haystack.deepset.ai/release-notes/…

English

796

Jelajahi

@GeminiApp @bilgeycl @patloeber @googleaidevs @GoogleDeepMind @theanakin87 @AIatMeta @thomas_v_dongen