Fredy Sierra

1.7K posts

Fredy Sierra

@fysus

Good. Proud. I am scary space monster. You are leaky space blob.

Amsterdam, The Netherlands Katılım Temmuz 2009

581 Takip Edilen167 Takipçiler

Fredy Sierra@fysus·24 Nis

The new release of github.com/caudal-labs/ca… supports image attachments in chat messages

English

Fredy Sierra@fysus·3 Nis

@karpathy github.com/caudal-labs/ca…

QME

Andrej Karpathy@karpathy·2 Nis

LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating knowledge (stored as markdown and images). The latest LLMs are quite good at it. So: Data ingest: I index source documents (articles, papers, repos, datasets, images, etc.) into a raw/ directory, then I use an LLM to incrementally "compile" a wiki, which is just a collection of .md files in a directory structure. The wiki includes summaries of all the data in raw/, backlinks, and then it categorizes data into concepts, writes articles for them, and links them all. To convert web articles into .md files I like to use the Obsidian Web Clipper extension, and then I also use a hotkey to download all the related images to local so that my LLM can easily reference them. IDE: I use Obsidian as the IDE "frontend" where I can view the raw data, the the compiled wiki, and the derived visualizations. Important to note that the LLM writes and maintains all of the data of the wiki, I rarely touch it directly. I've played with a few Obsidian plugins to render and view data in other ways (e.g. Marp for slides). Q&A: Where things get interesting is that once your wiki is big enough (e.g. mine on some recent research is ~100 articles and ~400K words), you can ask your LLM agent all kinds of complex questions against the wiki, and it will go off, research the answers, etc. I thought I had to reach for fancy RAG, but the LLM has been pretty good about auto-maintaining index files and brief summaries of all the documents and it reads all the important related data fairly easily at this ~small scale. Output: Instead of getting answers in text/terminal, I like to have it render markdown files for me, or slide shows (Marp format), or matplotlib images, all of which I then view again in Obsidian. You can imagine many other visual output formats depending on the query. Often, I end up "filing" the outputs back into the wiki to enhance it for further queries. So my own explorations and queries always "add up" in the knowledge base. Linting: I've run some LLM "health checks" over the wiki to e.g. find inconsistent data, impute missing data (with web searchers), find interesting connections for new article candidates, etc., to incrementally clean up the wiki and enhance its overall data integrity. The LLMs are quite good at suggesting further questions to ask and look into. Extra tools: I find myself developing additional tools to process the data, e.g. I vibe coded a small and naive search engine over the wiki, which I both use directly (in a web ui), but more often I want to hand it off to an LLM via CLI as a tool for larger queries. Further explorations: As the repo grows, the natural desire is to also think about synthetic data generation + finetuning to have your LLM "know" the data in its weights instead of just context windows. TLDR: raw data from a given number of sources is collected, then compiled by an LLM into a .md wiki, then operated on by various CLIs by the LLM to do Q&A and to incrementally enhance the wiki, and all of it viewable in Obsidian. You rarely ever write or edit the wiki manually, it's the domain of the LLM. I think there is room here for an incredible new product instead of a hacky collection of scripts.

English

2.8K

58.2K

20.8M

Fredy Sierra retweetledi

CaudalLabs@CaudalLabs·29 Mar

Your AI chat is a dead-end hallway. Mine is a mind map. Branch. Explore. Merge. All on one canvas. github.com/caudal-labs/ca…

English

162

Fredy Sierra retweetledi

CaudalLabs@CaudalLabs·13 Mar

Your agent needs more than memory. Recall ≠ Relevance Your agent can retrieve everything. It just can't tell what matters right now. 🚀 Open-source attention engine. Agents send events. Caudal learns what's relevant. Old context fades naturally. caudal-labs.com

English

Fredy Sierra@fysus·10 Mar

@boxmining Vector DBs answer "what's similar?" LLMs answer "what can I reason about?" Nobody answers "what should I focus on right now?" Until now. Caudal — open source working memory for AI agents. Looking for contributors. github.com/caudal-labs/ca…

English

Boxmining@boxmining·10 Mar

Algorithm, please connect me with people passionate about: - AI - Vibe coding - Automation - Crypto investing/trading - Founders who are building

English

301

512

20K

Fredy Sierra@fysus·9 Mar

@AndrewYNg Vector DBs answer "what's similar?" LLMs answer "what can I reason about?" Nobody answers "what should I focus on right now?" Until now. Caudal — open source working memory for AI agents. Looking for contributors. github.com/caudal-labs/ca…

English

Andrew Ng@AndrewYNg·9 Mar

I'm excited to announce Context Hub, an open tool that gives your coding agent the up-to-date API documentation it needs. Install it and prompt your agent to use it to fetch curated docs via a simple CLI. (See image.) Why this matters: Coding agents often use outdated APIs and hallucinate parameters. For example, when I ask Claude Code to call OpenAI's GPT-5.2, it uses the older chat completions API instead of the newer responses API, even though the newer one has been out for a year. Context Hub solves this. Context Hub is also designed to get smarter over time. Agents can annotate docs with notes — if your agent discovers a workaround, it can save it and doesn't have to rediscover it next session. Longer term, we're building toward agents sharing what they learn with each other, so the whole community benefits. Thanks Rohit Prsad and Xin Ye for working with me on this! npm install -g @aisuite/chub GitHub: github.com/andrewyng/cont…

English

301

728

5.4K

399.9K

Fredy Sierra@fysus·4 Mar

@BatsouElef Caudal is a temporal relevance memory for AI agents. It tells an agent what matters right now. github.com/caudal-labs/ca…

English

Eleftheria Batsou@BatsouElef·3 Mar

Time to promote your product. 🚀 Share that URL!

English

609

252

25.8K

Fredy Sierra@fysus·4 Mar

@cviklihamar @X Hi, I am working on this project: Caudal is a temporal relevance memory for AI agents. It tells an agent what matters right now. github.com/caudal-labs/ca… . Let's connect!

English

Marcell Havlik@cviklihamar·3 Mar

Dear @X algo, I am looking to connect with people interested in: - Builders - AI Tech - Full Stack - Hackers - Building in Public Say hi & let's grow together 👋

English

435

480

20.8K

Fredy Sierra@fysus·4 Mar

@SocietyAI_Labs Caudal is a temporal relevance memory for AI agents. It tells an agent what matters right now. github.com/caudal-labs/ca…

English

370

Society AI@SocietyAI_Labs·15 Şub

Building an AI agent? Connect your agent to the Society AI network. Earn on every request. Auth, payments, interface. Handled. We’re onboarding the first batch of developers. Get early access.

English

263

1.6M

Fredy Sierra@fysus·4 Mar

@Aurimas_Gr Caudal is a temporal relevance memory for AI agents. It tells an agent what matters right now. github.com/caudal-labs/ca…

English

Aurimas Griciūnas@Aurimas_Gr·13 Oca

A simple way to explain 𝗔𝗜 𝗔𝗴𝗲𝗻𝘁 𝗠𝗲𝗺𝗼𝗿𝘆. In general, the memory for an agent is something that we provide via context in the prompt passed to LLM that helps the agent to better plan and react given past interactions or data not immediately available. It is useful to group the memory into four types: 𝟭. Episodic - This type of memory contains past interactions and actions performed by the agent. After an action is taken, the application controlling the agent would store the action in some kind of persistent storage so that it can be retrieved later if needed. A good example would be using a vector Database to store semantic meaning of the interactions. 𝟮. Semantic - Any external information that is available to the agent and any knowledge the agent should have about itself. You can think of this as a context similar to one used in RAG applications. It can be internal knowledge only available to the agent or a grounding context to isolate part of the internet scale data for more accurate answers. 𝟯. Procedural - This is systemic information like the structure of the System Prompt, available tools, guardrails etc. It will usually be stored in Git, Prompt and Tool Registries. 𝟰. Occasionally, the agent application would pull information from long-term memory and store it locally if it is needed for the task at hand. 𝟱. All of the information pulled together from the long-term or stored in local memory is called short-term or working memory. Compiling all of it into a prompt will produce the prompt to be passed to the LLM and it will provide further actions to be taken by the system. We usually label 1. - 3. as Long-Term memory and 5. as Short-Term memory. A visual explanation of potential implementation details 👇 And that is it! The rest is all about how you architect the flow of your Agentic systems. What do you think about memory in AI Agents? #LLM #AI #MachineLearning Want to learn how to build an Agent from scratch without any LLM orchestration framework? Follow my journey here: newsletter.swirlai.com/p/building-ai-…

GIF

English

398

1.8K

142.4K

Fredy Sierra@fysus·4 Mar

Caudal is a temporal relevance memory for AI agents. It tells an agent what matters right now. github.com/caudal-labs/ca…

English

Fredy Sierra@fysus·11 Ağu

@NTFabiano @grok what is the impact of creatine in the kidneys?

English

Nicholas Fabiano, MD@NTFabiano·10 Ağu

Creatine + guanidinoacetic acid raises muscle & brain creatine significantly more than creatine alone.

English

107

445

9.8K

1.6M

Fredy Sierra retweetledi

Geiger Capital@Geiger_Capital·24 Tem

Yea… American Eagle is about to go vertical. $AEO x.com/sweeneydailyx/…

English

605

826

38.8K

5.2M

Fredy Sierra@fysus·8 Tem

@UTDKara @grok explain this

English

495

Kara@UTDKara·7 Tem

New gen will never understand this picture

English

1.5K

10K

235.9K

22.2M

Fredy Sierra retweetledi

John A De Goes@jdegoes·2 May

Uncle Bob receives substantial criticism, some technically sound and well-informed. Yet much comes from young peacocks strutting. Respect your elders, kids—with luck, you'll achieve half of Bob's career success and live to see peacocks strut your way too.

English

240

23K

Fredy Sierra retweetledi

Johny G@TheJohnyGo·16 Nis

@Andercot Upon atmospheric reentry, residual Botox particles from the crew formed a synthetic polymerized sheath around the capsule, effectively deflecting thermal energy and eliminating the need for conventional heat shielding.

English

525

359

20.3K

774.8K

Fredy Sierra retweetledi

Elon Musk@elonmusk·6 Şub

🤡

Disclose.tv@disclosetv

NEW - Lego can be "anti-LGBT" because the bricks have "male or female parts" that are made to "mate" with each other, says Science Museum in London — Telegraph

ART

15.8K

15K

169.6K

33.2M

Fredy Sierra retweetledi