Fredy Sierra

1.7K posts

Fredy Sierra banner
Fredy Sierra

Fredy Sierra

@fysus

Good. Proud. I am scary space monster. You are leaky space blob.

Amsterdam, The Netherlands Katılım Temmuz 2009
581 Takip Edilen167 Takipçiler
Andrej Karpathy
Andrej Karpathy@karpathy·
LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating knowledge (stored as markdown and images). The latest LLMs are quite good at it. So: Data ingest: I index source documents (articles, papers, repos, datasets, images, etc.) into a raw/ directory, then I use an LLM to incrementally "compile" a wiki, which is just a collection of .md files in a directory structure. The wiki includes summaries of all the data in raw/, backlinks, and then it categorizes data into concepts, writes articles for them, and links them all. To convert web articles into .md files I like to use the Obsidian Web Clipper extension, and then I also use a hotkey to download all the related images to local so that my LLM can easily reference them. IDE: I use Obsidian as the IDE "frontend" where I can view the raw data, the the compiled wiki, and the derived visualizations. Important to note that the LLM writes and maintains all of the data of the wiki, I rarely touch it directly. I've played with a few Obsidian plugins to render and view data in other ways (e.g. Marp for slides). Q&A: Where things get interesting is that once your wiki is big enough (e.g. mine on some recent research is ~100 articles and ~400K words), you can ask your LLM agent all kinds of complex questions against the wiki, and it will go off, research the answers, etc. I thought I had to reach for fancy RAG, but the LLM has been pretty good about auto-maintaining index files and brief summaries of all the documents and it reads all the important related data fairly easily at this ~small scale. Output: Instead of getting answers in text/terminal, I like to have it render markdown files for me, or slide shows (Marp format), or matplotlib images, all of which I then view again in Obsidian. You can imagine many other visual output formats depending on the query. Often, I end up "filing" the outputs back into the wiki to enhance it for further queries. So my own explorations and queries always "add up" in the knowledge base. Linting: I've run some LLM "health checks" over the wiki to e.g. find inconsistent data, impute missing data (with web searchers), find interesting connections for new article candidates, etc., to incrementally clean up the wiki and enhance its overall data integrity. The LLMs are quite good at suggesting further questions to ask and look into. Extra tools: I find myself developing additional tools to process the data, e.g. I vibe coded a small and naive search engine over the wiki, which I both use directly (in a web ui), but more often I want to hand it off to an LLM via CLI as a tool for larger queries. Further explorations: As the repo grows, the natural desire is to also think about synthetic data generation + finetuning to have your LLM "know" the data in its weights instead of just context windows. TLDR: raw data from a given number of sources is collected, then compiled by an LLM into a .md wiki, then operated on by various CLIs by the LLM to do Q&A and to incrementally enhance the wiki, and all of it viewable in Obsidian. You rarely ever write or edit the wiki manually, it's the domain of the LLM. I think there is room here for an incredible new product instead of a hacky collection of scripts.
English
2.8K
7K
58.2K
20.8M
Fredy Sierra retweetledi
CaudalLabs
CaudalLabs@CaudalLabs·
Your AI chat is a dead-end hallway. Mine is a mind map. Branch. Explore. Merge. All on one canvas. github.com/caudal-labs/ca…
English
0
3
5
162
Fredy Sierra retweetledi
CaudalLabs
CaudalLabs@CaudalLabs·
Your agent needs more than memory. Recall ≠ Relevance Your agent can retrieve everything. It just can't tell what matters right now. 🚀 Open-source attention engine. Agents send events. Caudal learns what's relevant. Old context fades naturally. caudal-labs.com
CaudalLabs tweet media
English
0
1
1
55
Fredy Sierra
Fredy Sierra@fysus·
@boxmining Vector DBs answer "what's similar?" LLMs answer "what can I reason about?" Nobody answers "what should I focus on right now?" Until now. Caudal — open source working memory for AI agents. Looking for contributors. github.com/caudal-labs/ca…
English
0
0
0
27
Boxmining
Boxmining@boxmining·
Algorithm, please connect me with people passionate about: - AI - Vibe coding - Automation - Crypto investing/trading - Founders who are building
English
301
22
512
20K
Fredy Sierra
Fredy Sierra@fysus·
@AndrewYNg Vector DBs answer "what's similar?" LLMs answer "what can I reason about?" Nobody answers "what should I focus on right now?" Until now. Caudal — open source working memory for AI agents. Looking for contributors. github.com/caudal-labs/ca…
English
0
0
0
19
Andrew Ng
Andrew Ng@AndrewYNg·
I'm excited to announce Context Hub, an open tool that gives your coding agent the up-to-date API documentation it needs. Install it and prompt your agent to use it to fetch curated docs via a simple CLI. (See image.) Why this matters: Coding agents often use outdated APIs and hallucinate parameters. For example, when I ask Claude Code to call OpenAI's GPT-5.2, it uses the older chat completions API instead of the newer responses API, even though the newer one has been out for a year. Context Hub solves this. Context Hub is also designed to get smarter over time. Agents can annotate docs with notes — if your agent discovers a workaround, it can save it and doesn't have to rediscover it next session. Longer term, we're building toward agents sharing what they learn with each other, so the whole community benefits. Thanks Rohit Prsad and Xin Ye for working with me on this! npm install -g @aisuite/chub GitHub: github.com/andrewyng/cont…
Andrew Ng tweet media
English
301
728
5.4K
399.9K
Eleftheria Batsou
Eleftheria Batsou@BatsouElef·
Time to promote your product. 🚀 Share that URL!
English
609
6
252
25.8K
Marcell Havlik
Marcell Havlik@cviklihamar·
Dear @X algo, I am looking to connect with people interested in: - Builders - AI Tech - Full Stack - Hackers - Building in Public Say hi & let's grow together 👋
English
435
5
480
20.8K
Society AI
Society AI@SocietyAI_Labs·
Building an AI agent? Connect your agent to the Society AI network. Earn on every request. Auth, payments, interface. Handled. We’re onboarding the first batch of developers. Get early access.
English
7
29
263
1.6M
Aurimas Griciūnas
Aurimas Griciūnas@Aurimas_Gr·
A simple way to explain 𝗔𝗜 𝗔𝗴𝗲𝗻𝘁 𝗠𝗲𝗺𝗼𝗿𝘆. In general, the memory for an agent is something that we provide via context in the prompt passed to LLM that helps the agent to better plan and react given past interactions or data not immediately available. It is useful to group the memory into four types: 𝟭. Episodic - This type of memory contains past interactions and actions performed by the agent. After an action is taken, the application controlling the agent would store the action in some kind of persistent storage so that it can be retrieved later if needed. A good example would be using a vector Database to store semantic meaning of the interactions. 𝟮. Semantic - Any external information that is available to the agent and any knowledge the agent should have about itself. You can think of this as a context similar to one used in RAG applications. It can be internal knowledge only available to the agent or a grounding context to isolate part of the internet scale data for more accurate answers. 𝟯. Procedural - This is systemic information like the structure of the System Prompt, available tools, guardrails etc. It will usually be stored in Git, Prompt and Tool Registries. 𝟰. Occasionally, the agent application would pull information from long-term memory and store it locally if it is needed for the task at hand. 𝟱. All of the information pulled together from the long-term or stored in local memory is called short-term or working memory. Compiling all of it into a prompt will produce the prompt to be passed to the LLM and it will provide further actions to be taken by the system. We usually label 1. - 3. as Long-Term memory and 5. as Short-Term memory. A visual explanation of potential implementation details 👇 And that is it! The rest is all about how you architect the flow of your Agentic systems. What do you think about memory in AI Agents? #LLM #AI #MachineLearning Want to learn how to build an Agent from scratch without any LLM orchestration framework? Follow my journey here: newsletter.swirlai.com/p/building-ai-…
GIF
English
67
398
1.8K
142.4K
Nicholas Fabiano, MD
Nicholas Fabiano, MD@NTFabiano·
Creatine + guanidinoacetic acid raises muscle & brain creatine significantly more than creatine alone.
Nicholas Fabiano, MD tweet media
English
107
445
9.8K
1.6M
Kara
Kara@UTDKara·
New gen will never understand this picture
Kara tweet media
English
1.5K
10K
235.9K
22.2M
Fredy Sierra retweetledi
John A De Goes
John A De Goes@jdegoes·
Uncle Bob receives substantial criticism, some technically sound and well-informed. Yet much comes from young peacocks strutting. Respect your elders, kids—with luck, you'll achieve half of Bob's career success and live to see peacocks strut your way too.
English
23
12
240
23K
Fredy Sierra retweetledi
Johny G
Johny G@TheJohnyGo·
@Andercot Upon atmospheric reentry, residual Botox particles from the crew formed a synthetic polymerized sheath around the capsule, effectively deflecting thermal energy and eliminating the need for conventional heat shielding.
English
525
359
20.3K
774.8K
Fredy Sierra retweetledi
Charlie
Charlie@Carlodmension·
La generación que prefirió endeudarse con viajes y no con apartamentos.
Español
1.2K
5.8K
85.9K
8.9M
Daniel Samper Ospina
Daniel Samper Ospina@DanielSamperO·
¿Han visto programas internacionales en que se burlen de lo que hizo Berto esta semana? #Notidanny
Español
441
441
3.2K
287.4K
Fredy Sierra retweetledi
End Wokeness
End Wokeness@EndWokeness·
@AOC This you?
English
1.1K
2.7K
48.4K
629.5K