Steven Van Vaerenbergh

916 posts

Steven Van Vaerenbergh banner
Steven Van Vaerenbergh

Steven Van Vaerenbergh

@steven2358

Researcher and lecturer at @unican. Occasional curator of lists.

Spain Katılım Şubat 2008
278 Takip Edilen305 Takipçiler
Steven Van Vaerenbergh
Steven Van Vaerenbergh@steven2358·
Which human bottleneck becomes dominant when cognition is cheap?
English
0
0
0
11
Andrej Karpathy
Andrej Karpathy@karpathy·
Wow, this tweet went very viral! I wanted share a possibly slightly improved version of the tweet in an "idea file". The idea of the idea file is that in this era of LLM agents, there is less of a point/need of sharing the specific code/app, you just share the idea, then the other person's agent customizes & builds it for your specific needs. So here's the idea in a gist format: gist.github.com/karpathy/442a6… You can give this to your agent and it can build you your own LLM wiki and guide you on how to use it etc. It's intentionally kept a little bit abstract/vague because there are so many directions to take this in. And ofc, people can adjust the idea or contribute their own in the Discussion which is cool.
Andrej Karpathy@karpathy

LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating knowledge (stored as markdown and images). The latest LLMs are quite good at it. So: Data ingest: I index source documents (articles, papers, repos, datasets, images, etc.) into a raw/ directory, then I use an LLM to incrementally "compile" a wiki, which is just a collection of .md files in a directory structure. The wiki includes summaries of all the data in raw/, backlinks, and then it categorizes data into concepts, writes articles for them, and links them all. To convert web articles into .md files I like to use the Obsidian Web Clipper extension, and then I also use a hotkey to download all the related images to local so that my LLM can easily reference them. IDE: I use Obsidian as the IDE "frontend" where I can view the raw data, the the compiled wiki, and the derived visualizations. Important to note that the LLM writes and maintains all of the data of the wiki, I rarely touch it directly. I've played with a few Obsidian plugins to render and view data in other ways (e.g. Marp for slides). Q&A: Where things get interesting is that once your wiki is big enough (e.g. mine on some recent research is ~100 articles and ~400K words), you can ask your LLM agent all kinds of complex questions against the wiki, and it will go off, research the answers, etc. I thought I had to reach for fancy RAG, but the LLM has been pretty good about auto-maintaining index files and brief summaries of all the documents and it reads all the important related data fairly easily at this ~small scale. Output: Instead of getting answers in text/terminal, I like to have it render markdown files for me, or slide shows (Marp format), or matplotlib images, all of which I then view again in Obsidian. You can imagine many other visual output formats depending on the query. Often, I end up "filing" the outputs back into the wiki to enhance it for further queries. So my own explorations and queries always "add up" in the knowledge base. Linting: I've run some LLM "health checks" over the wiki to e.g. find inconsistent data, impute missing data (with web searchers), find interesting connections for new article candidates, etc., to incrementally clean up the wiki and enhance its overall data integrity. The LLMs are quite good at suggesting further questions to ask and look into. Extra tools: I find myself developing additional tools to process the data, e.g. I vibe coded a small and naive search engine over the wiki, which I both use directly (in a web ui), but more often I want to hand it off to an LLM via CLI as a tool for larger queries. Further explorations: As the repo grows, the natural desire is to also think about synthetic data generation + finetuning to have your LLM "know" the data in its weights instead of just context windows. TLDR: raw data from a given number of sources is collected, then compiled by an LLM into a .md wiki, then operated on by various CLIs by the LLM to do Q&A and to incrementally enhance the wiki, and all of it viewable in Obsidian. You rarely ever write or edit the wiki manually, it's the domain of the LLM. I think there is room here for an incredible new product instead of a hacky collection of scripts.

English
1.1K
2.8K
26.7K
7.1M
Steven Van Vaerenbergh retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
It is hard to communicate how much programming has changed due to AI in the last 2 months: not gradually and over time in the "progress as usual" way, but specifically this last December. There are a number of asterisks but imo coding agents basically didn’t work before December and basically work since - the models have significantly higher quality, long-term coherence and tenacity and they can power through large and long tasks, well past enough that it is extremely disruptive to the default programming workflow. Just to give an example, over the weekend I was building a local video analysis dashboard for the cameras of my home so I wrote: “Here is the local IP and username/password of my DGX Spark. Log in, set up ssh keys, set up vLLM, download and bench Qwen3-VL, set up a server endpoint to inference videos, a basic web ui dashboard, test everything, set it up with systemd, record memory notes for yourself and write up a markdown report for me”. The agent went off for ~30 minutes, ran into multiple issues, researched solutions online, resolved them one by one, wrote the code, tested it, debugged it, set up the services, and came back with the report and it was just done. I didn’t touch anything. All of this could easily have been a weekend project just 3 months ago but today it’s something you kick off and forget about for 30 minutes. As a result, programming is becoming unrecognizable. You’re not typing computer code into an editor like the way things were since computers were invented, that era is over. You're spinning up AI agents, giving them tasks *in English* and managing and reviewing their work in parallel. The biggest prize is in figuring out how you can keep ascending the layers of abstraction to set up long-running orchestrator Claws with all of the right tools, memory and instructions that productively manage multiple parallel Code instances for you. The leverage achievable via top tier "agentic engineering" feels very high right now. It’s not perfect, it needs high-level direction, judgement, taste, oversight, iteration and hints and ideas. It works a lot better in some scenarios than others (e.g. especially for tasks that are well-specified and where you can verify/test functionality). The key is to build intuition to decompose the task just right to hand off the parts that work and help out around the edges. But imo, this is nowhere near "business as usual" time in software.
English
1.6K
4.7K
37.1K
5.1M
JMo
JMo@_GunRunn3r_·
@4HLsartorialist Prefer it to Saving Private Ryan. The camp attack scene and the colors make it pretty hypnotic, and the music.
English
1
0
1
974
Sam Bhagwat
Sam Bhagwat@calcsam·
We're giving away 1 MILLION AI agent books in 2026 last year we hit 100K. this year we're going 10x how do we even do that? giving away 520K books at conferences sending 255K to community events, 225K directly on X the mindset shift: treat logistics as a core capability we've now got five global print/fulfillment operations. more soon the best way to reach devs is to put something useful in their hands so if you're building with AI agents, comment BOOK. and if you have a community, comment BOX and we'll send you a bunch!
Sam Bhagwat tweet media
English
109
6
87
7.3K
ZohaibAi
ZohaibAi@ZohaibAi__sf·
Fine out the highest possible number. 99.9% will fail
ZohaibAi tweet media
English
10.8K
142
1K
869.9K
Sam Bhagwat
Sam Bhagwat@calcsam·
last month we wrote a new agents book: patterns for building ai agents it has everything you need to take your agents from prototype to production, like agent design patterns, the basics of security, etc reply to this tweet with BOOK and we'll dm you so you can get a copy
Sam Bhagwat tweet media
English
4.1K
449
5.1K
589.3K
Sam Bhagwat
Sam Bhagwat@calcsam·
icymi we wrote a new agents book: patterns for building ai agents it has everything you need to take your agents from prototype to production, like agent design patterns, the basics of security, etc reply to this tweet with BOOK and we'll dm you so you can get a copy
Sam Bhagwat tweet media
English
5.3K
741
8K
1.1M
Steven Van Vaerenbergh
Steven Van Vaerenbergh@steven2358·
“When unlimited power meets limited understanding.” A very decent read on how AI is detrimental to junior coders’ skills. The argument generalizes well to other fields: AI access should come *after* demonstrating mastery. finalroundai.com/blog/ai-vibe-c…
English
1
2
3
231
Aphix
Aphix@Aphixx·
@elonmusk Can somebody, anybody, please send me a link to a full coverage of the live TV airing of the moon landing, even for purchase, preferably including the commercials and uncut, through the full hour+? One would think this is super easy to find, alas I can find at best a 44 min cut.
English
7
4
22
12.6K
Steven Van Vaerenbergh
Steven Van Vaerenbergh@steven2358·
At some point, we’ll have true "vibe mathematics": you describe a problem in natural language, and the AI interprets, rigorously solves, and explains it. But current LLMs still hallucinate steps, so we’re not quite there yet.
Steven Van Vaerenbergh tweet mediaSteven Van Vaerenbergh tweet media
English
0
0
1
72