Steven Van Vaerenbergh

916 posts

Steven Van Vaerenbergh

@steven2358

Researcher and lecturer at @unican. Occasional curator of lists.

Spain Katılım Şubat 2008

278 Takip Edilen305 Takipçiler

Steven Van Vaerenbergh@steven2358·6d

Which human bottleneck becomes dominant when cognition is cheap?

English

Steven Van Vaerenbergh@steven2358·5 May

@FigureLabs_AI One of my students showed me your product. It looks good—I've added it to my list at github.com/steven2358/awe…

English

Steven Van Vaerenbergh@steven2358·5 Nis

@karpathy Excellent. Also, this is what formulating algorithms in pseudocode has always been about.

English

Andrej Karpathy@karpathy·4 Nis

Wow, this tweet went very viral! I wanted share a possibly slightly improved version of the tweet in an "idea file". The idea of the idea file is that in this era of LLM agents, there is less of a point/need of sharing the specific code/app, you just share the idea, then the other person's agent customizes & builds it for your specific needs. So here's the idea in a gist format: gist.github.com/karpathy/442a6… You can give this to your agent and it can build you your own LLM wiki and guide you on how to use it etc. It's intentionally kept a little bit abstract/vague because there are so many directions to take this in. And ofc, people can adjust the idea or contribute their own in the Discussion which is cool.

Andrej Karpathy@karpathy

LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating knowledge (stored as markdown and images). The latest LLMs are quite good at it. So: Data ingest: I index source documents (articles, papers, repos, datasets, images, etc.) into a raw/ directory, then I use an LLM to incrementally "compile" a wiki, which is just a collection of .md files in a directory structure. The wiki includes summaries of all the data in raw/, backlinks, and then it categorizes data into concepts, writes articles for them, and links them all. To convert web articles into .md files I like to use the Obsidian Web Clipper extension, and then I also use a hotkey to download all the related images to local so that my LLM can easily reference them. IDE: I use Obsidian as the IDE "frontend" where I can view the raw data, the the compiled wiki, and the derived visualizations. Important to note that the LLM writes and maintains all of the data of the wiki, I rarely touch it directly. I've played with a few Obsidian plugins to render and view data in other ways (e.g. Marp for slides). Q&A: Where things get interesting is that once your wiki is big enough (e.g. mine on some recent research is ~100 articles and ~400K words), you can ask your LLM agent all kinds of complex questions against the wiki, and it will go off, research the answers, etc. I thought I had to reach for fancy RAG, but the LLM has been pretty good about auto-maintaining index files and brief summaries of all the documents and it reads all the important related data fairly easily at this ~small scale. Output: Instead of getting answers in text/terminal, I like to have it render markdown files for me, or slide shows (Marp format), or matplotlib images, all of which I then view again in Obsidian. You can imagine many other visual output formats depending on the query. Often, I end up "filing" the outputs back into the wiki to enhance it for further queries. So my own explorations and queries always "add up" in the knowledge base. Linting: I've run some LLM "health checks" over the wiki to e.g. find inconsistent data, impute missing data (with web searchers), find interesting connections for new article candidates, etc., to incrementally clean up the wiki and enhance its overall data integrity. The LLMs are quite good at suggesting further questions to ask and look into. Extra tools: I find myself developing additional tools to process the data, e.g. I vibe coded a small and naive search engine over the wiki, which I both use directly (in a web ui), but more often I want to hand it off to an LLM via CLI as a tool for larger queries. Further explorations: As the repo grows, the natural desire is to also think about synthetic data generation + finetuning to have your LLM "know" the data in its weights instead of just context windows. TLDR: raw data from a given number of sources is collected, then compiled by an LLM into a .md wiki, then operated on by various CLIs by the LLM to do Q&A and to incrementally enhance the wiki, and all of it viewable in Obsidian. You rarely ever write or edit the wiki manually, it's the domain of the LLM. I think there is room here for an incredible new product instead of a hacky collection of scripts.

English

1.1K

2.8K

26.7K

7.1M

Steven Van Vaerenbergh retweetledi

Andrej Karpathy@karpathy·25 Şub

It is hard to communicate how much programming has changed due to AI in the last 2 months: not gradually and over time in the "progress as usual" way, but specifically this last December. There are a number of asterisks but imo coding agents basically didn’t work before December and basically work since - the models have significantly higher quality, long-term coherence and tenacity and they can power through large and long tasks, well past enough that it is extremely disruptive to the default programming workflow. Just to give an example, over the weekend I was building a local video analysis dashboard for the cameras of my home so I wrote: “Here is the local IP and username/password of my DGX Spark. Log in, set up ssh keys, set up vLLM, download and bench Qwen3-VL, set up a server endpoint to inference videos, a basic web ui dashboard, test everything, set it up with systemd, record memory notes for yourself and write up a markdown report for me”. The agent went off for ~30 minutes, ran into multiple issues, researched solutions online, resolved them one by one, wrote the code, tested it, debugged it, set up the services, and came back with the report and it was just done. I didn’t touch anything. All of this could easily have been a weekend project just 3 months ago but today it’s something you kick off and forget about for 30 minutes. As a result, programming is becoming unrecognizable. You’re not typing computer code into an editor like the way things were since computers were invented, that era is over. You're spinning up AI agents, giving them tasks *in English* and managing and reviewing their work in parallel. The biggest prize is in figuring out how you can keep ascending the layers of abstraction to set up long-running orchestrator Claws with all of the right tools, memory and instructions that productively manage multiple parallel Code instances for you. The leverage achievable via top tier "agentic engineering" feels very high right now. It’s not perfect, it needs high-level direction, judgement, taste, oversight, iteration and hints and ideas. It works a lot better in some scenarios than others (e.g. especially for tasks that are well-specified and where you can verify/test functionality). The key is to build intuition to decompose the task just right to hand off the parts that work and help out around the edges. But imo, this is nowhere near "business as usual" time in software.

English

1.6K

4.7K

37.1K

5.1M

Steven Van Vaerenbergh@steven2358·31 Oca

In hindsight, we could have seen this coming. #moltbook

English

Steven Van Vaerenbergh@steven2358·23 Oca

@_GunRunn3r_ @4HLsartorialist The whole soundtrack is phenomenal.

English

JMo@_GunRunn3r_·23 Oca

@4HLsartorialist Prefer it to Saving Private Ryan. The camp attack scene and the colors make it pretty hypnotic, and the music.

English

974

Postmanbloke 🦅🇺🇲@4HLsartorialist·22 Oca

This film is so good I really can't say enough about it

English

296

175

4.5K

973.7K

Steven Van Vaerenbergh@steven2358·14 Oca

@calcsam BOOK

Português

Sam Bhagwat@calcsam·13 Oca

We're giving away 1 MILLION AI agent books in 2026 last year we hit 100K. this year we're going 10x how do we even do that? giving away 520K books at conferences sending 255K to community events, 225K directly on X the mindset shift: treat logistics as a core capability we've now got five global print/fulfillment operations. more soon the best way to reach devs is to put something useful in their hands so if you're building with AI agents, comment BOOK. and if you have a community, comment BOX and we'll send you a bunch!

English

109

7.3K

Steven Van Vaerenbergh@steven2358·1 Oca

@Jindo_1307979 And the classical one (no exponents): 811051

English

Steven Van Vaerenbergh@steven2358·1 Oca

@Jindo_1307979 Another option: 60E8

English

ZohaibAi@ZohaibAi__sf·1 Oca

Fine out the highest possible number. 99.9% will fail

English

10.8K

142

869.9K

Steven Van Vaerenbergh@steven2358·24 Ara

@calcsam BOOK

Português

Sam Bhagwat@calcsam·22 Ara

last month we wrote a new agents book: patterns for building ai agents it has everything you need to take your agents from prototype to production, like agent design patterns, the basics of security, etc reply to this tweet with BOOK and we'll dm you so you can get a copy

English

4.1K

449

5.1K

589.3K

Steven Van Vaerenbergh@steven2358·6 Ara

@giffmana This should be required reading for all PhD students in the field.

English

Lucas Beyer (bl16)@giffmana·5 Ara

I knew *exactly* which paper she means before even opening the link. Do you?

GIF

Anastasia Marchenkova@amarchenkova

We should all write papers like this: arxiv.org/pdf/1804.02767

English

8.4K

4.1M

Steven Van Vaerenbergh@steven2358·11 Kas

@calcsam BOOK

Português

Sam Bhagwat@calcsam·10 Kas

icymi we wrote a new agents book: patterns for building ai agents it has everything you need to take your agents from prototype to production, like agent design patterns, the basics of security, etc reply to this tweet with BOOK and we'll dm you so you can get a copy

English

5.3K

741

1.1M

Steven Van Vaerenbergh@steven2358·17 Tem

“When unlimited power meets limited understanding.” A very decent read on how AI is detrimental to junior coders’ skills. The argument generalizes well to other fields: AI access should come *after* demonstrating mastery. finalroundai.com/blog/ai-vibe-c…

English

231

Steven Van Vaerenbergh retweetledi

Luis J. Rodríguez Muñiz@ljrguezmuniz·6 Tem

Hoy en @elperiodico hablando de Educación Matemática con motivo del inminente @C2EM2025 elperiodico.com/es/sociedad/20…

Español

5.3K

Steven Van Vaerenbergh@steven2358·24 Nis

@aphixx @elonmusk Does this work? youtu.be/S9HdPi9Ikhk

YouTube

English

189

Aphix@Aphixx·24 Nis

@elonmusk Can somebody, anybody, please send me a link to a full coverage of the live TV airing of the moon landing, even for purchase, preferably including the commercials and uncut, through the full hour+? One would think this is super easy to find, alas I can find at best a 44 min cut.

English

12.6K

Elon Musk@elonmusk·23 Nis

Only 66 years from first flight to landing on the Moon in 1969. Here we are, 76 years later cannot yet return to the Moon. We should have been on Mars by now.

Massimo@Rainmaker1973

These two photographs are separated by only 66 years.

English

37.8K

36.1K

335.6K

109M

Steven Van Vaerenbergh@steven2358·16 Nis

At some point, we’ll have true "vibe mathematics": you describe a problem in natural language, and the AI interprets, rigorously solves, and explains it. But current LLMs still hallucinate steps, so we’re not quite there yet.

English

Steven Van Vaerenbergh@steven2358·16 Nis

In maths, "vibe calculating" has been around for a while. You write the operations and get the result from the calculator, without worrying about their correctness. Compared to LLMs, the correctness is pretty much guaranteed here.

Andrej Karpathy@karpathy

There's a new kind of coding I call "vibe coding", where you fully give in to the vibes, embrace exponentials, and forget that the code even exists. It's possible because the LLMs (e.g. Cursor Composer w Sonnet) are getting too good. Also I just talk to Composer with SuperWhisper so I barely even touch the keyboard. I ask for the dumbest things like "decrease the padding on the sidebar by half" because I'm too lazy to find it. I "Accept All" always, I don't read the diffs anymore. When I get error messages I just copy paste them in with no comment, usually that fixes it. The code grows beyond my usual comprehension, I'd have to really read through it for a while. Sometimes the LLMs can't fix a bug so I just work around it or ask for random changes until it goes away. It's not too bad for throwaway weekend projects, but still quite amusing. I'm building a project or webapp, but it's not really coding - I just see stuff, say stuff, run stuff, and copy paste stuff, and it mostly works.

English

118

Steven Van Vaerenbergh@steven2358·3 Nis

@AndrewHallFF @KushDesai47 So it may seem. x.com/krishnanrohit/…

rohit@krishnanrohit

This might be the first large-scale application of AI technology to geopolitics.. 4o, o3 high, Gemini 2.5 pro, Claude 3.7, Grok all give the same answer to the question on how to impose tariffs easily.

English

208

Andrew Hall@AndrewHallFF·3 Nis

@KushDesai47 All kidding and snark aside, is this from Chat GPT?

English

9.1K

Kush Desai@KushDesai47·3 Nis

No we literally calculated tariff and non tariff barriers. ustr.gov/issue-areas/re…

James Surowiecki@JamesSurowiecki

Just figured out where these fake tariff rates come from. They didn't actually calculate tariff rates + non-tariff barriers, as they say they did. Instead, for every country, they just took our trade deficit with that country and divided it by the country's exports to us. So we have a $17.9 billion trade deficit with Indonesia. Its exports to us are $28 billion. $17.9/$28 = 64%, which Trump claims is the tariff rate Indonesia charges us. What extraordinary nonsense this is.

English

2.2K

588

5.7K

6.9M

Steven Van Vaerenbergh@steven2358·28 Mar

4o image generation is wild. But... that's not a prism, it's a pyramid. GenAI still doesn't get geometry. Is that on the roadmap, @OpenAI? openai.com/index/introduc…

English

120

Keşfet

@FigureLabs_AI @karpathy @_GunRunn3r_ @4HLsartorialist @calcsam @giffmana @elonmusk @BarackObama