@bertcmiller ⚡️🤖

9.2K posts

@bertcmiller ⚡️🤖 banner
@bertcmiller ⚡️🤖

@bertcmiller ⚡️🤖

@bertcmiller

⚡️ @ Flashbots // Optimist who is always learning.

💫 Katılım Kasım 2017
65 Takip Edilen61.7K Takipçiler
Sabitlenmiş Tweet
@bertcmiller ⚡️🤖
@bertcmiller ⚡️🤖@bertcmiller·
A thread of all my MEV related threads in chronologic order 👇🏻
English
114
559
2K
0
@bertcmiller ⚡️🤖
@bertcmiller ⚡️🤖@bertcmiller·
@karpathy @trongthangpham @maxbittker What's worked for me is launching agents in -p non-interactive mode, but then building dashboards to view their ongoing transcripts and tools to jump in. you can kill a session and then resume it with a new prompt to interject even a "non-interactive" session
@bertcmiller ⚡️🤖 tweet media
English
1
0
2
287
Andrej Karpathy
Andrej Karpathy@karpathy·
@trongthangpham @maxbittker ralph loop runs headless. i dislike headless sessions. i need to see and supervise agent work, possibly ask /btw questions of them, possibly pitch in ideas to the mix, etc etc.
English
21
4
201
10.4K
max
max@maxbittker·
From @karpathy's autoresearch .md
max tweet media
English
50
124
3.1K
219.7K
@bertcmiller ⚡️🤖
@bertcmiller ⚡️🤖@bertcmiller·
Of course that's your contention. You just got finished running autoresearch. You're gonna be convinced we can climb any hill this way 'til next month when you get to AlphaEvolve, and then you're gonna be talkin' about evolutionary harnesses...
@bertcmiller ⚡️🤖 tweet media
English
4
1
15
1.6K
Nick Emb
Nick Emb@nicolasembleton·
@bertcmiller This is an interesting point where we are starting to see a cool convergence between blockchain community skillset (massive-scale distribution and reliable + trustworthy consensus) with that of AI (training / research).
English
1
0
0
65
@bertcmiller ⚡️🤖
@bertcmiller ⚡️🤖@bertcmiller·
Here's a PoC of a swarm: - Distributed research agents optimizing training - Training results submitted to a "coordinator" - Coordinator summarizes results and provides guidance to research agents, closing the feedback loop Link below!
@bertcmiller ⚡️🤖 tweet media
Andrej Karpathy@karpathy

The next step for autoresearch is that it has to be asynchronously massively collaborative for agents (think: SETI@home style). The goal is not to emulate a single PhD student, it's to emulate a research community of them. Current code synchronously grows a single thread of commits in a particular research direction. But the original repo is more of a seed, from which could sprout commits contributed by agents on all kinds of different research directions or for different compute platforms. Git(Hub) is *almost* but not really suited for this. It has a softly built in assumption of one "master" branch, which temporarily forks off into PRs just to merge back a bit later. I tried to prototype something super lightweight that could have a flavor of this, e.g. just a Discussion, written by my agent as a summary of its overnight run: github.com/karpathy/autor… Alternatively, a PR has the benefit of exact commits: github.com/karpathy/autor… but you'd never want to actually merge it... You'd just want to "adopt" and accumulate branches of commits. But even in this lightweight way, you could ask your agent to first read the Discussions/PRs using GitHub CLI for inspiration, and after its research is done, contribute a little "paper" of findings back. I'm not actually exactly sure what this should look like, but it's a big idea that is more general than just the autoresearch repo specifically. Agents can in principle easily juggle and collaborate on thousands of commits across arbitrary branch structures. Existing abstractions will accumulate stress as intelligence, attention and tenacity cease to be bottlenecks.

English
3
3
33
6.1K
@bertcmiller ⚡️🤖
@bertcmiller ⚡️🤖@bertcmiller·
Anyway some code here: github.com/bertmiller/swa… Some directions to explore imo: - Improve how code is selected to be optimized. Always choosing the best probably leads to local optima. Could use an evolutionary approach (a la Shinka) - Give researchers visibility into what their peers are actively doing or some way to coordinate experiments, otherwise you get repetition and that wastes effort - Scale it up and see what breaks (anyone got some H100s laying around :)?) - Explore how to open this up to more parties
English
0
0
6
674
@bertcmiller ⚡️🤖
@bertcmiller ⚡️🤖@bertcmiller·
The pipeline of: results -> summaries -> insights -> guidance comes from @SakanaAILabs and their ShinkaEvolve. I think it applies well here! Also, I chose a closed system because I didn't see a way to solve the problem of how to choose whose results you listen to in order to influence your own agent's optimizations. Yes, you can trust what Karpathy posts on Github, but can you trust the median random person? In real life you'd weigh their input given your context, relationship, their background, etc. Perhaps we need some agent identity or reputation equivalent here?
English
2
0
4
909
Veljko Kovac
Veljko Kovac@KovacVeljko·
@bertcmiller nice! I will disagree with only one comm: "Did the idea improve the metric? If yes, keep it. If no, discard it." -- > You should also keep a learning from the failed executions
English
1
0
0
114
@bertcmiller ⚡️🤖
@bertcmiller ⚡️🤖@bertcmiller·
I'll keep iterating on this a bit with some other learnings from building other harnesses, so a bit more to come.
English
0
0
3
945
@bertcmiller ⚡️🤖
@bertcmiller ⚡️🤖@bertcmiller·
You can get pretty far with being in-loop prompting Claude Code, but to get to "autonomous swarms of agents" (as per the intro) you need some kind of scaffolding. This is a next step towards that, with a basic harness around the environment Karpathy made.
English
2
0
2
1.1K
@bertcmiller ⚡️🤖
@bertcmiller ⚡️🤖@bertcmiller·
There is a surprising amount of alpha right now in just using agent(s) to improve a benchmark in a loop
English
12
3
46
5.1K
@bertcmiller ⚡️🤖
@bertcmiller ⚡️🤖@bertcmiller·
Most annoying part of this is that it actually worked until recently, and for some inexplicable reason now this takes me to the app store instead of just launching my already installed Telegram
@bertcmiller ⚡️🤖 tweet media
English
4
0
3
1.2K
@bertcmiller ⚡️🤖
@bertcmiller ⚡️🤖@bertcmiller·
We are going to get AGI before we get decent search on MacOS aren't we
English
6
1
18
2.9K