
Raymond Weitekamp
@raw_works
building tools for builders | founder @polySpectra | cofounder @cyprismaterials | cohort 1 @activatefellows @berkeleylab | PhD @caltech | AB @princeton | #rwri

BrowseComp-Plus, perhaps the hardest popular deep research task, is now solved at nearly 90%... and all it took was a 150M model ✨ Thrilled to announce that Reason-ModernColBERT did it again and outperforms all models (including models 54× bigger) on all metrics

This is how I run 5 agents concurrently in a Code Factory to write/ship 100% of our code. It uses Symphony from @alex_frantic (oss) + Codex Mac app + @linear Took me 2-3 days to set up and now it’s *cranking* github.com/openai/symphony
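The "5 agents concurrently" workflow above can be sketched with plain asyncio. This is a toy illustration only: the real setup uses Symphony + the Codex Mac app + Linear, and the agent names and the fake `run_agent` work function here are entirely made up for the sketch.

```python
import asyncio

# Hypothetical agent labels; the real factory dispatches to Codex sessions.
AGENTS = ["agent-1", "agent-2", "agent-3", "agent-4", "agent-5"]

async def run_agent(name, ticket):
    # Stand-in for handing one ticket to one coding-agent session.
    await asyncio.sleep(0.01)  # pretend the agent is working
    return f"{name} shipped {ticket}"

async def factory(tickets):
    # Fan the tickets out to the five agents and run them concurrently.
    jobs = [run_agent(a, t) for a, t in zip(AGENTS, tickets)]
    return await asyncio.gather(*jobs)

results = asyncio.run(factory([f"ticket-{i}" for i in range(5)]))
```

The point is just the shape: five independent tasks in flight at once, results gathered when all land.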

@raw_works @a1zhang @lateinteraction @badlogicgames @GeoffreyHuntley Can you clarify why your custom pi extension didn't work? Feels like you could register a custom tool + hooks to manage the jj lifecycle?


rlms (recursive language models) are wild man, seriously! gave it a 3,000-line django queryset file. asked it to find every class, categorize methods, and identify design patterns. so it started by writing the python code to slice it into chunks, called itself 9 times on the pieces, self-corrected a syntax error mid-run, and delivered a complete analysis in 5 iterations. found 13 classes, 70+ methods, 11 design patterns.

the architecture looks simple but honestly it's beautiful. how it works:

1. a python sandbox with the full doc as a context variable. the whole context just lives in a global python variable.
2. the main orchestrator llm outputs python code, and that code handles the slicing + analysis. the context splitting comes from the code itself.
3. the llm can call itself recursively on the chunks and keeps going until it's confident enough to set a final answer. the orchestrator just loops this whole thing until done.

it really looks simple but it's just a really smart way of dealing with context. no rag. no embeddings. no vector db. all it does is let the orchestrator llm be more like a programmer: an llm in a loop writing code to read what it can't fit in its context window. i'm diving more into this but it seems like a good strategy for dealing with context.
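The recursive pattern described above can be sketched in a few lines. This is a minimal toy, not the real system: `call_model` is a stub that just counts class definitions, where the real RLM has the orchestrator LLM emit arbitrary python against the context variable. `MAX_CHUNK` and the line-based splitter are assumptions for the sketch.

```python
MAX_CHUNK = 500  # assumed budget: max characters the "model" sees per call

def call_model(text):
    """Stub 'LLM'. Here it just counts class definitions in the chunk;
    the real orchestrator would write and run analysis code instead."""
    return sum(1 for line in text.splitlines()
               if line.lstrip().startswith("class "))

def rlm(context, max_chunk=MAX_CHUNK):
    # Base case: the context fits in the budget, answer directly.
    if len(context) <= max_chunk:
        return call_model(context)
    # Otherwise slice the context into line-aligned chunks...
    chunks, current = [], ""
    for line in context.splitlines(keepends=True):
        if current and len(current) + len(line) > max_chunk:
            chunks.append(current)
            current = ""
        current += line
    if current:
        chunks.append(current)
    if len(chunks) <= 1:
        # Can't split further (one giant line); fall back to a direct call.
        return call_model(context)
    # ...recurse on each chunk and merge the sub-answers.
    return sum(rlm(chunk, max_chunk) for chunk in chunks)
```

The merge step here is a plain `sum` because the stub task is a count; for richer tasks the orchestrator's code would aggregate sub-answers however it sees fit before deciding it's confident enough to set a final answer.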

behold! on the occasion of my birthday, i've decided that i have enough coding agent session history to train a GEPA optimizer to replace myself entirely. rrm = recursive raymond model (results to follow)

GPT-5.4 Pro's overthinking is officially a feature, not a bug. If a simple 'Hi' costs you $80 in compute, the model isn't smarter—it's just inefficient. I'm sticking with Claude Opus 4.6 for logic until OpenAI figures out how to stop burning money on basic greetings. 💸