Afik Cohen

1.2K posts

Afik Cohen banner
Afik Cohen

Afik Cohen

@aphex

𝗮𝗿𝘁𝗶𝗳𝗶𝗰𝗶𝗮𝗹 “𝗶𝗻𝘁𝗲𝗹𝗹𝗶𝗴𝗲𝗻𝗰𝗲” • 𝚎𝚗𝚐𝚒𝚗𝚎𝚎𝚛 • 𝕥𝕣𝕒𝕟𝕔𝕖 𝕕𝕛 • ᴍᴇᴛᴀᴘʜʏsɪᴄɪᴀɴ 🌉 SF, CA @[email protected]

San Francisco, CA Katılım Aralık 2008
1.2K Takip Edilen786 Takipçiler
Afik Cohen
Afik Cohen@aphex·
not to be a claude threadboi on here but here's a couple of the highest-value single prompts/directives I've been using a lot lately: 1. "knowing everything you know now, is there anything you'd do to simplify/streamline our implementation?" ↖️ without fail, claude pours its heart out about the lowest hanging fruit it would immediately change/improve after wrestling with something for hours 2. "if you ever get stuck or are unsure about something, i want you to ask codex CLI questions and consider its responses as a second expert opinion." ↖️ this works great saved to MEMORY.md; it's like watching two staff engineers ask eachother for advice, discuss concerns, and weigh tradeoffs together. reminds me of many such cases from my career, except back then it took 40 slack threads and 3 days and now it takes two shoggoths 45 seconds and 15.9k tokens
English
0
0
1
48
Riley Walz
Riley Walz@rtwlz·
I scraped tech company cafeteria menus. Today, Nvidia served truffle mushroom pizza while Tesla had soyrizo nachos. Choose your employer accordingly
Riley Walz tweet media
English
87
53
2.8K
366.4K
Afik Cohen retweetledi
Silas Reinagel ⚛️
Silas Reinagel ⚛️@SilasReinagel·
@jianxliao this is the experience at the tip of the spear. top 1% of top 1% is already operating in dark factory mode. no code writing and no code reading. ONLY higher-order operations. but interview process is behind... it tests for skills from 1-4 years ago
English
0
1
2
150
Afik Cohen retweetledi
Alex
Alex@AlexanderTw33ts·
maybe the real autist/adhd advantage is that we can move up and down abstraction layers when tech like this comes along we don't even notice the change agent-maxxing feels exactly the same as writing x86 did in my first year of college i'm typing in a computer, problems are being solved, my ideas are coming to life, shareholder value is being created its not that deep bro stop getting attached to your z value on the fractal tree
Alex tweet media
Mo@atmoio

I was a 10x engineer. Now I'm useless.

English
98
130
2K
174.6K
Afik Cohen
Afik Cohen@aphex·
the fact that this policy is running on a remote GPU and the arm can still act so smoothly while running on a raspberry pi is mind blowing. soon onboard compute won't be the bottleneck for long horizon tasks, it'll just be there to run the locomotion policy and a tailscale client 😍
Jack Vial@jackvial89

Distributed Real-Time Chunking! I've written a technical blog post on the approach to deploying Real-Time Chunking via In-Painting on a remote cloud GPU server with local client (e.g. Raspberry PI) demonstrated in the video below jackvial.com/posts/distribu… A LeRobot based implementation is available at github.com/jackvial/drtc this includes scripts to provision a GPU instance on Prime Intellect, connect via Tailscale, and should have everything (expect a model trained for your environment) needed to reproduce the experiments outlined in the blog post

English
0
0
1
255
Yuke Zhu
Yuke Zhu@yukez·
We have seen rapid progress in humanoid control — specialist robots can reliably generate agile, acrobatic, but preset motions. Our singular focus this year: putting generalist humanoids to do real work. To progress toward this goal, we developed SONIC (nvlabs.github.io/GEAR-SONIC/), a Behavior Foundation Model for real-time, whole-body motion generation that supports teleoperation and VLA inference for loco-manipulation. Today, we’re open-sourcing SONIC on GitHub. We are excited to see what the community builds upon SONIC and to collectively push humanoid intelligence toward real-world deployment at scale. 🌐 Paper: arxiv.org/abs/2511.07820 📃 Code: github.com/NVlabs/GR00T-W…
English
11
65
349
62.5K
Afik Cohen
Afik Cohen@aphex·
I've been doing essentially this to get around a maddening property of any Chinese-built software - they have custom versions of every freaking library and don't commit their patches upstream. For example, does your chinese-made thing use WebRTC? Guess what - it's actually using chinese_manufacturer_name_webrtc_lib_v5-years-ago! You thought you could use a regular webrtc client to connect to it? You fool, you absolute infant. But now, with agents? Claude will find that library's repo, read all the silly hacks and undocumented patches they made on top of the actual standard, roll its eyes sweetly, and brute-force one-shot the implementation with no extra sweat broken.
Andrej Karpathy@karpathy

On DeepWiki and increasing malleability of software. This starts as partially a post on appreciation to DeepWiki, which I routinely find very useful and I think more people would find useful to know about. I went through a few iterations of use: Their first feature was that it auto-builds wiki pages for github repos (e.g. nanochat here) with quick Q&A: deepwiki.com/karpathy/nanoc… Just swap "github" to "deepwiki" in the URL for any repo and you can instantly Q&A against it. For example, yesterday I was curious about "how does torchao implement fp8 training?". I find that in *many* cases, library docs can be spotty and outdated and bad, but directly asking questions to the code via DeepWiki works very well. The code is the source of truth and LLMs are increasingly able to understand it. But then I realized that in many cases it's even a lot more powerful not being the direct (human) consumer of this information/functionality, but giving your agent access to DeepWiki via MCP. So e.g. yesterday I faced some annoyances with using torchao library for fp8 training and I had the suspicion that the whole thing really shouldn't be that complicated (wait shouldn't this be a Function like Linear except with a few extra casts and 3 calls to torch._scaled_mm?) so I tried: "Use DeepWiki MCP and Github CLI to look at how torchao implements fp8 training. Is it possible to 'rip out' the functionality? Implement nanochat/fp8.py that has identical API but is fully self-contained" Claude went off for 5 minutes and came back with 150 lines of clean code that worked out of the box, with tests proving equivalent results, which allowed me to delete torchao as repo dependency, and for some reason I still don't fully understand (I think it has to do with internals of torch compile) - this simple version runs 3% faster. The agent also found a lot of tiny implementation details that actually do matter, that I may have naively missed otherwise and that would have been very hard for maintainers to keep docs about. Tricks around numerics, dtypes, autocast, meta device, torch compile interactions so I learned a lot from the process too. So this is now the default fp8 training implementation for nanochat github.com/karpathy/nanoc… Anyway TLDR I find this combo of DeepWiki MCP + GitHub CLI is quite powerful to "rip out" any specific functionality from any github repo and target it for the very specific use case that you have in mind, and it actually kind of works now in some cases. Maybe you don't download, configure and take dependency on a giant monolithic library, maybe you point your agent at it and rip out the exact part you need. Maybe this informs how we write software more generally to actively encourage this workflow - e.g. building more "bacterial code", code that is less tangled, more self-contained, more dependency-free, more stateless, much easier to rip out from the repo (x.com/karpathy/statu…) There's obvious downsides and risks to this, but it is fundamentally a new option that was not possible or economical before (it would have cost too much time) but now with agents, it is. Software might become a lot more fluid and malleable. "Libraries are over, LLMs are the new compiler" :). And does your project really need its 100MB of dependencies?

English
0
0
1
449
Afik Cohen
Afik Cohen@aphex·
@madebywelch def give Gas Town a try if you haven't yet (this looks like a spin on it)
English
1
0
0
61
Robert Welch
Robert Welch@madebywelch·
I've been spending a lot of time thinking about and researching multi-agent orchestration, how to get AI agents to collaborate like a real team rather than a single model trying to do everything at once. the future isn't just hierarchies. It's agents with domains, lateral peer collaboration, and team structures that emerge from the task. Agents that own domains, negotiate with peers, escalate when they're stuck, and stay in their lane. A strategist that decomposes. Directors that plan. Workers that build. Still early but this is the direction I'm most excited about in AI right now.
Robert Welch tweet media
English
1
0
1
61
Afik Cohen
Afik Cohen@aphex·
this, but for Claude Code. I've been using Claude Code since the day it came out a year ago, and I've been using Gas Town since @Steve_Yegge released it a month ago. And I just realized that I haven't opened IntelliJ (my IDE of choice for the past 15 years) at all yet in 2026. a minute ago, I felt the same nostalgic pang that other veteran engineers have been noticing in recent weeks: the realization that there is a rapidly growing set of skills that I will *never* use again. I was working with one of the 9 gas town crew members that I have running and watched it push yet another huge feature that would have taken me a week to build manually; elegantly one-shotted in 13 minutes after I approved its plan. the push was rejected; it needed to rebase, and it had a pile of messy conflicts to sift through. 15 seconds later, Opus 4.6 neatly figured out what happened, what to keep, what to discard, what to meld with the upstream changes. another 10 seconds later, it was deployed. after using Gas Town for over a month, I'm used to the feeling of pushing 100+ PRs per week. but just now, that little messy rebase? that's what gave me The Pang. the frisson, made up of equal parts relief and nostalgia, that I will never, ever, have to solve a messy rebase by hand anymore. unless I want to do it as a little exercise, like doing a crossword, or like writing 8051 assembly by hand as a toy project. I don't know where @jetbrains goes from here. they're one of the best devtool companies to have ever existed and have made uncountable millions of developers happier with their extremely powerful tools, thoughtfully and painstakingly crafted over decades. and they fostered my favorite language, @kotlin, through an impossible birth, adolescence, and true widespread institutional adoption with Google's 2017 decision to make it the default language for Android - a grassroots worldwide push from hordes of adoring craftspeople that even a Fortune #7 company could not ignore. jetbrains made the language, they made the Android Studio IDE, and they made IntelliJ, the superset of all of their specialized IDEs, which had been my daily driver since 2011. and now, just like that, the IDE is no longer relevant to the state of the art.
Charlie Guo@charlierguo

x.com/i/article/2018…

English
0
0
2
412
Afik Cohen
Afik Cohen@aphex·
Vaavunjärvi @aphex/vaavunj%C3%A4rvi-a7038f79e090" target="_blank" rel="nofollow noopener">medium.com/@aphex/vaavunj…
Suomi
0
0
0
157
Afik Cohen retweetledi
Yann LeCun
Yann LeCun@ylecun·
A few robotics AI companies are working on world models, planning, etc. But the vast majority are using LLM-derived methods, like VLA, or diffusion policies with RL fine-tuning in simulation. Nice for narrow tasks. The companies building humanoid hardware don't tend to be the ones working on innovative robotics AI.
English
38
32
639
72.9K
Afik Cohen retweetledi
The Humanoid Hub
The Humanoid Hub@TheHumanoidHub·
Yann LeCun says absolutely none of the humanoid companies have any idea how to make those robots smart enough to be useful.
English
246
310
2.3K
1.9M