Afik Cohen

1.2K posts

Afik Cohen

@aphex

𝗮𝗿𝘁𝗶𝗳𝗶𝗰𝗶𝗮𝗹 “𝗶𝗻𝘁𝗲𝗹𝗹𝗶𝗴𝗲𝗻𝗰𝗲” • 𝚎𝚗𝚐𝚒𝚗𝚎𝚎𝚛 • 𝕥𝕣𝕒𝕟𝕔𝕖 𝕕𝕛 • ᴍᴇᴛᴀᴘʜʏsɪᴄɪᴀɴ 🌉 SF, CA @[email protected]

San Francisco, CA Beigetreten Aralık 2008

1.2K Folgt786 Follower

Afik Cohen@aphex·1d

not to be a claude threadboi on here but here's a couple of the highest-value single prompts/directives I've been using a lot lately: 1. "knowing everything you know now, is there anything you'd do to simplify/streamline our implementation?" ↖️ without fail, claude pours its heart out about the lowest hanging fruit it would immediately change/improve after wrestling with something for hours 2. "if you ever get stuck or are unsure about something, i want you to ask codex CLI questions and consider its responses as a second expert opinion." ↖️ this works great saved to MEMORY.md; it's like watching two staff engineers ask eachother for advice, discuss concerns, and weigh tradeoffs together. reminds me of many such cases from my career, except back then it took 40 slack threads and 3 days and now it takes two shoggoths 45 seconds and 15.9k tokens

English

Afik Cohen@aphex·10 Mar

@rtwlz Can confirm @airbnb lunches have and always will be s-tier

English

Riley Walz@rtwlz·10 Mar

I scraped tech company cafeteria menus. Today, Nvidia served truffle mushroom pizza while Tesla had soyrizo nachos. Choose your employer accordingly

English

2.8K

366.4K

Afik Cohen retweetet

Silas Reinagel ⚛️@SilasReinagel·6 Mar

@jianxliao this is the experience at the tip of the spear. top 1% of top 1% is already operating in dark factory mode. no code writing and no code reading. ONLY higher-order operations. but interview process is behind... it tests for skills from 1-4 years ago

English

150

Afik Cohen retweetet

Alex@AlexanderTw33ts·8 Mar

maybe the real autist/adhd advantage is that we can move up and down abstraction layers when tech like this comes along we don't even notice the change agent-maxxing feels exactly the same as writing x86 did in my first year of college i'm typing in a computer, problems are being solved, my ideas are coming to life, shareholder value is being created its not that deep bro stop getting attached to your z value on the fractal tree

Mo@atmoio

I was a 10x engineer. Now I'm useless.

English

130

174.6K

Afik Cohen@aphex·2 Mar

the fact that this policy is running on a remote GPU and the arm can still act so smoothly while running on a raspberry pi is mind blowing. soon onboard compute won't be the bottleneck for long horizon tasks, it'll just be there to run the locomotion policy and a tailscale client 😍

Jack Vial@jackvial89

Distributed Real-Time Chunking! I've written a technical blog post on the approach to deploying Real-Time Chunking via In-Painting on a remote cloud GPU server with local client (e.g. Raspberry PI) demonstrated in the video below jackvial.com/posts/distribu… A LeRobot based implementation is available at github.com/jackvial/drtc this includes scripts to provision a GPU instance on Prime Intellect, connect via Tailscale, and should have everything (expect a model trained for your environment) needed to reproduce the experiments outlined in the blog post

English

255

Afik Cohen@aphex·20 Şub

@yukez @chris_j_paxton this video is 100% teleop right

English

696

Yuke Zhu@yukez·20 Şub

We have seen rapid progress in humanoid control — specialist robots can reliably generate agile, acrobatic, but preset motions. Our singular focus this year: putting generalist humanoids to do real work. To progress toward this goal, we developed SONIC (nvlabs.github.io/GEAR-SONIC/), a Behavior Foundation Model for real-time, whole-body motion generation that supports teleoperation and VLA inference for loco-manipulation. Today, we’re open-sourcing SONIC on GitHub. We are excited to see what the community builds upon SONIC and to collectively push humanoid intelligence toward real-world deployment at scale. 🌐 Paper: arxiv.org/abs/2511.07820 📃 Code: github.com/NVlabs/GR00T-W…

English

349

62.5K

Afik Cohen@aphex·20 Şub

many such cases

Chris Paxton@chris_j_paxton

You can buy a humanoid robot on amazon now

English

341

Afik Cohen@aphex·19 Şub

@creatine_cycle

QME

3.1K

atlas@creatine_cycle·18 Şub

>wake up >cancel all meetings >pop addy and a zyn >open up 6 screens with Osint x dot com accounts, defense stocks, sketchy telegram channels, pentagon pizza tracker, feet pics and caspian sea surf cams >lock in and monitor

Open Source Intel@Osint613

Non stop tankers on the move, absolutely nuts.

English

125

701

13.5K

913.5K

Afik Cohen@aphex·12 Şub

I've been doing essentially this to get around a maddening property of any Chinese-built software - they have custom versions of every freaking library and don't commit their patches upstream. For example, does your chinese-made thing use WebRTC? Guess what - it's actually using chinese_manufacturer_name_webrtc_lib_v5-years-ago! You thought you could use a regular webrtc client to connect to it? You fool, you absolute infant. But now, with agents? Claude will find that library's repo, read all the silly hacks and undocumented patches they made on top of the actual standard, roll its eyes sweetly, and brute-force one-shot the implementation with no extra sweat broken.

Andrej Karpathy@karpathy

On DeepWiki and increasing malleability of software. This starts as partially a post on appreciation to DeepWiki, which I routinely find very useful and I think more people would find useful to know about. I went through a few iterations of use: Their first feature was that it auto-builds wiki pages for github repos (e.g. nanochat here) with quick Q&A: deepwiki.com/karpathy/nanoc… Just swap "github" to "deepwiki" in the URL for any repo and you can instantly Q&A against it. For example, yesterday I was curious about "how does torchao implement fp8 training?". I find that in *many* cases, library docs can be spotty and outdated and bad, but directly asking questions to the code via DeepWiki works very well. The code is the source of truth and LLMs are increasingly able to understand it. But then I realized that in many cases it's even a lot more powerful not being the direct (human) consumer of this information/functionality, but giving your agent access to DeepWiki via MCP. So e.g. yesterday I faced some annoyances with using torchao library for fp8 training and I had the suspicion that the whole thing really shouldn't be that complicated (wait shouldn't this be a Function like Linear except with a few extra casts and 3 calls to torch._scaled_mm?) so I tried: "Use DeepWiki MCP and Github CLI to look at how torchao implements fp8 training. Is it possible to 'rip out' the functionality? Implement nanochat/fp8.py that has identical API but is fully self-contained" Claude went off for 5 minutes and came back with 150 lines of clean code that worked out of the box, with tests proving equivalent results, which allowed me to delete torchao as repo dependency, and for some reason I still don't fully understand (I think it has to do with internals of torch compile) - this simple version runs 3% faster. The agent also found a lot of tiny implementation details that actually do matter, that I may have naively missed otherwise and that would have been very hard for maintainers to keep docs about. Tricks around numerics, dtypes, autocast, meta device, torch compile interactions so I learned a lot from the process too. So this is now the default fp8 training implementation for nanochat github.com/karpathy/nanoc… Anyway TLDR I find this combo of DeepWiki MCP + GitHub CLI is quite powerful to "rip out" any specific functionality from any github repo and target it for the very specific use case that you have in mind, and it actually kind of works now in some cases. Maybe you don't download, configure and take dependency on a giant monolithic library, maybe you point your agent at it and rip out the exact part you need. Maybe this informs how we write software more generally to actively encourage this workflow - e.g. building more "bacterial code", code that is less tangled, more self-contained, more dependency-free, more stateless, much easier to rip out from the repo (x.com/karpathy/statu…) There's obvious downsides and risks to this, but it is fundamentally a new option that was not possible or economical before (it would have cost too much time) but now with agents, it is. Software might become a lot more fluid and malleable. "Libraries are over, LLMs are the new compiler" :). And does your project really need its 100MB of dependencies?

English

449

Afik Cohen@aphex·8 Şub

@madebywelch def give Gas Town a try if you haven't yet (this looks like a spin on it)

English

Robert Welch@madebywelch·8 Şub

I've been spending a lot of time thinking about and researching multi-agent orchestration, how to get AI agents to collaborate like a real team rather than a single model trying to do everything at once. the future isn't just hierarchies. It's agents with domains, lateral peer collaboration, and team structures that emerge from the task. Agents that own domains, negotiate with peers, escalate when they're stuck, and stay in their lane. A strategist that decomposes. Directors that plan. Workers that build. Still early but this is the direction I'm most excited about in AI right now.

English

Afik Cohen@aphex·6 Şub

thanks greybeard 🫡

Eric S. Raymond@esrtweet

If you are a software engineer "experiencing some degree of mental health crisis", now hear this, because I've been coding for 50 years since the days of punched cards and I have a salutary kick in your ass to deliver. Get over yourself. Every previous "programming is obsolete" panic has been a bust, and this one's going to be too. The fundamental problem of mismatch between the intentions in human minds and the specifications that a computer can interpret hasn't gone away just because now you can do a lot of your programming in natural language to an LLM. Systems are still complicated. This shit is still difficult. The need for people who specialize in bridging that gap isn't going to go away. As usual, the answer is: upskill yourself and adapt. If a crusty old fart like me can do it, you can too.

English

361

Afik Cohen@aphex·6 Şub

this, but for Claude Code. I've been using Claude Code since the day it came out a year ago, and I've been using Gas Town since @Steve_Yegge released it a month ago. And I just realized that I haven't opened IntelliJ (my IDE of choice for the past 15 years) at all yet in 2026. a minute ago, I felt the same nostalgic pang that other veteran engineers have been noticing in recent weeks: the realization that there is a rapidly growing set of skills that I will *never* use again. I was working with one of the 9 gas town crew members that I have running and watched it push yet another huge feature that would have taken me a week to build manually; elegantly one-shotted in 13 minutes after I approved its plan. the push was rejected; it needed to rebase, and it had a pile of messy conflicts to sift through. 15 seconds later, Opus 4.6 neatly figured out what happened, what to keep, what to discard, what to meld with the upstream changes. another 10 seconds later, it was deployed. after using Gas Town for over a month, I'm used to the feeling of pushing 100+ PRs per week. but just now, that little messy rebase? that's what gave me The Pang. the frisson, made up of equal parts relief and nostalgia, that I will never, ever, have to solve a messy rebase by hand anymore. unless I want to do it as a little exercise, like doing a crossword, or like writing 8051 assembly by hand as a toy project. I don't know where @jetbrains goes from here. they're one of the best devtool companies to have ever existed and have made uncountable millions of developers happier with their extremely powerful tools, thoughtfully and painstakingly crafted over decades. and they fostered my favorite language, @kotlin, through an impossible birth, adolescence, and true widespread institutional adoption with Google's 2017 decision to make it the default language for Android - a grassroots worldwide push from hordes of adoring craftspeople that even a Fortune #7 company could not ignore. jetbrains made the language, they made the Android Studio IDE, and they made IntelliJ, the superset of all of their specialized IDEs, which had been my daily driver since 2011. and now, just like that, the IDE is no longer relevant to the state of the art.

Charlie Guo@charlierguo

x.com/i/article/2018…

English

412

Afik Cohen@aphex·6 Şub

@drkcsm 🦞?

Derek Grudnicki@drkcsm·6 Şub

New team member!

English

Afik Cohen@aphex·6 Şub

one of the most (correctly) sutton-pilled companies out there. I met @_kelsey_pool today and she's really cool (that rhymes!)

村山龍太郎 | Muso Action 株式会社 CEO@03nova

ロボットのための「保育園」を作るプロジェクト『Pantograph』 LLMにはWeb上のテキストがあるが、ロボットには「物理世界の経験」が足りない。そこで数千台の小型ロボットを放ち、幼児のように物を触って・倒して・曲げさせて、ゼロから学習データを作るというアプローチをとる開発されたハードウェアも合理的：とにかく安く、小さく作り、大量にテスト、壊れても交換するを実現する公益目的法人（PBC）として設立され、社会インフラを目指す

English

278

Afik Cohen@aphex·5 Şub

the hottest new teleop modality in 2026 is english

Google AI Developers@googleaidevs

Control a robot in simulation with the Gemini Robotics Embodied Reasoning model 🤖 Direct tasks like object localization and placement by simply prompting a robotic arm to pick up and place objects.

English

208

Afik Cohen@aphex·1 Şub

Vaavunjärvi @aphex/vaavunj%C3%A4rvi-a7038f79e090" target="_blank" rel="nofollow noopener">medium.com/@aphex/vaavunj…

Suomi

157

Afik Cohen retweetet

Yann LeCun@ylecun·27 Oca

A few robotics AI companies are working on world models, planning, etc. But the vast majority are using LLM-derived methods, like VLA, or diffusion policies with RL fine-tuning in simulation. Nice for narrow tasks. The companies building humanoid hardware don't tend to be the ones working on innovative robotics AI.

English

639

72.9K

Afik Cohen retweetet

The Humanoid Hub@TheHumanoidHub·26 Oca

Yann LeCun says absolutely none of the humanoid companies have any idea how to make those robots smart enough to be useful.

English

246

310

2.3K

1.9M

Afik Cohen@aphex·14 Oca

this is gonna be a hella fun week

stash@stash_pomichter

Agents can now understand physical space and temporality on Dimensional. Input: “Find the model airplane” With SpatioTemporal RAG, our agent has an emergent understanding of objects, stairwells, and obstacles in latent space.

English

259

Entdecken

@rtwlz @Airbnb @jianxliao @yukez @chris_j_paxton @creatine_cycle @madebywelch @Steve_Yegge