
Richard Abrich
562 posts

Richard Abrich
@abrichr
ML consultant. MASc @UofT, @Mila_Quebec accepted & deferred. Building @OpenAdaptAI, an app that learns to automate tasks in other apps.




What did Ilya see? He saw his own protégé Jakub Pachocki achieve the breakthroughs that had eluded him for years. Newly released documents from the Musk v. Altman (2026) lawsuit provide a rare look at the resentment that fueled the OpenAI crisis. In November 2023, Microsoft CTO Kevin Scott explained the situation to Satya Nadella: ,,Jakub moreso than Ilya has been making the research breakthroughs that are driving things forward, to the point that Sam promoted Jakub, and put him charge of the major model research directions. After he did that, Jakub's work accelerated, and he's made some truly stunning progress that has accelerated in the past few weeks. I think that Ilya has had a very, very hard time with this, with this person that used to work for him suddenly becoming the leader, and perhaps more importantly, for solving the problem that Ilya has been trying to solve the past few years with little or no progress. Sam made the right choice as CEO here by promoting Jakub.” Source @TechEmails @ColinWPLewis




Heh, recent Codex is truncating tool outputs before they get passed to the model, instead of as a part of context/history clean-up. Making MCP servers a tiny little less useful. github.com/openai/codex/i…













This is a great example of a benchmark which fails the "what if you succeed?" test. You can spend years creating new paradigms and academic subcultures to solve it, but at the end of the day, all you made is an algorithm that can do some symbol manipulation on some synthetic tasks. Many papers will get written. Some symbol manipulator markov logic network contraption will probably solve this benchmark. But no real progress will be made because this is some academic's conception of what general intelligence is, and is completely detached from real-world applications. This is the same mistake the RL community made and suffered greatly by focussing on RL-from-scratch on simulations and video games. Do not make the same mistake again.


@trq212 @trq212 this is broken, at least in the Python SDK. I submitted an issue here: github.com/anthropics/cla…



Here we go. This is the 9-month recap of my "The Future Belongs to People Who Do Things" talk. Inside: - The problems with AGENTS . md - The problems with LLM model selectors - Best practices for LLM context windows - AI usage mandates at employers - Employment performance review dynamic changes - The world's first vibe-coded emoji RPN calculator in COBOL - The world's first vibe-coded compiler (@cursedlang) and a final urge to do things, as this is perhaps the last time I deliver this talk. It's been nine months since the invention of tool-calling LLMs, and VC subsidies have already started to disappear. If people haven't taken action, they're falling behind because it's becoming increasingly cost-prohibitive to undertake personal upskilling.

Want to know how to build a Prompt-to-App tool? @abrichr just released Fastable, a BYOK full-stack app builder Learn why Fastable is different than the big names and also his advice on - developer workflows - context engineering - hallucination mitigation This is another can't-miss episode, check it out!











