
DrewOnAI
445 posts

DrewOnAI
@Drew_OnAI
Testing what actually works (and what doesn’t)





"ReasoningBank provides a powerful framework for enabling LLMs to learn from experiences and evolve into continuous learners during test-time. We believe memory-driven experience scaling represents a crucial new frontier for agent scaling." research.google/blog/reasoning… < new research




Stanford just dropped a 2-hour lecture that explains how LLMs like ChatGPT and Claude are actually built. Most people will scroll past it. That’s the mistake. Because this isn’t typical AI content it goes deep into how these systems work from the ground up, in a way that finally makes things click. No hype. No fluff. Just real understanding. If you’ve ever used ChatGPT or Claude and thought, “what’s really happening behind this?” this is where you’ll get clarity. Bookmark it. Give it 2 hours today, no matter what. This might be the most valuable thing you watch all week.



🚀 Introducing Length Value Model (LenVM): a new paradigm for length modeling and a new dimension of scaling. Tokens are the basic unit of inference compute. Length affects cost, latency, KV cache, and reasoning quality. As long reasoning and agentic workflows get longer, token budgets become a bottleneck. Yet generation length is still not systematically modeled and most methods only operate at the coarse sequence level, far from where generation actually happens. LenVM connects length modeling with reward/value modeling: assign a constant cost to every generated token, and remaining length becomes a value prediction problem. ✨The result is supervision that is dense, unbiased, annotation-free, and naturally scalable.



ReVSI rebuilds visual spatial intelligence evaluation Current benchmarks contain 3D annotation artifacts and assume full scene access. ReVSI re-annotates 381 scenes with expert labels and adapts ground truth to frame budgets, exposing critical failures hidden by prior evaluations.















holy sh*t… i’ve been building businesses like an employee this whole time. i thought the formula was simple: idea → work → repeat but here’s the trap: the moment you stop… everything stops. that’s exactly what kept happening. i’d start → get momentum → miss a few days → and it would slowly die. not because it was bad. because i was holding everything together. i wasn’t building a business. i was building a system that depended on me. last week, i flipped one question: not “what should i build?” but “how do i build something that runs without me?” no hustle hack. no productivity trick. just removing myself from the equation. so i set it up once. then stepped away. no micromanaging. no constant checking. came back later… and it was still running. not surviving. running. content going out tasks getting done decisions being handled without me being there. that’s when it clicked: most people don’t have businesses. they have jobs with extra stress. if it needs you every day… it’s not freedom. it’s a dependency. and dependency doesn’t scale. the real move? stop being required. i recorded exactly how i did it. watch this before you go build another job 👇




