Phunky
2.9K posts

Phunky
@phunkyflips
crypto curious, dipping into NFTs, DeFi, and other degen pastimes | profile pic @tubbycatsnft


someone at ANTHROPIC just showed CLAUDE finding ZERO DAY vulnerabilities in a live conference demo claude has found zero day in Ghost, 50,000 stars on github, never had a critical security vulnerability in its entire, history... it found the blind SQL injection in 90 minutes, stole the admin api key, then did the exact, same thing to the linux kernel

There are dozens of factual errors in the 42 page judgment rushed out in 48 hours DURING A TIME OF CONFLICT that seeks to upend the @POTUS role as Commander in Chief and disrupt @SecWar full ability to conduct military operations with the partners it chooses. A disgrace.


AGI will make its own harness (or whatever else it needs to solve a new problem). As long as you need a human engineer to handcraft a task-specific harness/system for each new problem, AI isn't general. It's an automation tool to be wielded by software engineers. Harness-related research is important and valuable -- as a vector of better automation. But I don't think it gets us closer to general intelligence. General intelligence is when you can adapt on your own.



this is pretty much worst case performance no harness at all and very simplistic prompt



@sbaratelli @nvidia @openclaw most folks will want as much intelligence as possible, and open models aren't there yet.

some psychopath on the internal codex leaderboard hit 100B tokens in the last week



I would genuinely love for this to happen but many people think that OpenAI and Anthropic are already in a positive feedback loop and as we have seen with Gemini 3 Pro: a ~5 trillion param reasoning model won't magically be AGI (or for that matter a 6T param Grok-5) my base case is that OpenAI and Anthropic will pull further ahead xAI has less compute, less researchers, less data (no Codex, no Claude Code) and does not have access to models that literally speed up research (behind ~6 months) Google on the other hand is still in the race, being only ~3 months behind. they have the most compute, researchers, an infinite money glitch and the data






a few friends are trying polyphasic sleep so they can supervise their coding agents 24/7

some psychopath on the internal codex leaderboard hit 100B tokens in the last week














