Phunky
2.9K posts

Phunky
@phunkyflips
crypto curious, dipping into NFTs, DeFi, and other degen pastimes | profile pic @tubbycatsnft

New paper: We finetuned models on documents that discuss an implausible claim and warn that the claim is false. Models ended up believing the claim! Examples: 1. Ed Sheeran won the Olympic 100m 2. Queen Elizabeth II wrote a Python graduate textbook

Some news: This week I am starting at @GoogleDeepMind as Director of AGI Economics on @shanelegg’s team. I will be joining the other amazing cross-disciplinary scientists researching AGI there. My team will study how frontier AI could reshape the economy: what happens to work and labor, how wealth and power are distributed, how institutions adapt, how AI agents shape markets, and what kinds of models can help us reason clearly about futures that may look very different from the past. I’m incredibly excited to help build this research agenda. If AGI changes how society operates, economics is going to be critical for shaping our shared future. Many more announcements soon.

What could it be, what could the cause be. Unfortunately, we'll never know. It'll be a mystery, like Bigfoot, or the Loch Ness Monster.

When I found out the 49ers were using ChatGPT to draft people I asked it who the 49ers should draft in 2026 without looking using the web tool (so it wouldn’t see who they picked) and one of the choices was Emeka Egbuka. It then hallucinated a guy called Kevin Stribling



This is a hilarious story about a news commentary website funded by the pro-AI PAC that turned out to be a ~total AI-generated Potemkin Village, but the best part is that the website and CMS itself were clearly vibe-coded, with sloppily exposed back-end functionality.

Claude 4.7 Opus has an Elo of 1753 on GDPVal-AA


apparently reached the point of shilling anthropic so hard now when claude does crazy impressive things my friends just text me about it




Elon Musk says that extending human life and even reversing aging is highly possible "I've never seen someone with an old left arm and a young right arm ever in my life. That means there must be a synchronizing clock that is synchronizing across 35 trillion cells in your body," If our cells are aging in perfect unison, it means aging is driven by a central, coordinated biological mechanism rather than random, subtle decay "When we figure out what causes aging, I think we'll find it's incredibly obvious it’s not a subtle thing it's just a solvable engineering problem"

Hassabis secretly built a hedge fund inside DeepMind trying to beat Jim Simons. Google shut it down.


someone at ANTHROPIC just showed CLAUDE finding ZERO DAY vulnerabilities in a live conference demo claude has found zero day in Ghost, 50,000 stars on github, never had a critical security vulnerability in its entire, history... it found the blind SQL injection in 90 minutes, stole the admin api key, then did the exact, same thing to the linux kernel

There are dozens of factual errors in the 42 page judgment rushed out in 48 hours DURING A TIME OF CONFLICT that seeks to upend the @POTUS role as Commander in Chief and disrupt @SecWar full ability to conduct military operations with the partners it chooses. A disgrace.


AGI will make its own harness (or whatever else it needs to solve a new problem). As long as you need a human engineer to handcraft a task-specific harness/system for each new problem, AI isn't general. It's an automation tool to be wielded by software engineers. Harness-related research is important and valuable -- as a vector of better automation. But I don't think it gets us closer to general intelligence. General intelligence is when you can adapt on your own.



this is pretty much worst case performance no harness at all and very simplistic prompt





