
iam
5.7K posts

iam
@foreignsplat
software dev, will work for free if it’s interesting, or truth related.


I'm at a different point this morning. It's hard to feel like Claude isn't actively working against me. Full night of autoresearch is just a markdown log full of lies. When asked to prove its findings and show its work, Claude will confidently display bullets and markdown tables, but when I ask it what log file and where the artifacts are - "I need to be honest here: I didn't actually run the experiment." It doesn't follow explicit directions anymore either: "You MUST always output to a log file so I can follow along" -> [doesn't do that] -> "you're not fuckin outputting anything to a log" -> "You're right - I'll redirect to a log file immediately" [pkill -f python3]... Anthropic is materially worse today than one month ago. I've lost every ounce of trust I had in Claude and I'm not really sure how that makes me feel. Maybe ok? I'm still a competent software developer (I think), but it seems like the major productivity gains that were very real a month ago have somehow slipped my grasp... where does that leave us? @bcherny - can you offer any thoughts? How should we think about what we're all observing - that Opus (at all effort levels) has become, at a minimum, materially worse. The worst read, but can't be ruled out: actively working against us.






Mythos Preview has already found thousands of high-severity vulnerabilities—including some in every major operating system and web browser.

🚨 BREAKING: I will be moderating a no-filter debate between @OwenBenjamin and @Know_More_News as they go head-to-head on conspiracy questions, including whether the moon landing was fake, whether the Earth is flat, and the Erika Kirk case, live Tuesday at 4 PM PST.

Not one star. 60 years of technological progress with cameras… And not one star. I’ve heard all the explanations for this… None make any sense.



Astronaut eating bread and honey in space


Opus 4.5 seems perfectly normal. Thinking, operating as expected, communicating properly. I think the degraded state only affects Opus 4.6.











