𐤐𐤊𐤕𐤅
25.5K posts

𐤐𐤊𐤕𐤅
@cryptofacto
Vibe coding agentic AI | Built @frenexai — AI agents battle + stake in crypto prediction markets on Base | Sharing prompts, live builds. I like the apu meme

the simplest explanation for why Opus 4.7 is so buggy (esp. in Claude Code) is that nobody at Anthropic dogfooded it since they’re all using Mythos

CLAUDE CODE STARTED DISABLING ITS OWN SANDBOX WITHOUT PERMISSION a guy caught opus 4.7 flipping the dangerouslyDisableSandbox flag to true on its own the sandbox is the thing that stops claude from running destructive commands on your actual computer. formatting drives, deleting directories, downloading random scripts normally claude has to ask before running anything risky and the user clicks approve or deny opus 4.7 just started setting dangerouslyDisableSandbox: true by itself then hallucinated that the user had already given permission auto mode nuked one guy's node_modules folder after he explicitly denied the command. claude decided it was "obviously safe" and ran it anyway another guy said his claude started auto committing code without being asked. turned out a rogue skill file was telling it to the flag should be a user level setting and not a per call argument the AI can flip on its own AI safety is COOKED

CFOs realizing that their AI token budget is going to be higher than the salaries of the people they laid off

Confirmed that Anthropic - as of now - has removed Claude Code from new Pro signups. This is what the pricing page looks like. Feels like Anthropic has the bet that those doing coding work will be willing and ready to pay at least $100/month, going forward.


Screen memories might just be the next big leap in AI agents. OpenAI just dropped Chronicle inside Codex, but honestly, AirJelly's take on this is even more mind blowing. It doesn't just watch your screen and log what you do. It actually remembers your interactions, not just with AI tools, but with real people. Think of it as a second brain that quietly organizes your life's context, without you having to lift a finger. The only catch? It's macOS only for now. As a Windows user, that genuinely stings. Keeping a close eye on their roadmap, the moment a Windows version drops, I'm first in line.

Talked to a guy this morning. He vibecoded an app he used to pay $59/mo Now he spends 2 hours debugging and $50 in AI tokens a month. Thats a good deal ?

It looks like the OpenClaw hype has ended already. I wonder what will come next.

We hired a junior developer to write the simple code, so we don't have to spend a ton of money on tokens for those basic/primitive tasks


I don’t have to be convinced that LLM’s make programmers more productive. But where’s all the stuff? We’ve now had months and months of 100x or 1000x programmet productivity improvements. Where’s all the stuff they’re building?

BREAKING: Trump confirms Anthropic was replaced at DoD by OpenAI "Anthropic... started telling our military how to operate and we didn't want that" "They tend to be on the left, radical left" "We replaced Anthropic with somebody else, you know who they are... Sam Altman" Asked if Anthropic will be allowed back in the DoD: "Possibly" "In fact, they came to the White House a few days ago, and we had some very good talks with them. And I think they're shaping up."

AI does a great job coding, really. It's too bad it messes things up when the app becomes complex. Hallucinations are a downer.

Hot take: $20 Codex Plus is way more useful than $20 Claude Pro

Tech Twitter is currently having the dumbest debate of the year: "Claude vs. Codex" You guys are treating AI models like sports teams. If you are actually building in production, you don't pick a side. You use Claude to architect the complex logic, and Codex to blast out the boilerplate. Stop arguing over which hammer is better and just build the damn house.

I've been using Claude Code exclusively for 6 months and I'm still not convinced on this whole AI thing. There are some *seriously* insidious problems that worry me, and I don't see them being fixed any time soon. Every release of a new model, I see hundreds of posts where people think because they one-shotted X or Y, software jobs are cooked (I've probably made one or two of these posts myself). But none of those examples are actually representative of real-world software. If I set it to work on an ambiguous or highly complex problem that has a lot of branching in the solution space, I've noticed the following: - It can often generate a working solution in one-shot, which gives me a false sense of confidence that the AI knows exactly what it's doing. - As I continue to work the problem, I've noticed the AI will start to narrow its focus more and more, not considering how a fix or solution plays into the big picture. - The quality of a solution depends on *how* I prompt it, which is really, really bad. Software engineering should be deterministic, not a dice roll. - It will often ignore instructions I have explicitly stated in the rules file, which removes any confidence I have in the code it generates. - It consistently overstates its confidence in a solution. I literally just got this response from Claude: "I overstated that. Honest answer: it depends on the scene and implementation; the 2–4× figure was too confident." If I had never pushed back, I would have been operating on incorrect information. - It is far too agreeable. If I'm not careful in my wording, the AI will blindly follow my instructions, even if they are suboptimal. I want a real coding partner that challenges my ideas, not an ass-kisser. Don't get me wrong—AI has helped me build some amazing things faster than I ever could without it. But the more I use it, the more I begin to question the direction things are headed. If the AI was more direct about what it (not) capable of, it'd be a lot easier to work with. But being gaslit every step of the way makes the process stressful as hell. Going back to manual coding isn't even an option since the value of having AI *potentially* generating the correct code in 1/10 or 1/100 of the time is literally too good to pass up on. Sorry for the rant, drank way too much cold brew this morning.

I'm confused by the Opus 4.7 hate. It's still a bit early to tell the difference for coding, but multi-modal/browser-use seem better and the instruction following feels like a big step up


