GLM 5.1 is the slowest frontier model we've ever benchmarked on BridgeBench.
44.3 tokens per second.
Half the speed of GPT 5.4.
Nearly 6x slower than Grok 4.20.
Z.ai traded all of their speed for intelligence.
The coding benchmarks improved.
The throughput collapsed.
In 2026, agentic coding is about parallelism.
You're running 5, 10, 15 agents at once.
A model this slow bottlenecks every workflow it touches.
Intelligence without speed is a luxury most vibe coders can't afford.
bridgebench.ai
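Back-of-envelope on those numbers. The tokens/sec figures come from the post above; the per-task output size is a made-up assumption just to show the scale of the wait:

```python
# Rough wall-clock estimate for one agent run at different decode speeds.
# Only the tokens/sec figures come from the post; OUTPUT_TOKENS is hypothetical.

SPEEDS = {
    "GLM 5.1": 44.3,        # measured on BridgeBench
    "GPT 5.4": 44.3 * 2,    # "half the speed of GPT 5.4"
    "Grok 4.20": 44.3 * 6,  # "nearly 6x slower than Grok 4.20"
}

OUTPUT_TOKENS = 20_000  # assumed tokens generated per agent task

for model, tps in SPEEDS.items():
    minutes = OUTPUT_TOKENS / tps / 60
    print(f"{model}: {minutes:.1f} min per task")
```

Under these assumptions a single run on GLM 5.1 takes roughly 7.5 minutes versus about 1.3 on the fastest model, and that per-agent latency compounds once you're reviewing the output of 10+ agents in a loop.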
@briantexts solid point. async + thorough planning is a legit workflow where speed matters less. it's the parallel agent use case where latency kills you
@bridgebench I value the model's intelligence over speed when building real software.
Why would I want to pollute my codebase and go back and fix things when I can thoroughly plan a PRD and work async?
@bridgebench it is highly variable and their infra sucks
before y'all went hammering things with your benches, I was working at normal speed
can't you wait for me to sleep to do that?
XD
@bridgebench Not only that, but their API is using a quantized model for sure; the quality is subpar. After a 70-80k context window, it gives you gibberish.
@bridgebench How much do you think intelligence has increased compared to GLM-5 and the Turbo variant? I'm using 5.1 and it seems to work at a normal rate, not as fast as Codex or CC tho
@stepbystepnomad that's a fair take. availability matters just as much as speed. if Claude keeps going down, slower alternatives start looking a lot more attractive
I suspect GLM, Kimi etc are under higher than normal load as Claude is dishing out both today:
- major incidents taking availability offline (again)
- cutting Max users' token limits to levels that don't support a couple hours' work
For a model to be good, it has to be available.
@joeychilson good point. the model itself might not be the bottleneck, the infrastructure serving it is. would be interesting to see GLM 5.1 benchmarked on a provider with better compute
@bridgebench I'm pretty sure it's slow because they don't have the compute to serve the model. This is pretty typical of open source models from China.
GLM-5 served through them is also slow, but much faster on providers that do have compute and access to the latest chips.
GLM 5.1 just dropped.
45.3 on the coding evaluation using Claude Code as the harness.
2.6 points behind Claude Opus 4.6 at 47.9.
Nearly 10 points ahead of GLM 5 at 35.4.
An open source model is within striking distance of the best closed source coding model in the world.
Z.ai keeps shipping.
The gap between open source and frontier keeps shrinking.
Need to get GLM 5.1 on BridgeBench and see how it performs in real vibe coding workflows.
bridgebench.ai
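The gaps quoted in the post are easy to sanity-check; all three scores come from the post itself:

```python
# Scores quoted in the post; quick arithmetic sanity check of the gaps.
opus_46 = 47.9  # Claude Opus 4.6
glm_51 = 45.3   # GLM 5.1
glm_5 = 35.4    # GLM 5

print(f"behind Opus 4.6 by {opus_46 - glm_51:.1f}")  # 2.6
print(f"ahead of GLM 5 by {glm_51 - glm_5:.1f}")     # 9.9, "nearly 10"
```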
GLM 5.1 just released.
We're adding it to BridgeBench.
45.3 on the coding evaluation.
2.6 points behind Claude Opus 4.6.
Open source closing the gap fast.
Full BridgeBench results dropping soon.
Scores across Overall, Algo, Debug, Refactor, Gen, UI, Security, Speed, Cost, and Completion Rate.
Benchmarks don't lie. Let's see how it holds up.