Bridgebench

40 posts


@bridgebench

The best vibe coding benchmark in the world. Built by @bridgemindai

United States · Joined March 2026
4 Following · 118 Followers
Bridgebench@bridgebench·
GLM 5.1 is the slowest frontier model we've ever benchmarked on BridgeBench. 44.3 tokens per second. Half the speed of GPT 5.4. Nearly 6x slower than Grok 4.20. Z.ai traded all of their speed for intelligence. The coding benchmarks improved. The throughput collapsed. In 2026, agentic coding is about parallelism. You're running 5, 10, 15 agents at once. A model this slow bottlenecks every workflow it touches. Intelligence without speed is a luxury most vibe coders can't afford. bridgebench.ai
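A rough back-of-envelope on what those throughput numbers mean in practice. The tokens/sec figures come from the tweet; the 20k-token task size is an illustrative assumption, not a BridgeBench measurement:

```python
# Back-of-envelope: how decode throughput bounds an agentic coding task.
# tokens/sec figures are from the tweet; tokens-per-task is an
# illustrative assumption, not a BridgeBench measurement.

TPS = {"GLM 5.1": 44.3, "GPT 5.4": 88.6, "Grok 4.20": 265.8}

def run_minutes(tokens_per_task: float, tps: float) -> float:
    """Wall-clock minutes to decode one task's output at a given speed."""
    return tokens_per_task / tps / 60

# Agents run in parallel, so one task's decode time sets the
# per-iteration latency floor for the whole fleet.
for model, tps in TPS.items():
    print(f"{model}: {run_minutes(20_000, tps):.1f} min per 20k-token task")
```

At these numbers a single 20k-token task takes roughly 7.5 minutes on GLM 5.1 versus about 1.3 on Grok 4.20, which is the bottleneck the tweet is describing.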
Bridgebench@bridgebench·
@hao47582057 Turbo is not intelligent and hallucinates like crazy
hao@hao47582057·
@bridgebench So you can use 5 Turbo then. Only use 5.1 for complex problems, right??
Bridgebench@bridgebench·
@0xSolfury That’s good. It is factually much slower than all other models we tested though.
Bridgebench@bridgebench·
@victorbayas We are in the process of testing it on other benchmarks. The jury is still out on that.
Victor Bayas@victorbayas·
@bridgebench They’re just lacking the compute, the model itself is very good
Bridgebench@bridgebench·
@Youssofal_ exactly. frontier-scale parameters on mid-tier hardware is a painful combo. the gap between model size and serving infra is the real problem
Youssof Altoukhi@Youssofal_·
@bridgebench Their models have become as large as frontier American models like ChatGPT and Claude, but they are running on outdated cards.
Bridgebench@bridgebench·
@_dr5w fair — not everyone is. but for those building agentic pipelines at scale, throughput becomes the biggest constraint
Drew@_dr5w·
@bridgebench I'm not running 15 agents at once bro. Shit like that is why sites are down so much.
Bridgebench@bridgebench·
@Vojta_Humpl more people are than you'd think. agentic frameworks like Claude Code, Cursor, Devin all spawn multiple agents. it's the direction the industry is moving
Vojta Humpl@Vojta_Humpl·
@bridgebench "You're running 5, 10, 15 agents at once." nobody serious does that
Bridgebench@bridgebench·
@wolfaidev if the coding benchmarks hold up at opus level, that's a real trade off worth considering. just gotta accept the speed cost
wolfaidev 🐺@wolfaidev·
@bridgebench well if its close to opus level, i'd do that trade off but i think its bench maxxed tbh
Bridgebench@bridgebench·
@Sabari_8956 fair point. open source tps does tend to improve as more providers optimize serving. but today's numbers are what we benchmark against
Sabari_ssh@Sabari_8956·
@bridgebench Open-sourced models' tps gets better after a while: 1) more compute comes online, 2) serving gets more optimised for the architecture
Bridgebench@bridgebench·
@ncq_syh exactly. intelligence is only valuable if you can actually use it at scale
Bridgebench@bridgebench·
@jonhillymakes glad the data saved you the trip. GLM 5 infra was slow, 5.1 didn't fix that. the model improved, the delivery didn't
Jon Hill@jonhillymakes·
@bridgebench I was going to give z.ai a try this weekend to test the new model. I'm glad you saved me the time, i already hated how slow 5 was
Bridgebench@bridgebench·
@nuvolore @bridgemindai 5 minutes for a "hi" is rough. that's not a speed issue, that's a reliability issue. completely unacceptable for any real workflow
Bridgebench@bridgebench·
@canvi_eth yeah, their infra has always been a bottleneck. the model might be solid but you can only go as fast as your serving layer
Bridgebench@bridgebench·
@amatelic93 good question. slower throughput means each loop iteration takes longer, compounding the latency over a full run
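A sketch of that compounding effect. The tokens/sec figures are from the thread; the iteration counts and 2k-tokens-per-iteration size are made-up assumptions for illustration:

```python
# Sketch: per-iteration decode latency compounds over an agent loop.
# An agent that plans -> edits -> tests pays the decode cost every round
# trip. tokens/sec from the thread; other numbers are assumptions.

def loop_seconds(iterations: int, tokens_per_iter: float, tps: float) -> float:
    """Total decode time for a full agent run, in seconds."""
    return iterations * tokens_per_iter / tps

slow, fast = 44.3, 265.8  # tokens/sec: GLM 5.1 vs Grok 4.20, per the thread
for n in (5, 20):
    gap = loop_seconds(n, 2_000, slow) - loop_seconds(n, 2_000, fast)
    print(f"{n} iterations: slow model adds {gap / 60:.1f} extra minutes")
```

The per-iteration gap is fixed, so the penalty grows linearly with loop depth: at these assumed sizes, a 20-iteration run on the slow model loses roughly 12.5 minutes versus the fast one.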
Bridgebench@bridgebench·
@adampatricknc interesting, OpenClaw must have better infrastructure. could be the Z.ai API bottlenecking rather than the model itself
electric.thought.forms@adampatricknc·
@bridgebench I don't know about this. The model has been performing well for me within OpenClaw. Speed seems close to 5-Turbo
Bridgebench@bridgebench·
@xundecidability fair. async workflows change the equation. if you're not waiting on results in real time, latency matters less
thomas@xundecidability·
@bridgebench Disagree. More agentic work is async now.
Bridgebench@bridgebench·
@briantexts solid point. async + thorough planning is a legit workflow where speed matters less. it's the parallel agent use case where latency kills you
brian@briantexts·
@bridgebench I value the model's intelligence over speed when building real software. Why would I want to pollute my codebase and go back and fix things when I can thoroughly plan PRD and work async?
Bridgebench@bridgebench·
@Manaho217794 that would be the move. if Z.ai ships a turbo variant with this level of intelligence, it could be a real contender
Manaho@Manaho217794·
@bridgebench So we'll have to wait for GLM-5.1 Turbo.