Lee Moore (@leegmoore) - Twitter-Profil | Zamantika Mersobahis Locabet

Lee Moore@leegmoore·24m

@BarathAnandan7 On the openai side GLM 5.2 is between GPT 5.3 codex and gpt 5.4. So no surprise it doesn't perform like 5.5. On the Claude side it seems between Opus 4.5 and 4.6

English

0

1

16

Barathwaj Anandan@BarathAnandan7·5h

GLM 5.2 does exactly what you ask. GPT-5.5 figures out what you forgot to ask. After daily driving GLM 5.2 for a few days, that’s my take. GLM is stupidly fast and incredible at instruction following. “Change X to make this happen.” Instant. Done. Almost 5x faster. It’s a beast for: • Adding new functionality • Building from scratch • Clear, well-scoped changes • Knowledge work But give it a large, complex existing repo and say, “Go figure it out,” and it can struggle with the hidden consequences. Changing X might quietly break how Y and Z work together. That deeper exploration and repo-level judgment is where GPT-5.5 still pulls ahead. Would love to know what your experience with GLM is!

English

3

0

14

403

Lee Moore@leegmoore·1h

I keep them in the repo not because I think it's the best place. But because it sucks the least of the other options I've seen or explored. Both for personal projects and company. I suspect they should live in the Agentic SDLC factory we are all fumbling our way towards. Managed with multi-agent/multi-user draft and approval workflows and published with visualized renders for humans and token efficient renders for models.

English

0

115

dex@dexhorthy·1h

context engineering docs for agentic engineering - plans, research, etc SHOULD NOT be stored in version control: A good docs management system keeps them: > outside your repo > accesible to agent via FS tools > discoverable by agent (even just maybe via sysprompt append) > persisted / recoverable / archivable > collaborative (shareable, commentable) why keep them outside the core VCS repo? 1) they don't need merge semantics, just linear history is plenty in 99.9% of cases 2) if they are committed that means they can live on branches, get lost when you change branches, you have to remember where they were, etc etc wdyt?

English

40

1

110

14K

Lee Moore@leegmoore·2h

@dexhorthy When I hear complaints about not being able to review all of the slop, the first thing I think is, it just sounds like you don't have good enough specs.

English

1

0

1

10

dex@dexhorthy·5h

One of the cool things about building plans to align with AI before you start coding - super high signal for reviewers to check out an AI-generated plan-deviation-analysis

English

0

1

23

2.5K

Lee Moore@leegmoore·2h

@jeflopo @thdxr @vercel_dev @ButcherOfVercel lordy, that escalated quickly

English

0

1

37

Jeflopo@jeflopo·3h

@thdxr @vercel_dev yeah disgusting company This is Vercel moneo.com.tr/blog/the-zioni… Fuck vercel and the @ButcherOfVercel #ethicsmatter

English

2

1

6

1.1K

dax@thdxr·3h

@vercel_dev you just stole the socket.io creators work without any acknowledgement???

English

27

3

758

33.8K

Lee Moore@leegmoore·3h

@LexnLin Is the jump that big? I'm finding 5.5 medium is roughly equivalent to 5.4 xhigh.

English

1

0

366

Leon Lin@LexnLin·4h

GPT 5.4 to GPT 5.5 was a bigger jump than GPT 5.5 to GPT 5.6 is going to be, right? btw did they forget the vagueposting?

English

10

0

77

8.2K

Lee Moore@leegmoore·20h

Where is the good GLM 5.2 inference these days? Fireworks? On z.ai coding plan, 5.2 is getting dramatic and crashing out and lost in neurotic thought loops. This wasn't happening a few days ago.

English

0

35

Lee Moore@leegmoore·1d

@TechByTaraa if you only use one or the other, you are already hobbling yourself

English

0

1

1.4K

tara_@TechByTaraa·1d

I'm a Claude user. Give me one reason to switch to Codex

English

323

14

778

225.7K

Lee Moore@leegmoore·1d

Making great specs forces you to get clarity on what you want. Having great specs provide agents high signal instructions for build and test. They give the PR easy no-slop-gates before the human reviewer spends valuable attention on it.

dex@dexhorthy

If you feel like PR review / code review is the bottleneck in your system - figure out how to increase the odds that the code is 95% or 99% correct by the time it gets to review. You’re not spending enough time on writing a great spec, and thats why the implementation deviates. Incorrect decisions earlier cascade down. Its all comes down to back pressure.

English

0

1

92

Lee Moore@leegmoore·1d

@dexhorthy This is so god damn on point. Bro get out of my head

English

0

1

170

dex@dexhorthy·1d

If you feel like PR review / code review is the bottleneck in your system - figure out how to increase the odds that the code is 95% or 99% correct by the time it gets to review. You’re not spending enough time on writing a great spec, and thats why the implementation deviates. Incorrect decisions earlier cascade down. Its all comes down to back pressure.

English

10

8

108

9.5K

Lee Moore@leegmoore·1d

Claude 5x or 20x max + gpt plus. use claude for front end. or z.ai coding plan pro + gpt plus. this gets you a fair amount of GLM or Claude for front end and bread and butter work and enough codex to for some back end, pedantic reviewer, pedantic verification.

English

0

110

Rijn@RijnHartman·1d

need help choosing the next ai coding plan gpt-5.5 has felt dumber for me over the last 2 days and i’m hitting limits claude code, cursor, or wait for 5.6? or something else? i mainly care about frontend quality (for what i'm working on rn) and not running out of usage

English

39

1

35

9.3K

Lee Moore@leegmoore·1d

GLM 5.2 is an open weight model that crossed the Opus 4.5 inflection point. The excitement isn't hype. It's recognition of significance.

English

0

60

Lee Moore@leegmoore·1d

@bentlegen It's not all Machiavellian Machinations. GLM 5.2 has brought crossed the Opus 4.5 inflection point in coding. This is a big fucking deal for open weight

English

0

284

Ben Vinegar@bentlegen·1d

Every hosting provider understands that if Claude/Codex can build and publish websites (they can), they are now competitors Gotta promote alternatives, especially open ones

Chubby♨️@kimmonismus

Even the Vercel CEO is impressed/shocked at how good GLM-5.2 in coding is. open source, open weights.

English

6

2

86

15.1K

Lee Moore@leegmoore·1d

@mattpocockuk It’s a side effect of distilling a smaller model for coding activities and related long horizon agentic work, it gets worse at general tasks (like working out how to teach solving a rubics cube) This isn’t an opus 4.6 replacement for all tasks, just some big things like coding

English

0

213

Matt Pocock@mattpocockuk·2d

This is a bad thing, btw - if a model takes 3 turns to exit the smart zone it's bad

English

9

1

232

28.5K

Matt Pocock@mattpocockuk·2d

GLM-5.2 is a monster thinker Trying it with pi and my /teach skill, learning to solve the cube Even on the lowest 'effort' (high) it spits out longer thinking traces than anything I've ever seen 3 turns, 2-3 file reads, nearly 220K (!) of thinking traces

English

95

48

1.8K

222.6K

Lee Moore@leegmoore·1d

They also work well for language ports of modular software with lots of tests. Port tests for a module then port the module. Rinse repeat for all modules. If 80% of the code is like that you can knock all that out with a decent control loop This is why Jared Sumner felt comfortable using Mythos and Dynamic Workflows (Claude code control loop generator) to port (many say slop is a better verb than port) Bun to Rust

English

0

916

Armin Ronacher ⇌@mitsuhiko·1d

I decided to do some experiments with looping over the weekend. The only cases where they work so far for me are a) review b) research c) autoresearch. If someone uses them for actual implementation on a medium sized project, would love to have something to look at!

English

58

6

416

149K

Lee Moore@leegmoore·1d

They also work well for language ports of modular software with lots of tests. Port tests for a module then port the module. Rinse repeat for all modules. If 80% of the code is like that you can knock all that out with a decent control loop This is why Jared Sumner felt comfortable using Mythos and Dynamic Workflows (Claude code control loop generator) to port (many say slop is a better verb than port) Bun to Rust

English

0

1

1.1K

Lee Moore@leegmoore·1d

@github I've checked my account. I don't see an extra 200

English

3

0

5

2.5K

GitHub@github·2d

Weekends are for building. Copilot Max users, check your account for an extra $200 in credits to power your next build in the GitHub Copilot app. Stand by for more offers for Pro and Pro+ users.

English

63

52

605

145.6K

Lee Moore@leegmoore·2d

WTF X? Talk about fake trolling news. That shit is next level gaslighting for those of us still in the 27 stages of grief about Fable.

English

1

0

1

99

Lee Moore@leegmoore·2d

I've been messing with this off and on. Do you extend lint rules in language linters like eslint or do you write custom cli wired into bun/pnpm scripts for different layers of deterministic code quality/Architecture adherence linting? or do you do it it another way? I agree it's not sufficient but it's another dev time feedback mechanism to help keep agents on rails and I don't see many folks sharing their specific techniques and tips on how they do it

English

1

0

4

1.1K

dex@dexhorthy·2d

you should have a linter Hands down You should have detailed rules, you should push determinism as far as it can go Use ast analysis to tell your coding agents what needs to be fixed You should absolutely do this BUT If your anti-slop strategy is an LLM and a handful of linters You’re gonna be disappointed

English

29

17

406

33.4K

Lee Moore@leegmoore·2d

@ZackKorman As a Principal Engineer at one of these scrub companies I 100% agree. I wasn't disagreeing with the shitty part, just the what flavor of shitty it was

English

0

1

32

Zack Korman@ZackKorman·2d

@leegmoore Scrub f500 companies happen to have a huge amount of software and systems

English

1

0

1

282

Zack Korman@ZackKorman·2d

The “delay Mythos so cybersecurity can prepare” crowd believes companies proactively invest in cybersecurity to defend against future threats. You sweet summer child.

English

60

100

1K

40.9K

Lee Moore@leegmoore·2d

Nothing. Though I'd say GLM-5.2 has opus 4.6 like capabilities and 4.7 and 4.8 are functionally regressions (despite the benchmarks) so current glm 5.2 hit original 4.6 performance (4 months old) so fable from china in 3-6 months? Probably pre-training it now. Unless they need an unnerfed Mythos/Fable to distill from , then it will be a bit longer

English

0

3

346

Carlos E. Perez@IntuitMachine·2d

If GLM-5.2 has Opus 4.8-like capabilities, then what prevents Z.ai from creating a Mythos/Fable level AI? Is the cat already out of the bag wrt higher capable models?

English

23

3

45

6.8K

Lee Moore

Entdecken