Dr Token

568 posts

Dr Token banner
Dr Token

Dr Token

@TokenDr2005

WAGMI (or maybe not). I think, I think, and I think.

Dubai, United Arab Emirates Bergabung Ekim 2023
668 Mengikuti228 Pengikut
Dr Token me-retweet
Arena.ai
Arena.ai@arena·
Exciting news: GLM-5.2 (Max) ranks #2 in Code Arena: Frontend, with +29pt over Claude Opus 4.7 (Thinking) and only behind Fable 5! GLM-5.2 is the best open model vs Kimi-K2.6 and Minimax-M3 by a large margin. - #2 React and #4 HTML sub-leaderboards - Ranks as the top model in nearly all sub categories: Brand & Marketing, Reference-Based Design, Data & Analytics, Consumer Product, Gaming, and Simulations. Congrats @Zai_org for the incredible milestone!
Arena.ai tweet media
Arena.ai@arena

GLM-5.2 (Max) by @Zai_org ranks #10 on the new Agent Arena leaderboard, closely matching Claude-Opus-4.8 (non-thinking) and is the #1 open model by a wide margin! In Agent Arena, we measure models on millions of real-world, long-horizon agentic tasks from a global community of users. Models can access web search, filesystem, and terminal tools to complete complex workflows. The leaderboard measures model performance on outcomes relative to the average model using a causal tracing methodology. Compared to 5.1, GLM-5.2 (Max) climbs from #13 to #10. Its clearest gains are confirmed task success, and user praise vs. complaint. Bash capabilities and tool hallucination remain stable. There is a tradeoff in steerability compared to the previous model (-6.0% vs. +1.2%). GLM-5.2 remains the same price as GLM-5.1, $1.4/$4.4 per input/output MTokens. 1M context window. Huge congrats @Zai_org for the incredible release! See thread for details on how GLM-5.2 (Max) performs across 5 different signals.

English
146
434
3.9K
1M
Noah Liu
Noah Liu@noahhhlzl·
hiring: someone to run our X account. part-time. $10K/month. I don't care about hours. I care about results. you need to: → actually live on the internet, not just post on it → be deep in AI news (not newsletter-deep, actually-there-deep) → have taken an account from 0→10 before, and can prove it if that's you → comment, I'll reach out. know a virality magician? tag them below. if we hire them, you get a $10K referral fee, minimum.
Noah Liu tweet media
English
366
13
550
74.9K
Dr Token
Dr Token@TokenDr2005·
let's go back to sleep. Opus is down
Dr Token tweet media
English
0
0
0
22
Dr Token
Dr Token@TokenDr2005·
Elon Musk acquired Twitter for $44 billion. Cursor? $60 billion . A social media platform used by hundreds of millions of people is worth less than an AI coding tool.
Dr Token tweet media
English
0
0
0
32
Dr Token
Dr Token@TokenDr2005·
Claude users, here’s a small life hack: Set an iPhone automation via Shortcuts to send “Hi” 2–3 hours before you wake up. Wake up and start vibecoding. Use your entire session. By the time you hit the limit, your next 5 hour session is already available.
Dr Token tweet mediaDr Token tweet media
English
0
0
0
40
Dr Token
Dr Token@TokenDr2005·
If someone from Anthropic is seeing this: pls bring back Fable. If that’s not happening, at least remove the “Currently Not Available” preview entirely. Seeing a feature preview for something nobody can actually use is just frustrating.
English
0
0
0
40
Tom ☕
Tom ☕@codevsdev·
what do you think about this?
Tom ☕ tweet media
English
78
3
70
4.2K
hayden
hayden@haydendevs·
finally looking into openclaw and hermes agent, i feel like a boomer man, i just don't get the use case
English
419
53
3.6K
803.9K
Dr Token
Dr Token@TokenDr2005·
Me opening Facebook only when I need to do some marketing.
Dr Token tweet media
English
1
0
1
89
Nous Research
Nous Research@NousResearch·
In partnership with @stripe, Hermes Agent now supports a full suite of Stripe skills. Your agent can buy things, pay per-call APIs, and provision its own SaaS, with configurable safety limits on every action.
Nous Research tweet media
English
196
364
4.6K
422.3K
spidey
spidey@lochan_twt·
bro disappeared like never existed
spidey tweet media
English
289
101
4.8K
531.4K
Dr Token
Dr Token@TokenDr2005·
@omarvvvr I can check, but how will I know if there’s some problem with the code?
GIF
English
0
0
0
15
Omar
Omar@omarvvvr·
Are you checking every line of code written by AI?
English
189
1
100
13.2K
Dr Token
Dr Token@TokenDr2005·
@wahab_twts Both Deepseek v4 pro and flash are decent models for everyday use
English
0
1
3
553
Dr Token
Dr Token@TokenDr2005·
someone pls tell me about Grok Build
Dr Token tweet media
English
0
0
0
48
Brutox
Brutox@wahab_twts·
Does anyone even use DeepSeek anymore? 😭
Brutox tweet media
English
102
2
128
13.8K
Aibek Jumabek
Aibek Jumabek@aibekjumabek·
Be honest: Which is better programmer? Human or AI?! Why?
English
36
1
22
1K
Hermes Agent Tips
Hermes Agent Tips@HermesAgentTips·
hermes agent users.. be honest when hermes agent desktop app was released did you moved completely to it or did you try it and went back to the CLI?
English
192
0
145
30.3K
Dr Token
Dr Token@TokenDr2005·
At this point in my life, I trust an expert vibe coder more than an expert programmer. Not because it’s faster; it’s “trust”
Dr Token tweet media
English
0
0
0
71