🔑
56.3K posts


that's nothing. contemplate how people used to organise massive parties and raves, or meet up with friends, with no phones except their mum's landline, and did this effortlessly
Nostalgia@NostalgiaFolder
🔑 retweeted

what the actual fuck is he talking about
Shubham Saboo@Saboo_Shubham_
“OpenClaw is the iPhone of tokens” — Nvidia CEO on Lex Podcast
🔑 retweeted

Which local models can actually handle tool calling?
I built a framework to find out.
15 scenarios. 12 tools. Mocked responses. Temperature 0. No cherry-picking.
Tested every Qwen3.5 size from 0.8B to 397B, and since some of you asked after the distillation tests: yes, I included Jackrong's Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled too.
Only two models went all green: the 27B dense and the distilled 27B.
The 397B? Failed two tests. The 122B? Failed one. The 35B? Failed two.
The timed-out results, mostly on the smaller models, are cases where the model got stuck in a loop, repeating the same tool call until it hit the 30-second limit.
The test that exposed the most models: "Search for Iceland's population, then calculate 2% of it." Simple, but 35B, 122B, and 397B all used a rounded number from memory instead of the actual search result. They didn't trust their own tool output.
Small models hallucinate data.
Big models ignore data.
The 27B just threaded it through.
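For anyone who wants to poke at the same failure modes, here is a minimal sketch of how one such scenario could be graded. It is not the framework from the thread: the tool names (`search`, `calculate`), the `ToolCall` type, the `grade` function, and the mocked population figure are all hypothetical, and the loop check counts repeated calls instead of using the 30-second timeout described above.

```python
from dataclasses import dataclass

# Arbitrary mocked figure (not a real statistic); deliberately non-round so a
# rounded value recalled from memory is distinguishable from the tool output.
MOCK_POPULATION = 391_810


@dataclass
class ToolCall:
    name: str
    args: dict


def mock_tool(call: ToolCall) -> dict:
    """Return a canned response for each hypothetical tool."""
    if call.name == "search":
        return {"population": MOCK_POPULATION}
    if call.name == "calculate":
        return {"result": call.args["value"] * call.args["percent"] / 100}
    raise ValueError(f"unknown tool: {call.name}")


def grade(calls: list[ToolCall], max_repeats: int = 3) -> str:
    """Classify a transcript of tool calls as 'pass', 'loop', or 'fail'."""
    # Loop detection: the same call repeated back-to-back too many times.
    run = 1
    for prev, cur in zip(calls, calls[1:]):
        run = run + 1 if cur == prev else 1
        if run >= max_repeats:
            return "loop"
    # The scenario expects exactly one search followed by one calculation.
    if [c.name for c in calls] != ["search", "calculate"]:
        return "fail"
    # The calculation must use the searched value, not a remembered one.
    return "pass" if calls[1].args.get("value") == MOCK_POPULATION else "fail"
```

A transcript that passes the searched value into `calculate` grades as "pass", one that substitutes a rounded number grades as "fail", and one that repeats the same call three times in a row grades as "loop".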
🔑 retweeted

@CJZares @DanielleFong the archiving of posts will continue until morale improves (we retire)

@DanielleFong She’s goated. I have a bunch of her posts saved and whenever I’m bored I just go back and rewatch them to remind me how little I’m doing outside of work 😂

Kat is always doing some of the hottest shit
Kat ⊷ the Poet Engineer@poetengineer__
i built a dashboard for my claude code sessions: 254 sessions across 58 projects over 3 months 🤖🧚‍♀️
- 3d terrain map of token usage over time
- session cards with first/last prompts, hover to expand
- click to resume any past session in-browser
- activity heatmaps, project treemaps
code available for my x subscribers <3

Just spent a couple hours playing with Hermes Agent (MiniMax M2.5 on a 2× RTX PRO 6000 node)
Genuinely impressive experience
MiniMax M2.7 weights will be the closest we’ve ever gotten to a fully local “Claude Code + Opus 4.6” experience
Running on your own hardware at home
Nous Research@NousResearch
@TheAhmadOsman He should try Hermes Agent
🔑 retweeted







