
Shane
2.1K posts

Shane
@ShaneRobinett
Fellow Human on the Journey Around the Earth. 7 kids - and a grandkid now!
Osteen, FL · Joined October 2008
618 Following · 1.7K Followers

@Breaking911 I will be swimming in my alligator infested river in my backyard again today

@DefiantLs You later find out he is supported by Hamas and Iran :)
Shane retweeted

mcdonalds CEO says making burgers at home is moving down a "very dangerous path"
Polymarket Money@PolymarketMoney
BREAKING: Anthropic CEO says open-source AI is moving down a “very dangerous path.”

My entire AI stack is now Chinese 🇨🇳
87% cheaper. same revenue
swaps by task:
1. reasoning / backend brain
Opus 4.8 → Kimi K2.7
benchmark gap: ~8% · price: ~11x cheaper
2. code generation
GPT-5.5 → Qwen 3.7 Max
benchmark gap: ~18% · price: ~7x cheaper
3. agent loops + tool calling
Sonnet 4.7 → GLM 5.2
benchmark gap: ~3% · price: ~5x cheaper on input
4. cheap volume / bulk processing
GPT-5.5 mini → MiMo V2.5
benchmark gap: ~6% · price: ~12x cheaper
5. image generation
GPT-Image-2 → Wan 2.5
benchmark gap: ~5% · price: ~8x cheaper
6. video generation
Sora 2 → Kling 3.0
benchmark gap: roughly equal · price: ~6x cheaper
[ result after 30 days: ]
operating costs dropped 87%, output quality dropped 4% on average, revenue unchanged
the most important thing is that these models will not be banned in a month and i can run them locally
nobody will steal my data and i can learn them as i need
full article drops tomorrow with:
> exact routing logic per task type
> the 2 cases where I still pay for American
> the migration playbook anyone can copy in a weekend
VERY IMPORTANT to get migrated now, while it's not too late
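
The per-task swaps above amount to a routing table, and the claimed savings can be sanity-checked with a little arithmetic. What follows is only a sketch, not the poster's routing logic: the `Swap` type, the `ROUTES` dict, and both helper functions are hypothetical names, and the price ratios are the post's own unverified "~Nx cheaper" figures (the GLM 5.2 figure was stated for input pricing only).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Swap:
    old_model: str
    new_model: str
    price_ratio: float        # "~Nx cheaper", as claimed in the post
    benchmark_gap_pct: float  # claimed gap; 0.0 stands in for "roughly equal"


# Task keys are illustrative labels; model names and numbers come from the post.
ROUTES = {
    "reasoning": Swap("Opus 4.8", "Kimi K2.7", 11, 8),
    "code": Swap("GPT-5.5", "Qwen 3.7 Max", 7, 18),
    "agent": Swap("Sonnet 4.7", "GLM 5.2", 5, 3),
    "bulk": Swap("GPT-5.5 mini", "MiMo V2.5", 12, 6),
    "image": Swap("GPT-Image-2", "Wan 2.5", 8, 5),
    "video": Swap("Sora 2", "Kling 3.0", 6, 0),
}


def route(task_type: str) -> str:
    """Return the replacement model for a task type."""
    return ROUTES[task_type].new_model


def blended_savings(spend_share: dict[str, float]) -> float:
    """Fraction of the old bill saved, given each task's share of old spend."""
    old_total = sum(spend_share.values())
    new_total = sum(
        share / ROUTES[task].price_ratio for task, share in spend_share.items()
    )
    return 1 - new_total / old_total


if __name__ == "__main__":
    equal_spend = {task: 1.0 for task in ROUTES}
    print(route("agent"))
    print(f"{blended_savings(equal_spend):.1%}")
```

Under the assumption that spend was split evenly across the six task types, the quoted ratios work out to roughly 86.5% savings, which is in line with the 87% figure; a different spend mix would shift the result.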

Shane retweeted

If you mainly use local LLMs for Hermes-style agentic loops, this might surprise you:
Qwen 3.6 35B actually *beats* DeepSeek v4 Flash — especially on tool-heavy & coding-adjacent workflows.
You're not missing out.
Full results 👇
github.com/MiaAI-Lab/Qwen…


My god, who is still using Gemini?
How is there still demand left?
Kalshi@Kalshi
JUST IN: Google reportedly doesn't have enough AI capacity to meet demand
Shane retweeted

@Teknium @tunahorse21 for a newcomer who is highly technical, what are the absolute musts for a fresh Hermes setup?

@fugitive_druid what is the antifa flag? I kinda want one for reasons that will remain undisclosed. ;)

@JoelDeTeves What sucks for me is that I got 2b70 but only a single x16; the other is on x4. It works, with one as primary, but on larger models the load-distribution pain shows up as some slowdowns.

Dual 3090s are the absolute play for cheap local inference
You can run them on 500 watts of electricity combined, even less if you choose
Two cards on a single household outlet gives you such incredible intelligence density and flexibility now
You don't need NVLink. If you have two PCIe x16 slots and sufficient CPU power, you're golden
This is the way


@ShaneRobinett I agree it often is wrong morally.
So is calling your parents nasty names.

@ShootyBearX @sudoingX Usually 100k is good, but for larger code bases or extended projects I want 250k or more.

@ShaneRobinett @sudoingX Between 250 and 175 tps for load and about 13tps for output which slows down to about 11 for 100k context.
I get about the same output with my 4090+5090 combo but 5-6x faster loading.
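
To put those rates in wall-clock terms, here is a rough calculation. It assumes "load" means prompt processing (prefill) and "tps" means tokens per second, neither of which the post states outright; the function name and the 1,000-token reply length are made up for illustration.

```python
def seconds_for(tokens: int, tokens_per_second: float) -> float:
    """Time to push a given number of tokens through at a fixed rate."""
    return tokens / tokens_per_second


CONTEXT_TOKENS = 100_000  # the 100k context mentioned in the thread
REPLY_TOKENS = 1_000      # hypothetical reply length

if __name__ == "__main__":
    for rate in (250, 175):  # quoted load rates
        print(f"prefill at {rate} tps: {seconds_for(CONTEXT_TOKENS, rate):.0f} s")
    # Quoted output rate at 100k context
    print(f"reply at 11 tps: {seconds_for(REPLY_TOKENS, 11):.0f} s")
```

At the quoted rates a full 100k-token prompt takes somewhere between about 400 and 570 seconds to process, so a 5-6x faster load makes far more difference on long contexts than the similar output speed would suggest.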

okay nerds, how much memory do you actually own right now? not rented, owned, sitting on your desk. i'll start:
> dgx spark, 128gb unified
> strix halo, 128gb unified
> 5090 laptop, 24gb vram + 64gb ram
> 3090 node, 24gb vram + 32gb ram
> 3060 node, 12gb vram + 16gb ram
> old acer laptop, 8gb (yes it counts)
> phone, 12gb ram
448gb of memory i own outright. all mine. flex yours.

@0xSero @elonmusk @beffjezos I ran out of my from super heavy in 13 days. not happy with that part :)

@elonmusk @ShaneRobinett @beffjezos Grok code is really good, I’ve been using it more and more lately. Hopefully it won’t be restricted

Looks like Grok is back on the menu
Elon Musk@elonmusk
Grok 4.5, based on our 1.5T V9 foundation model, with Cursor data added in supplemental training, is now in private beta at SpaceX & Tesla. Early evals show performance close to, perhaps exceeding Opus. RL is continuing to significantly improve the model, and the Grok Build harness gets better every day. Nice work by all those involved! Completely trained from scratch new models will be released by @SpaceX every month this year.








