wheelofforeplay

621 posts

wheelofforeplay

wheelofforeplay

@wheelforeplay

https://t.co/FrGvWwSzMy

Katılım Aralık 2025
38 Takip Edilen7 Takipçiler
wheelofforeplay
wheelofforeplay@wheelforeplay·
@lennysan Google is crashing, amazon is crashing, coinbase is crashing.... need I go on?
English
0
0
0
161
Lenny Rachitsky
Lenny Rachitsky@lennysan·
Engineers don't write code. PMs are shipping to production. The design process is dead (there's no time). Marketing can ship their own campaigns. SDRs are being replaced by AI. Everyone's a data scientist now. What a time to be alive.
English
187
75
1K
509.7K
Alex Cheema
Alex Cheema@alexocheema·
@AI_with_Eric Will be publishing the most comprehensive set of local AI benchmarks. Not long now.
English
2
0
8
529
Alex Cheema
Alex Cheema@alexocheema·
It’s kind of crazy but the shitstorm of supply chain issues has created a new best-in-class local AI deployment: M5 Max MacBook clusters. - The memory unit economics are great - each MacBook has 128GB @ 614GB/s for $5k - M5 Max added tensor cores (Apple Neural Accelerators) with 4x compute of M4 Max (~60TFLOPS fp16) - You can cluster them with RDMA over Thunderbolt 5 and @exolabs for ~linear scaling (so memory bandwidth really is additive: 4 x MacBooks are 512GB @ 2456GB/s) You can’t get Mac Studios right now, so customers are buying MacBook clusters instead. A government is running this in prod. This setup is best for low-batch decode-heavy inference (all memory bound) and transcription (super fast and cheap on apple silicon).
Alex Cheema tweet media
Alex Cheema@alexocheema

It is unconventional but it actually works, depending on the workload of course. There are strengths and weaknesses for sure. There are some real deployments (governments, big companies) running this setup in production (pods of 4 MacBooks). It's the best price to performance for many workloads (e.g. transcription, low batch LLM inference). They landed on this themselves as the best hardware to run their workloads on. Can share more in private if you are interested (don't want to turn this into a sales pitch for exo). If Apple actually sold us an M5 Max / M5 Ultra Mac Studio, then we'd use that. But we could be waiting until October for that (or longer, the supply chain issues seem pretty bad). It's the same M5 Max chip in the MacBook as the Mac Studio, and it goes up to 128GB unified memory. Each chip has 614GB/s memory bandwidth (2.24x DGX Spark). I would say the main downside (which we should make more clear) is the software ecosystem - it's still quite immature. It has got much better in the last year e.g. clustering came a long way with low-latency RDMA in macOS 26.2.

English
24
13
197
31.7K
wheelofforeplay
wheelofforeplay@wheelforeplay·
@jun_song 3090 can be a cheaper alternative to most room heating costs though so keep that in mind if you live anywhere where you often need heating
English
0
0
2
477
송준 Jun Song
송준 Jun Song@jun_song·
Best budget local llm hardware comparison: 3090 vs Mac Studio M1 Max 64gb Price : both ~$2k to set up Pros on 3090 : much better performance (27b vs 35b at similar tok/s) Pros on Mac : Power efficiency, bigger RAM, zero heat/noise, reliability
송준 Jun Song tweet media
English
30
7
169
21.2K
송준 Jun Song
송준 Jun Song@jun_song·
Frequent question : What is my local llm setup now Macbook Pro M5 Max 128GB - running Deepseek-V4-Flash-JANGTQ2 - multi agents set up with Minimax, GLM plan - Harness : Hermes Agent 2xDGX Spark - post-training models and world AGI research 24/7 - SuperGemma4-26b agent sub
English
23
11
280
15K
AURAGOALX
AURAGOALX@AURAGOALX·
🚨🗣️ Micah Richards DROPS BOMBSHELL over the VAR controversy involving Arsenal FC 🗣️ Richards: “If this whole VAR situation is properly investigated, then serious questions will need to be asked.” “You cannot have this level of controversy around key title-deciding moments without scrutiny.” “And if any wrongdoing or major officiating failures are uncovered, people will start talking about consequences… even potential points deductions.” 😳 “That changes everything.” “Because now we are no longer talking about one bad decision.” “We are talking about the integrity of the Premier League title race.” “I’m not accusing Arsenal of anything directly.” “But football fans deserve transparency, especially when margins are this small at the top.” “If the league wants trust from supporters, they must address every controversy properly.” 🔥⚖️
AURAGOALX tweet mediaAURAGOALX tweet media
English
267
510
3.9K
140.9K
wheelofforeplay
wheelofforeplay@wheelforeplay·
@MitoGouken Its the Pro models, the moment the pro model comes out I dont want to upgrade nor do I want to play on my base model
English
0
0
0
155
MitoGouken
MitoGouken@MitoGouken·
- Marathon flopou - Saros flopou - GOW Sons of Sparta flopou - PlayStation 5 tendo uma grande queda de vendas no último relatório financeiro - Mais de 700 milhões em prejuízo com a Bungie - Horizon Hunters Gathering é o próximo a flopar - FairGames provavelmente vai até ser cancelado PlayStation tá tendo um ano TERRÍVEL até agora.
Português
173
79
1.4K
102.4K
Brendan Falk
Brendan Falk@BrendanFalk·
If you're an Aussie engineer and think you could crush this interview, please DM me! We have 100s of thousands of users, millions in revenue, and I would argue the best AI app builder. Lots more to do!
Brendan Falk@BrendanFalk

I believe we've found the best AI-native coding interview We call it the “Composer 1 interview” Candidates get 1 hour to build a real, medium-sized project live The only constraint: they have to use Cursor’s Composer 1 model

English
22
3
131
25.8K
wheelofforeplay
wheelofforeplay@wheelforeplay·
@mwfowlie @BrendanFalk Sounds like much more reasonable intelligence test to me than the usual shit process most companies put you through
English
0
0
0
15
Michael Fowlie
Michael Fowlie@mwfowlie·
@BrendanFalk Why would you use this sort of interview technique? I’m not complaining. I’d be good at it. But typically you want to measure intelligence or a close proxy to it, not a skill that can be learned quickly.
English
1
0
1
167
wheelofforeplay retweetledi
Mal
Mal@UtdMaI·
🚨Oliver Glasner just delivered one of the coldest press conference responses of the season Reporter asked him about the title race and whether Palace could influence it… Glasner replied: “I checked my pay slips… I didn’t get any money from Arsenal or City.” Then he finished with THIS: “Our influence in the title race is definitely less than VAR.”
Mal tweet media
English
356
5.6K
55K
2.1M
Rob Hallam
Rob Hallam@robj3d3·
Tony Stark built Jarvis. Jarvis didn’t build Tony Stark.
English
48
12
195
9.7K
Shpeshal Nick
Shpeshal Nick@Shpeshal_Nick·
I swear if Saros doesn’t succeed and we lose Housemarque…
English
203
33
884
73.8K
Senator Babet
Senator Babet@senatorbabet·
Capital gains tax shouldn’t exist. I risk my money. I build the business. I make the investment. I do the work. I take the risk. So why the hell should the government take a cut of my success? They risk nothing. They create nothing. They just take. Parasites. F’en parasites.
English
3K
9.5K
68.2K
1.4M
송준 Jun Song
송준 Jun Song@jun_song·
Local LLM fits for Mac RAM size (5/14): >~32gb : SuperGemma4-e4b-mlx >32~64gb : Qwen3.6-35b-mlx-6bit >96~128gb : Minimax-M2.7 / Deepseek-V4-Flash (JANGTQ by @dealignai) >256gb : Xiaomi-MiMo-V2.5 >512gb : GLM-5.1-RAM-420GB-MLX Follow and keep on updates
English
20
20
273
15.6K
wheelofforeplay
wheelofforeplay@wheelforeplay·
@Edouardmazza @jun_song @dealignai I ran it through multiple coding tasks and I simply found the output of 27B superior to 35B. 35B is definitely faster though, so its a quality (27B) vs speed (35B) choice.
English
0
0
1
39