Crown 👑

28.4K posts


@ciruai

Local AI Consulting. AI is about the workflow, not the model. AMD Local LLM Group: https://t.co/0wQDCDXlzO

United States · Joined May 2009
2.8K Following · 7.1K Followers
Mike Gannotti @MichaelGannotti
@ciruai @NVIDIAAI Thanks. A little different in that it has 128 GB of RAM and my AMD unit has 96, but it will be interesting to see.
Mike Gannotti @MichaelGannotti
Just ordered! @NVIDIAAI DGX Spark. Can't wait to get my hands on it
Mike Gannotti @MichaelGannotti
@mr_r0b0t Yup... the clickbait posts showing $1,900... If it was that cheap I'd have bought one today.
mr-r0b0t @mr_r0b0t
What is all this nonsense I keep seeing about the AMD Strix Halo being $2000!?! Half the posts aren’t even showing the correct device? It isn’t $2000, it was announced at $3999. That’s the only price I’ve seen anywhere. I spec’d out a comparable Framework desktop for reference.
Lance @sirius_devops
@sudoingX Stop the lies. The AMD box is the same price. These LLM-generated grift posts should get people banned.
Sudo su @sudoingX
nvidia vs amd two boxes on my desk, both 128gb of unified memory. one is the nvidia dgx spark ($4,699). the other is the amd strix halo ($1,999), amd at roughly half the price. i'm running the exact same models on both, from a 3b all the way up to a 397b, same quants, same llama.cpp, and i'm posting every single number. here is why it actually matters. if the amd box just keeps pace, that's a nice story. but if it matches or beats a box that costs twice as much, the entire calculus for buying local ai hardware changes overnight. i already have the first numbers and they made me sit up. holding them for the full breakdown. stay tuned anon. this matchup is going to shake some ground.
Daniel Birker @DBirker78883
@sudoingX I feel like the Strix Halo is the sleeping giant of local ai.
Crown 👑 @ciruai
@sudoingX It's not fair to use the same quants on both when there are quants optimized for each.
Sudo su @sudoingX
the rules, so it's fair: identical models, identical quants, same llama.cpp build, both boxes idle, every number posted whether it flatters amd or nvidia. full disclosure, both nvidia and amd plus framework sent these units for honest testing, no money on either side. that's the whole point, nobody's paying me to crown a winner, the silicon decides. 3b dense up to a 397b moe, the full range. first numbers dropping soon.
Volodymyr Styran 🇺🇦 @arunninghacker
I ran OpenCode riding an uncensored Qwen3.6 running on a Mini-sized shiny silver box with an NVIDIA card in it for a couple of days, and I assure you: all this ethics/regulations/export control frontier AI drama will be over very, very soon
Crown 👑 @ciruai
@SMNYC1 @arunninghacker Imagine the power is out for an extended period and yours is the only house that basically still has the whole internet self-contained within it!
bird @SMNYC1
@ciruai @arunninghacker I feel like there has to be a market for the people who buy the 10-year food storage kits. They could rebuild civilization after they come out of the bunker with AI in a box. It's a neat thought.
Crown 👑 @ciruai
@SMNYC1 @arunninghacker Currently solar powered and running AI locally. Hypothetically, if power and internet go out, I could still run AI at home.
bird @SMNYC1
@ciruai @arunninghacker It's an amazing way to compress a ton of information. Like wow. Having it work durably and offline would be good if the web ever blows up. It's always a DNS issue lol
Crown 👑 @ciruai
@GMMeyer @benrayfield @BLUECOW009 I have no illusions about someone's ability to just download a weight file. Specifics are going to matter. That's why I say we can't really war game it without the details.
Greg Meyer @GMMeyer
@ciruai @benrayfield @BLUECOW009 have you ever worked at an enterprise tech company? i can 100% guarantee almost no one has access to the model itself in the way you’re suggesting and any access of this kind is logged
@bluecow 🐮 @BLUECOW009
Kinda crazy we never had the weights for a sota model be leaked
Crown 👑 @ciruai
I still rely heavily on OpenAI at home, even though I have access to dozens of models running locally and have spent hundreds of hours preparing for local-only use. A couple of times in the last month the internet went out for an extended period (someone crashed into a main hub and died 🥲). Had to test everything I had been working on and keep moving without it. Found some important gaps. It's a lot of fun though, to be honest.
bird @SMNYC1
@ciruai @arunninghacker Yeah. I'm from the before times when we had our own computers and no internet. Ownership is kinda retro and good sec ops.
Crown 👑 @ciruai
GPU VMs are much more expensive than just using an API. It's going to cost you $1+ an hour. Adds up if it's always on. An API for chatting probably will only cost you $20 a month or less but at that point why are you not just using gpt anyway? The reason to go local is to be in control.
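A rough sketch of the arithmetic behind that comparison, using only the two figures from the post ($1 an hour for a GPU VM, about $20 a month for a chat API). Both numbers are the post's ballpark estimates, not quoted provider prices, and the 30-day always-on month and the function name `monthly_vm_cost` are assumptions made for illustration.

```python
# Ballpark figures from the post; real prices vary by provider and usage.
VM_HOURLY_RATE = 1.00      # dollars per hour for a GPU VM ("$1+ an hour")
API_MONTHLY_COST = 20.00   # dollars per month for chat-level API use

# Assumption: the VM is left running all day for a 30-day month.
HOURS_PER_MONTH = 24 * 30


def monthly_vm_cost(hourly_rate: float, hours_on: float = HOURS_PER_MONTH) -> float:
    """Return the monthly bill for a VM billed per hour of uptime."""
    return hourly_rate * hours_on


vm_monthly = monthly_vm_cost(VM_HOURLY_RATE)
ratio = vm_monthly / API_MONTHLY_COST

print(f"Always-on GPU VM: ${vm_monthly:.2f} per month")
print(f"Chat API:         ${API_MONTHLY_COST:.2f} per month")
print(f"The VM costs about {ratio:.0f}x as much")
```

Passing a smaller `hours_on` (say, a few hours of use per day) shows how quickly the gap narrows when the VM is not always on.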
Crown 👑 @ciruai
@GMMeyer @benrayfield @BLUECOW009 Would need to dig into specifics to map out the best way. Kind of hard to say hypothetically. Impossible to stop if people can't be trusted. In the end it's your people who protect you.
Crown 👑 @ciruai
Can do it on a nice $1000 laptop too, of course. "Under $5000" is the same answer as "what does a nice computer cost?" I have a nice chatbot on a $150 laptop, but that's not going to replace the experience of chatting with ChatGPT. You can with better hardware (just chatting, not image gen or all the other fancy tools it has now, unless the price gets closer to that $5000 number).
bird @SMNYC1
@ciruai @arunninghacker I'm a dev, so I can get it to run. But 5k for a library chat bot seems steep. 🤔
Crown 👑 @ciruai
@MrPeterLMorris @sudoingX @FrameworkPuter Assume whatever you want. He repeated the invalid claim again, and Framework responded to my tweet confirming the issue. So whatever your assumptions are, the reality isn't looking good for this lazy influencer's credibility.
Sudo su @sudoingX
the one box i was missing just landed anon. this is the @FrameworkPuter desktop with amd's strix halo, ryzen ai max+ 395, 128gb of unified memory, up to 96 of it addressable as vram. amd and framework sent it over for honest testing, no strings attached, and i've been waiting on this one specifically. here's why it matters. i've run local ai on basically everything, a 150 dollar drawer card, a 3090, a 5090, the dgx spark, datacenter h200s. the one gap was always the accessible big memory tier on the amd side, and this fills it. 128gb unified at roughly half the price of the nvidia equivalent, the sovereignty box for people who want to run real models without a datacenter budget. booting it today. and the question i actually want answered is the one nobody answers straight: what does this thing really run? same bar i hold every other card to. amd, nvidia, apple, measured, never vibes. let's find out what it's got.
Sudo su @sudoingX

listen up ROCm and Vulkan builders. @FrameworkPuter just shipped me strix halo desktop, 128GB unified, landing on my desk tuesday. everyone keeps asking what actually runs on this thing beyond vendor charts and forum guesses. so i'm going to answer it properly. starting with big MoE models since massive total params on light active is the whole point of 128GB unified. if there's a specific model or quant you want tested on strix halo, reply and it goes in the queue.

Sudo su @sudoingX
AMD