Crown 👑

28.4K posts

@ciruai

Local AI Consulting. AI is about the workflow, not the model. AMD Local LLM Group: https://t.co/0wQDCDXlzO

United States · Joined May 2009

2.8K Following · 7.1K Followers

Pinned Tweet

Crown 👑@ciruai·31 May

@the_jimmy_jones x.com/i/chat/group_j…

1.1K

Crown 👑@ciruai·9m

@awdyzx @DataChaz Use the real one instead of this one

.@awdyzx·6h

@DataChaz It isn't that smart. I've played around with it and it can't do basic knowledge or image analysis.

792

Charly Wargnier@DataChaz·13h

DO YOURSELF A FAVOR: GO DOWNLOAD THIS NEW LOCAL MODEL AND KEEP IT IN STORAGE.

Even if you don't have a massive GPU setup, having offline access to an intelligent model is a crucial insurance policy. Free API access won't necessarily last forever.

Right now, the 12B-27B range is the absolute sweet spot, and Hugging Models just highlighted a perfect candidate to download today:

→ GEMMA 4 12B CODER on @huggingface 🤗

It packs Google’s latest architecture into a GGUF format optimized for consumer hardware.

What it delivers locally:
→ Fast, private code completion without the cloud
→ Real-world debugging and reasoning capabilities
→ Smooth performance on 12GB+ VRAM or a standard CPU

Don't wait until you need it. Grab the weights and keep them locally 👇

Hugging Models@HuggingModels

Gemma 4 12B Coder is here and it's a game changer for local code generation. This GGUF model packs Google's latest gemma-4 architecture into a compact 12B size, perfect for running on consumer hardware. It's optimized for reasoning and thinking, making it ideal for developers who want fast, private coding assistance without the cloud.

172

1.8K

252.5K

Crown 👑@ciruai·9m

@Hassaansaeed22 @DataChaz If you're memory-poor, use the new QAT version of Gemma 4 12B. It's much faster.

Syed Hassaan Saeed@Hassaansaeed22·8h

@DataChaz 12B means I need about 8 to 16 GB of VRAM, correct?

1.5K
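
A rough back-of-envelope for the VRAM question above: the memory taken by the weights is approximately the parameter count times the bits per weight, divided by 8. The sketch below assumes round bit widths (16, 8, 4) purely for illustration; real GGUF quantizations use somewhat different effective bits per weight, and the KV cache and runtime overhead come on top. The function name `weight_memory_gb` is hypothetical.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes).

    Assumption: every weight is stored at the same bit width and nothing
    else (KV cache, activations, runtime buffers) is counted.
    """
    return params_billion * bits_per_weight / 8


# Illustrative bit widths, not exact GGUF quant sizes.
for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"12B at {label}: ~{weight_memory_gb(12, bits):.0f} GB for weights")
```

Under these assumptions a 12B model needs about 24 GB at 16-bit, 12 GB at 8-bit, and 6 GB at 4-bit for the weights alone, so an 8 to 16 GB budget is plausible only for quantized versions.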

Crown 👑@ciruai·10m

@ddarius94 @DataChaz You might be able to run Hermes with it.

Darius@ddarius94·9h

@DataChaz What can I do with just 12B? Remove unused imports and format code?

2.3K

Crown 👑@ciruai·12m

@MichaelGannotti White-glove small business AI solutions and consulting.

Mike Gannotti@MichaelGannotti·19m

Have a service around AI? Data? Infrastructure? Security? Let me know so I can get you listed. SMF Clearinghouse: the clearinghouse for all things AI

Crown 👑@ciruai·32m

@mr_r0b0t Have you tested?

mr-r0b0t@mr_r0b0t·15h

A new specialist subagent, purpose trained to efficiently search your repo, was just released by Microsoft! Say hello to FastContext 😍

2.8K

Crown 👑@ciruai·3h

@bradmillscan @shannholmberg @hey_madni Followed both!

Brad Mills 🔑⚡️@bradmillscan·3h

I need to set up a workflow to analyze and build a wiki of @shannholmberg & @hey_madni's posts. They are both fire-hosing Hermes-gold right now. Crazy value.

585

Crown 👑@ciruai·4h

@MichaelGannotti @NVIDIAAI The models bigger than that are not worth running anyway. You are sitting on a beast right there!

Mike Gannotti@MichaelGannotti·4h

@ciruai @NVIDIAAI Thanks. It's a little different in that it has 128 GB of RAM and my AMD unit has 96, but it will be interesting to see.

Mike Gannotti@MichaelGannotti·10h

Just ordered! @NVIDIAAI DGX Spark. Can't wait to get my hands on it

8.4K

Crown 👑@ciruai·4h

@MichaelGannotti @mr_r0b0t I thought you had the same system? (hardware-wise)

Mike Gannotti@MichaelGannotti·6h

@mr_r0b0t Yup... the clickbait posts showing $1,900... If it were that cheap I'd have bought one today.

187

mr-r0b0t@mr_r0b0t·8h

What is all this nonsense I keep seeing about the AMD Strix Halo being $2000!?! Half the posts aren’t even showing the correct device? It isn’t $2000, it was announced at $3999. That’s the only price I’ve seen anywhere. I spec’d out a comparable Framework desktop for reference.

101

12.9K

Crown 👑@ciruai·4h

@mr_r0b0t slop influencers.

Crown 👑@ciruai·4h

@sirius_devops @sudoingX Nah, it's cheaper, but not 2K. More like ~$3K.

Lance@sirius_devops·10h

@sudoingX Stop the lies, the AMD box is the same price. These LLM-generated grift posts should get people banned.

3.1K

Sudo su@sudoingX·10h

nvidia vs amd two boxes on my desk, both 128gb of unified memory. one is the nvidia dgx spark ($4,699). the other is the amd strix halo ($1,999), amd at roughly half the price. i'm running the exact same models on both, from a 3b all the way up to a 397b, same quants, same llama.cpp, and i'm posting every single number. here is why it actually matters. if the amd box just keeps pace, that's a nice story. but if it matches or beats a box that costs twice as much, the entire calculus for buying local ai hardware changes overnight. i already have the first numbers and they made me sit up. holding them for the full breakdown. stay tuned anon. this matchup is going to shake some ground.

897

67.3K

Crown 👑@ciruai·4h

@DBirker78883 @sudoingX Absolutely. I hope his benchmarks do it any justice at all.

Daniel Birker@DBirker78883·5h

@sudoingX I feel like the Strix Halo is the sleeping giant of local ai.

274

Crown 👑@ciruai·4h

@sudoingX It's not fair to use the same quants on both when there are quants optimized for each.

Sudo su@sudoingX·10h

the rules, so it's fair: identical models, identical quants, same llama.cpp build, both boxes idle, every number posted whether it flatters amd or nvidia. full disclosure, both nvidia and amd plus framework sent these units for honest testing, no money on either side. that's the whole point, nobody's paying me to crown a winner, the silicon decides. 3b dense up to a 397b moe, the full range. first numbers dropping soon.

Crown 👑@ciruai·6h

@SMNYC1 @arunninghacker blows my mind what even a 3GB model knows. Where does it keep it all? 😂

bird@SMNYC1·7h

@ciruai @arunninghacker Yes! That is amazing compression right?

Volodymyr Styran 🇺🇦@arunninghacker·1d

I ran OpenCode riding an uncensored Qwen3.6 running on a Mini-sized shiny silver box with an NVIDIA card in it for a couple of days, and I assure you: all this ethics/regulations/export control frontier AI drama will be over very, very soon

490

62.8K

Crown 👑@ciruai·7h

@SMNYC1 @arunninghacker Imagine the power is out for an extended period and yours is the only house that basically still has the whole internet self-contained within it!

bird@SMNYC1·7h

@ciruai @arunninghacker I feel like there has to be a market among the people who buy the 10-year food storage kits. They could rebuild civilization after they come out of the bunker with AI in a box. It's a neat thought.

Crown 👑@ciruai·7h

@SMNYC1 @arunninghacker Currently solar powered and running AI locally. Hypothetically, if power and internet go out, I could still run AI at home.

bird@SMNYC1·8h

@ciruai @arunninghacker It's an amazing way to compress a ton of information. Like wow. Having it work durably and offline would be good if the web ever blows up. It's always a DNS issue lol

Crown 👑@ciruai·8h

@GMMeyer @benrayfield @BLUECOW009 I have no illusions about someone's ability to just download a weight file. Specifics are going to matter. That's why I say we can't really war game it without the details.

Greg Meyer@GMMeyer·8h

@ciruai @benrayfield @BLUECOW009 have you ever worked at an enterprise tech company? i can 100% guarantee almost no one has access to the model itself in the way you’re suggesting and any access of this kind is logged

@bluecow 🐮@BLUECOW009·1d

Kinda crazy we never had the weights for a sota model be leaked

619

68.4K

Crown 👑@ciruai·8h

I still rely heavily on OpenAI at home, even though I have access to dozens of models running locally and have spent hundreds of hours preparing for local-only use. A couple of times in the last month the internet went out for an extended period (someone crashed into a main hub and died 🥲). Had to test everything I had been working on and keep moving without it. Found some important gaps. It's a lot of fun though, to be honest.

bird@SMNYC1·8h

@ciruai @arunninghacker Yeah. I'm from the before times when we had our own computers and no internet. Ownership is kinda retro and good sec ops.

Crown 👑@ciruai·8h

GPU VMs are much more expensive than just using an API. It's going to cost you $1+ an hour. Adds up if it's always on. An API for chatting probably will only cost you $20 a month or less but at that point why are you not just using gpt anyway? The reason to go local is to be in control.

bird@SMNYC1·8h

@ciruai @arunninghacker Cool! Curious why not just host a vm somewhere? With a gpu
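
The cost comparison in this exchange can be checked with simple arithmetic. The sketch below assumes a flat $1 per hour VM rate (the post says "$1+", so this is a lower bound), a 30-day month, and the $20 per month API estimate from the post. All names are illustrative.

```python
HOURS_PER_MONTH = 24 * 30  # assumption: 30-day month, VM never shut down


def always_on_vm_cost(rate_per_hour: float) -> float:
    """Monthly cost of a GPU VM that runs around the clock."""
    return rate_per_hour * HOURS_PER_MONTH


def break_even_hours(api_monthly: float, rate_per_hour: float) -> float:
    """VM hours per month at which the VM costs the same as the API."""
    return api_monthly / rate_per_hour


vm_monthly = always_on_vm_cost(1.0)        # $1/hour lower bound from the post
api_monthly = 20.0                         # API estimate from the post
print(f"Always-on VM: ${vm_monthly:.0f}/month vs API: ${api_monthly:.0f}/month")
print(f"Break-even: {break_even_hours(api_monthly, 1.0):.0f} VM hours/month")
```

Under these assumptions an always-on VM costs at least $720 a month, and it only matches the $20 API estimate if it runs about 20 hours a month.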

Crown 👑@ciruai·8h

@GMMeyer @benrayfield @BLUECOW009 Would need to dig into specifics to map out the best way. Kind of hard to say hypothetically. Impossible to stop if people can't be trusted. In the end it's your people who protect you.

Greg Meyer@GMMeyer·9h

@ciruai @benrayfield @BLUECOW009 x.com/gmmeyer/status…

Greg Meyer@GMMeyer

@benrayfield @BLUECOW009 okay let’s go back to basics: in any enterprise secure tech is gated and logged, if anyone downloads this it would set off alarms because no one should actually have this on their computer
