⬣PulseChain LIVE⬣ 💥

2.5K posts

⬣PulseChain LIVE⬣ 💥

⬣PulseChain LIVE⬣ 💥

@PulseChainLIVE

#AI #Cybersecurity #Linux #privacy

Middle of the GPU Beigetreten Ocak 2023
856 Folgt1.9K Follower
⬣PulseChain LIVE⬣ 💥
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
@steeve First time I read about your project. When you release sm121 I will test it on DGX Spark and post here benchmark results. Thanks.
English
0
0
0
19
Steeve Morin
Steeve Morin@steeve·
Yesterday I built 33 000 flash attention kernels in about 4 minutes from sm80 to sm120. For x86_64 and arm64. From my mac. Bazel is wild man.
English
3
3
63
6.7K
⬣PulseChain LIVE⬣ 💥
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
🤬🤬🤬
klöss@kloss_xyz

let me explain the ramifications of this… → 150,000 people just got locked out of their own cars… across 46 states… for 6 days straight and counting → not a software bug. not a glitch. not AI permissions gone wrong. → hackers flooded Intoxalock’s servers and all these vehicles just stopped starting… → these are court ordered breathalyzer devices… people who messed up in the past but have been doing everything right since (hopefully)… and now they can’t drive to work because someone else’s security system failed wild connect the dots… your electric car talks to a server to start. one breach and it’s a 50,000 dollar paperweight your insulin pump syncs to a server. your pacemaker data lives on a server. one breach and it’s not a car that stops working… it’s a body your smart home lock runs through a server. one breach and your front door either won’t open or won’t close now zoom out… Gartner projects $2.5 trillion going into AI this year… only $240 billion into securing the systems it runs on. that’s a 10 to 1 bet that nothing goes wrong the four biggest tech companies (Alphabet, Microsoft, Meta, and Amazon) are rumored to spend $700 billion on AI infrastructure this year alone… while cybercrime is projected to cost the world $10.5 trillion now imagine this happens to Tesla. to a hospital network. to the power grid… every new AI integration is a new attack surface. every API is a new door. every device that “talks to the cloud” is one more thing that can be turned off by someone you’ll never meet and I’m not saying every one of these systems will experience something who really knows what’s secure or isn’t but if you’re building right now… security isn’t the last layer you add. it’s the first one. → 150,000 people have just found out what happens when nobody prioritizes that… archaic government systems and legacy businesses are likely first on the chopping block I hope the rest of us continuously learn from it instead of living it the weakest link in every system is the one nobody bothered to secure like what wild system vulnerability will we see next? does someone hack Area 51?

ART
0
0
0
45
David Hendrickson
David Hendrickson@TeksEdge·
The Ultimate 128GB Local AI Hardware Battle 🥊💻 Judging Qwen3.5-27B (Bartowski IQ4_NL) on top unified-memory machines: 1️⃣ @AMD Strix Halo (Ryzen AI Max+ 395)💰 ~$2,500 | 🚀 9 - 12 tps (decode) | 🎮 Full Windows AAA gaming 🏆 Speed + value king. 🖕 2️⃣ @Apple Mac Studio M3 Ultra💰 ~$5,000 | 🚀 8–12 tps | 🍎 Apple macOS & ecosystem w/solid speeds; limited AAA gaming 3️⃣ @NVIDIA DGX Spark (GB10 Blackwell)💰 $4,699 | 🚀 ~10 tps (~20 tps x2 node) | 🐧 Linux/AI research only w/strong prefill + nerfed decode bandwidth-limited. Difficult pooling (new cables may fix). AAA gaming not optimized for Grace Verdict: AMD wins for most power users and best speed/price/gaming combo. (Community benchmarks; YMMV with setup/context) Which are you buying? 👇
David Hendrickson tweet mediaDavid Hendrickson tweet mediaDavid Hendrickson tweet media
English
44
25
308
58.7K
⬣PulseChain LIVE⬣ 💥
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
@spark_arena @TeksEdge @AMD @Apple BTW, Asus released a week ago an updated version of DGX OS recovery .iso that includes NVIDIA AI Workbench and seem to have better drivers integration than before. NVIDIA has still the 6 months old .iso on their website.
English
0
0
0
18
⬣PulseChain LIVE⬣ 💥
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
@spark_arena @TeksEdge @AMD @Apple Look at the data not the conclusions of that video and storagereview article. Asus gx10 consumes 10% less for the same speeds, has full copper heat sink, the unit is 20% heavier. I got GX10 GPU at 80W for inference with no thermal issues and its much cheaper. For me GX10 wins.
⬣PulseChain LIVE⬣ 💥 tweet media⬣PulseChain LIVE⬣ 💥 tweet media⬣PulseChain LIVE⬣ 💥 tweet media
English
0
0
1
47
⬣PulseChain LIVE⬣ 💥
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
@briancaffey @Teknium I tested Nemotrone a few days ago, so I`m not 100% sure that I got 10 t/s or more but I tried a REAP version of MiniMax m2.5 yesterday and got 10 t/s and that`s about the same size but not MOE so you are right.
English
0
0
2
62
Brian Caffey
Brian Caffey@briancaffey·
@PulseChainLIVE @Teknium I’m getting about double that on my Spark, unless you aren’t counting <thinking> @PulseChainLIVE but I agree that smaller optimized models with higher concurrency are more fun :D
Brian Caffey tweet media
English
1
0
1
40
Teknium (e/λ)
Teknium (e/λ)@Teknium·
Just got an Nvidia Spark setup. Hermes Agent installed without any issues. Now lets see what model it should be powered by 😉
English
37
5
283
12.3K
⬣PulseChain LIVE⬣ 💥
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
@Teknium for compiling llama.cpp you need to find the best flags for maximum performance, this is what worked for me and when you launch use these x.com/PulseChainLIVE… --no-mmap is very important for smooth loading of large models in unified memory.
⬣PulseChain LIVE⬣ 💥 tweet media⬣PulseChain LIVE⬣ 💥 tweet media⬣PulseChain LIVE⬣ 💥 tweet media⬣PulseChain LIVE⬣ 💥 tweet media
English
0
0
1
155
Teknium (e/λ)
Teknium (e/λ)@Teknium·
Getting Hermes ready to work with the spark over here
Teknium (e/λ) tweet media
English
8
3
107
13.7K
⬣PulseChain LIVE⬣ 💥
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
@Teknium If 10 t/s doesn t mind you, try this: mradermacher/MiniMax-M2.1-REAP-139B-A10B-GGUF compile llama.cpp with the right flags and launch with c=1 ctx=192K this is the closest you can get to premier models for what you need.
English
0
0
2
128
⬣PulseChain LIVE⬣ 💥 retweetet
Gray Swan AI
Gray Swan AI@GraySwanAI·
Your AI agent can be hijacked by a prompt injection and you'd never know! The attack executes. The response looks normal. And the user moves on. We ran the largest public competition testing this exact threat across tool use, coding, and computer use agents. 464 participants, 272K attacks, 13 frontier models. Every model proved vulnerable.
Gray Swan AI tweet media
English
1
15
47
11.5K
⬣PulseChain LIVE⬣ 💥
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
@tmophoto @Teknium yes, done that. Even in llama.cpp you make a router script and llama has also the web UI so you can use it too while the agents use it. Qwen3.5-35B concurrency=1 is 50 t/s, if you use parallelism you get hundreds t/s out.
English
0
0
0
18
tmo
tmo@tmophoto·
@Teknium can you install multiple medium sized models and fit them both so one agent accesses one model and another accesses another model (general use / coding)?
English
1
0
1
334
⬣PulseChain LIVE⬣ 💥
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
@Teknium Did it came with the latest version of DGX OS with NVIDIA AI Workbench ? I reinstalled mine today just to have a clean slate and I was very happy to see that It was an updated version that looks like some extensive work was done on it versus what I had before.
English
0
0
0
218