Ben Barrey

171 posts

Ben Barrey banner
Ben Barrey

Ben Barrey

@barrey_ben

Poetic Engineer. Built @Turi_ai to run an AI company. Teaching small businesses how to deploy AI safely & effectively.

The Matrix Katılım Kasım 2021
421 Takip Edilen17 Takipçiler
Ben Barrey
Ben Barrey@barrey_ben·
@kloss_xyz Me explaining to this to my wife. She still doesn’t want to stop the guac
GIF
English
0
0
1
0
klöss
klöss@kloss_xyz·
Americans: “no way I’m spending $2,000/mo on frontier AI models…” also Americans: buys $17 salads from Whole Foods, pays $3 extra at Chipotle for guacamole, willingly spends $9 on oat milk lattes, has 4 family streaming subscriptions they forgot about, and tips 30% on UberEats
English
12
1
23
1.3K
sudo rm -rf
sudo rm -rf@itsjustmarky·
Where can I buy one?
English
1
1
1
48
0xSero
0xSero@0xSero·
Once I get all 4x 6000s I will be giving away 4 of the 3090s to people on X and 4 will go to an up and coming lab.
English
23
2
137
2.5K
0xSero
0xSero@0xSero·
288gb VRAM - 8x RTX 3090 - 1x RTX 6000 Unknown amount of mixed memory expecting 176gb 128gb Framework 16gb Mac mini 32gb Mac M1 Max 464gb total memory
0xSero tweet media
English
48
5
373
10.1K
Ben Barrey retweetledi
Lance Martin
Lance Martin@RLanceMartin·
a great way to get started is to use our skill, built into Claude Code. get the latest Claude Code release: $ claude update then start Claude Code and run our subcommand: $ claude /claude-api managed-agents-onboarding
English
16
6
46
11.4K
Ahmad
Ahmad@TheAhmadOsman·
What do you want to see me running on these 4x DGX Sparks today?
Ahmad tweet media
English
165
13
527
35.1K
Ben Barrey
Ben Barrey@barrey_ben·
Hot take: you’ve had your supercomputer human brain your whole life and used no more than 10% of it. You’ve had Opus-4.6 and GPT-5.4 for two months and used probably 5% of its capabilities. You don’t need Mythos to get what you need done.
English
0
0
0
21
Ben Barrey
Ben Barrey@barrey_ben·
Yea don’t overthink it, it’s basically a multimodal notes app. Get Git and Templater plugins. Get a theme/skin you like. The sauce is how you setup your frontmatter templates to auto generate on new notes and structure of your vault so agents can have easy access/write/retrieve. The karpathy wiki is just a start, adding custom workflow/skills and how you want your links setup is where it gets interesting.
English
0
0
3
214
Ben Barrey
Ben Barrey@barrey_ben·
Oh I don’t disagree with you. It’s a young format having growing pains, but it’s yet to be seen what software improvements and new models could improve it. Right now sure I’d probably take the M5 instead. My initial point was vs. RTX GPUs and how it’s kinda apples to oranges to compare the spark to
English
0
0
1
17
David Hendrickson
David Hendrickson@TeksEdge·
💨 Running Gemma-4-26B-A4B-it on a NVIDIA DGX Spark (GB10 Blackwell) at 37 tps just feels more pure than running on a huge costly RTX-5090 gaming rig. • 26B total params / ~4B active (MoE magic) • Running smooth with NVFP4 quantization • Hitting 37 tokens/sec decode on interactive chats & agents Feels snappy. Only ~16GB loaded → 256K context has tons of room left.
David Hendrickson tweet media
English
46
15
306
28.7K
Ben Barrey
Ben Barrey@barrey_ben·
@shitcoinity @TeksEdge I wouldn’t write off the spark, depends on adoption of NVFP4 vs. MLX and which architecture actually ends up benefiting more from newer model quants. Plus Mac’s are generally more expensive comparatively. Spark clusters could still be very interesting. I see competition here.
English
1
0
0
28
Shitcoinity
Shitcoinity@shitcoinity·
@barrey_ben @TeksEdge Insanely better value, especially for the 512 Ultra. Which is insane considering we're talking about Apple.
English
1
0
0
20
Shitcoinity
Shitcoinity@shitcoinity·
@barrey_ben @TeksEdge Spark is utter garbage in every single way. It's an insult to consumers. They push me towards Mac for the first time in my life.
English
1
0
0
42
Ben Barrey retweetledi
Konny
Konny@konnydev·
Qwen 3.6 Plus just launched 🤯 • 1M token context • Plans, tests, and iterates like an engineer • Turns UI screenshots into working code • Free on OpenRouter and Venice Why are you still paying???💰
Konny tweet media
English
58
12
387
29.6K
Ben Barrey
Ben Barrey@barrey_ben·
@HowToAI_ 5-7 tok/s wouldn’t call that solved just yet
English
1
0
5
5.2K
How To AI
How To AI@HowToAI_·
🚨 Microsoft has solved the biggest problem with AI. They open-sourced bitnet.cpp. It’s a 1-bit inference framework that runs massive 100B parameter models directly on your CPU without GPUs. it uses 82% less energy.. 100% open-source.
English
143
387
3.3K
477.7K
Josh Schultz
Josh Schultz@joshuamschultz·
I have a partnership agreement with Dell and Nvidia - so different. When I sell these to small businesses and set them up its about 5k for a spark a few hundred per cable and then dev time (unless you do yourself) you'll need - triton server and/or vllm - docker / k3 setups (which openshell uses) - the vpn setup (I just built in/through unifi) - then the actual agent setup (I use Arc (mine), openclaw, and hermes) - and then the orchestration (ArcTeam, and Paperclip for me) So about 25k and a lot of time
English
1
0
9
391
Josh Schultz
Josh Schultz@joshuamschultz·
The system is coming together... 1 @nvidia Spark 4 @Dell GB10s 5 Blackwell GPUs for a mini home cluster. 3 networked together for ability for a 400+ billion param model (3 petaflops of compute at a unified 384 GB of memory) ... serving 3 models to the other 2 units hosting various agents, agent teams along with @nvidia personaplex claude code instances knowledge bases etc Currently playing with models including splits/routing between - kimi (reasoning) - minimax - nemotron super (agent) - qwen (coding and routing) working on training as well with the opus reasoning traces dataset in @huggingface Getting close to fully owning the intelligence and the inference!
Josh Schultz tweet mediaJosh Schultz tweet media
English
43
11
269
21K
Ben Barrey
Ben Barrey@barrey_ben·
@0xSero Let everyone sleep and keep buying up the 3090s, keeps the price low on Blackwell lol
English
0
0
0
75
Mr V
Mr V@MrV_777·
@0xSero think the 16gb 5060ti is still a great option too
English
1
0
1
214
Ben Barrey retweetledi
Om Patel
Om Patel@om_patel5·
GEMMA 4 JUST CASUALLY DESTROYED EVERY MODEL ON THIS LEADERBOARD EXCEPT OPUS 4.6 AND GPT-5.2 31 billion parameters at $0.20 per run a model you can run locally on your own machine is now competing with the best closed-source models on the planet it beat every other model they tested on their benchmark. the only two it couldn't touch were Opus 4.6 and GPT-5.2 for context those models cost 10-50x more to run with Opus being 180x more to run. open source is catching up faster than anyone expected
Om Patel tweet media
English
36
26
327
26.4K