Jerrill Johnson 🇺🇸🗽 أُعيد تغريده

@RapidResponse47 Mr. President, we are just getting started 🇺🇸
English
Jerrill Johnson 🇺🇸🗽
811 posts














New video. My attempts are clustering DGX Sparks (and giveaway)

Cost: $35k for 12 tokens/second on a 32B model at FP8 with 1120W of power. Tons of cables, setup, networking. Alternatively: One $9k GPU (RTX Pro 6000 MaxQ): 22 tokens/second for one request at ~500W of power. Fits in the desktop you already have.


8 sparks cluster running




