ett0re
722 posts

I love pretty much every pre-1990 computer ever made. But my favorite by a long shot is the Apollo AGC. This is my second time through this book.

$500,000 to @rileyholterhus through Cantina Bounties. 🪐 The researchers who consistently find the bugs that matter don't chase volume. They follow programs where scope is tight, triage is fast, and rewards match actual impact. Well done, Riley!

Clustering NVIDIA DGX Spark + M3 Ultra Mac Studio for 4x faster LLM inference.
DGX Spark: 128GB @ 273GB/s, 100 TFLOPS (fp16), $3,999
M3 Ultra: 256GB @ 819GB/s, 26 TFLOPS (fp16), $5,599
The DGX Spark has one-third the memory bandwidth of the M3 Ultra but roughly 4x the FLOPS. By running compute-bound prefill on the DGX Spark, memory-bound decode on the M3 Ultra, and streaming the KV cache over 10GbE, we get the best of both machines, with massive speedups.
Short explanation in this thread & link to full blog post below.
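A rough back-of-the-envelope sketch of why that split makes sense, using only the peak numbers quoted in the post. Everything else here is an assumption for illustration, not the authors' method: the 8B-parameter model, the 4096-token prompt, the common rule of thumb of about 2 FLOPs per parameter per token for prefill, and the idea that each decoded token reads all fp16 weights once. It ignores KV cache transfer time over 10GbE and attention cost, and real throughput sits below peak.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Device:
    name: str
    mem_bandwidth_gbs: float  # peak memory bandwidth in GB/s (from the post)
    fp16_tflops: float        # peak fp16 throughput in TFLOPS (from the post)


DGX_SPARK = Device("DGX Spark", mem_bandwidth_gbs=273.0, fp16_tflops=100.0)
M3_ULTRA = Device("M3 Ultra", mem_bandwidth_gbs=819.0, fp16_tflops=26.0)


def prefill_seconds(dev: Device, params_billion: float, prompt_tokens: int) -> float:
    """Compute-bound lower bound: ~2 FLOPs per parameter per prompt token (assumed)."""
    flops = 2.0 * params_billion * 1e9 * prompt_tokens
    return flops / (dev.fp16_tflops * 1e12)


def decode_seconds_per_token(dev: Device, params_billion: float,
                             bytes_per_param: float = 2.0) -> float:
    """Memory-bound lower bound: every weight is read once per generated token (assumed)."""
    bytes_read = params_billion * 1e9 * bytes_per_param
    return bytes_read / (dev.mem_bandwidth_gbs * 1e9)


if __name__ == "__main__":
    PARAMS_B = 8.0        # hypothetical model size, in billions of parameters
    PROMPT_TOKENS = 4096  # hypothetical prompt length

    devices = [DGX_SPARK, M3_ULTRA]
    for dev in devices:
        print(f"{dev.name}: prefill >= {prefill_seconds(dev, PARAMS_B, PROMPT_TOKENS):.2f} s, "
              f"decode >= {decode_seconds_per_token(dev, PARAMS_B) * 1e3:.1f} ms/token")

    # Pick the device with the smaller lower bound for each phase.
    best_prefill = min(devices, key=lambda d: prefill_seconds(d, PARAMS_B, PROMPT_TOKENS))
    best_decode = min(devices, key=lambda d: decode_seconds_per_token(d, PARAMS_B))
    print(f"prefill on {best_prefill.name}, decode on {best_decode.name}")
```

Under these assumptions the higher-FLOPS machine wins prefill and the higher-bandwidth machine wins decode, which matches the split described in the post. The estimates are lower bounds, not predictions of measured speedup.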

It seems like local AI is becoming mainstream and I'm super excited!! I just reviewed the Nvidia DGX Spark but I really want to pit it against a Mac Studio and a @FrameworkPuter AI machine. Would y'all watch that?