TechMD

3.4K posts

TechMD

@TechMDAI

🔬 MD fueled by a deep passion for Medicine & Tech. 🌐 Exploring the frontiers of VR/AR/MR & BCI. 🤖 AI enthusiast.

Katılım Haziran 2023

2.5K Takip Edilen689 Takipçiler

Sabitlenmiş Tweet

TechMD@TechMDAI·18 May

I have been holding back from yall Frankeinstein aka Franky. X2 RTX Pro 6000. I have been running experiments for the last 2 months on this beast. @Snixtp

English

109

3.9K

TechMD@TechMDAI·3m

@jvr0x @NVIDIAAI Congratulations, looks like fire

English

Javier ⚛ priv/acc@jvr0x·1h

The second one is here @NVIDIAAI 🚀

English

299

TechMD@TechMDAI·52m

Huge

NVIDIA AI@NVIDIAAI

If you want to try it out for yourself, check out the video below or follow along here: nvda.ws/4wIdgMz

English

TechMD@TechMDAI·1h

@ExileAI_0 @M5Stack @loktar00 They are pretty solid honestly

English

ExileAI@ExileAI_0·1h

@TechMDAI @M5Stack @loktar00 Ordering that and a couple other things today from them. Thanks again.

English

TechMD@TechMDAI·19h

My babies, twin dgx sparks. I am humbled and blessed.

English

138

5.2K

TechMD@TechMDAI·1h

@MarkSunner @NVIDIAAI Interesting

English

Mark Sunner@MarkSunner·2h

. @NVIDIAAI We've been running GLM-5.2 across 4× DGX Sparks and hit a class of multi-node NCCL deadlocks that appear to affect the broader Spark community. We did a deep root-cause analysis. The key finding is that stock DGX Spark lacks GPUDirect RDMA, making host-staged collectives 10-100× more susceptible to TP race conditions. Full write-up with evidence and community references here: github.com/marksunner/dgx…

English

TechMD@TechMDAI·2h

A dream for sure

Joe Muller@BosonJoe

@NVIDIAAP 4 sparks is the dream...it's basically like buying a new car

English

667

TechMD@TechMDAI·2h

@jetha @Hikari_07_jp @tensorwave @runpod Legends

English

Jetha Chan@jetha·2h

Had dinner with @Hikari_07_jp and some folks from @tensorwave and @runpod!

English

592

TechMD@TechMDAI·3h

@JamesMcPherson @NVIDIAAI That so fire

English

Jim McPherson@JamesMcPherson·5h

I hear we are posting our @NVIDIAAI clusters this morning, so here we go: 4X GB10's with CRS804 RoCE for the fabric. 10GBE backhaul to the NAS in the other rack. All in a 12U 10" rack. SOTA AI, in the corner of my den. Currently serving GLM-5.2 NVFP4

English

2.7K

TechMD@TechMDAI·3h

@michellezfr Building in public you know

English

169

swedishasian67@michellezfr·1d

Wearing noice cancelling masks to talk to Claude is crazy

English

279

152

6.9K

672.9K

TechMD@TechMDAI·3h

@NeoAIForecast @thatcofffeeguy lol omg all that compute but no body

English

Neo@NeoAIForecast·14h

@thatcofffeeguy When you’ve spent the whole budget on gb10s and ultras and the robot body has to come from the packaging. 3090 boxes for arms 😂

English

134

The coffee guy@thatcofffeeguy·14h

I have built what I call my robot of boxes with my collection of hardware. 6 Asus GX10s, 2 m3 ultras, 2 m4 minis, 1 m4 pro, and you can see in photo the MacBooks hat is a rapsbery pi 5 cana kit. I think the last 2 sparks I want will also finish up his legs/shoes. No idea how to make the arms work. Little hardware humor for the night. @NVIDIAAI @mr_r0b0t

English

4.1K

TechMD@TechMDAI·3h

@loktar00 @thsottiaux @sama Yes I can have more than 1 codex desktop open at once. Open in new window is live

English

Loktar 🇺🇸@loktar00·3h

@TechMDAI @thsottiaux @sama Wait what do you mean? This sounds cool but I missed this news I guess?

English

TechMD@TechMDAI·4h

I think having the ability to have more than 1 codex instance is huge. Thanks @thsottiaux @sama

English

462

TechMD@TechMDAI·4h

@BosonJoe Wow pushing boundaries. I can see 30 easily (cross fingers)

English

Joe Muller@BosonJoe·4h

GLM 5.2 now pushing past 18 tok/sec on 2 DGX Sparks 🔥🔥🔥 This time I quantized the 3 biggest attention projections to NVFP4 (compared to the original FP8) There are still wins to be had on the speed and quality fronts so stay tuned

Joe Muller@BosonJoe

Played a few tricks and tripled the speed of GLM 5.2 on 2 DGX Sparks 🔥 Old: 4.1 tok/sec New: 13.8 tok/sec 2-bit quantization, TP2, and a frequency based prune of the experts to get planes under the ~88 GB page-cache ceiling

English

5.2K

TechMD@TechMDAI·4h

@ivanfioravanti @Tailscale Thanks boss man

English

Ivan Fioravanti ᯅ@ivanfioravanti·4h

@TechMDAI @Tailscale Love it!

English

490

TechMD@TechMDAI·15h

A great Hermes agent btw, Lenovo Legion Go S. PC gamers have more resources at their disposal than they think. Coupled with @Tailscale, it a deadly combo for robotics and agentic work.

English

TechMD@TechMDAI·5h

@0xSero Wishing you success. I can’t imagine. Those are precious babies. Hope you get the new build soon.

English

0xSero@0xSero·5h

Had to move him to the attic I’m done cooking in my office it was a sustained 32c while I was in there 😭 Ordered all the stuff now to wait until I can rebuild

English

6.5K

TechMD@TechMDAI·5h

@rishflips Deep , first thing is .md file

English

Rish@rishflips·9h

Building AI agents is like raising kids who never sleep. You think the hard part is teaching them to walk The hard part is making sure they don't walk off a cliff while you're not looking. Guardrails, not goals. What's the first thing you handed to an agent? #buildinpublic #aiagent

English

115

TechMD@TechMDAI·5h

@vileton_vaine Price at the time and availability. The are the same

English

🦎 Vaine@vileton_vaine·12h

@TechMDAI May I ask why did you go with different vendors? Do you prefer one over the other?

English

TechMD@TechMDAI·12h

@Snixtp lol idk it’s not mine. It’s cute tho. The cat wants to catch a spark

English

Espen JD@Snixtp·12h

@TechMDAI Is the cat essential to the setup?

English

TechMD@TechMDAI·18h

I’m loving this DGX Spark set up. I need a switch from MikroTik

jon@lursor_lover

New limited edition spark just dropped

English

TechMD@TechMDAI·12h

@fbwalker4 That’s super smart. I like that, seems your workflow is very advanced. Thanks for the tip btw

English

Rusty@fbwalker4·13h

@TechMDAI I have fable run eight other lower cost models including minimax which is very cheap. It gates and directs them.

English

TechMD@TechMDAI·1d

This guy is pushing the limit with 2 sparks. GLM at home on 2 sparks

Joe Muller@BosonJoe

English

TechMD@TechMDAI·13h

@fbwalker4 Ya fable is amazing and a class above glm. It’s pretty sweet to get glm 5.2 doing unlimited work for you with relatively low energy burn. Idk I don’t know; I find that amazing, especially when developing autonomous loops and agents running.

English

Rusty@fbwalker4·13h

@TechMDAI I've run it cloud based once and API call for a week. Just didn't have as good of luck as Fable.

English

TechMD@TechMDAI·13h

Well, choosing the right approach depends on your goals and comfort level when planning to reduce cloud use. While the frontier offers impressive capabilities, not every task requires it. Many local models perform adequately for most purposes so the decision comes down to whether to use an API or not. I just bet on myself to develop my own ai infrastructure for continued learning.

English

Rusty@fbwalker4·13h

@TechMDAI Heard. Wasn't a criticism, just trying to understand what use case this was working for. I spend thousands per month for frontier AI buy, and I've looked and till can't make it work intelligence/power/expense.

English

Keşfet

@jvr0x @NVIDIAAI @ExileAI_0 @M5Stack @loktar00 @MarkSunner @jetha @Hikari_07_jp