Pinned Tweet
the tiny corp
4.1K posts

the tiny corp
@__tinygrad__
We make tinygrad; sell tinybox for the GPU middle class. Our mission is to commoditize the petaflop.
Hong Kong · Joined June 2023
184 Following · 72.2K Followers

@spikedoanz hmm, it should. if you post more about the error on discord we can look. it's *tons* faster, CL is a shallow backend.

gist.github.com/spikedoanz/de5…
for those interested, some basic steps to run tinygrad with the opencl backend on a ryzen ai max 395+. unfortunately this doesn't use the npu.

@AxcanNathan @RGBiverton They are close. 160W -> 110W cuts about 20% more off this, but they are still similar.


@RGBiverton @__tinygrad__ neither of these come close to HW4 at 110W right

With a consumer eGPU powered by tinygrad plugged into the USB3 port, comma will be the most powerful embedded device in the world. Beyond Tesla HW4, beyond NVIDIA Jetson Thor. When you need a brain for your robot, don't overpay, just use a normal GPU!
comma@comma_ai
And we've got a teardown! facebook.com/groups/Electro…

@__tinygrad__ Thanks a lot❤️. The code was before rangeify and I feel far from ready to even post in the Discord. Would you mind to make a Rockchip-hardware channel?

@AxcanNathan Like with the exabox, the user needs to provide the plug. The common path will probably be a midrange GPU power limited to 110W, which pulls from the cigarette lighter jack available on almost all cars and is still competitive with Tesla HW4.

@AlexBowden52 @gazorp5 @comma_ai You are limited largely by RAM bandwidth on modern BS=1 models. The 845 has 30 GB/s, even the 8 Gen 5 only has 85 GB/s, just 2.8x.
Want to see some power? The 9060XT eGPU has 320 GB/s and good L2 cache speeds unlike mobile chips.
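
A rough sketch of the bandwidth argument above, assuming each batch-size-1 forward pass reads every weight from memory once, so memory bandwidth caps the pass rate. The bandwidth figures are the ones quoted in the post; the 0.1 GB weight size and the function names are hypothetical, chosen only to make the arithmetic concrete.

```python
# Memory bandwidth figures quoted in the post, in GB/s.
BANDWIDTH_GBPS = {
    "845": 30.0,
    "8 Gen 5": 85.0,
    "9060XT eGPU": 320.0,
}


def max_passes_per_second(bandwidth_gbps: float, weights_gb: float) -> float:
    """Upper bound on BS=1 forward passes per second.

    Assumes every weight is streamed from memory exactly once per pass
    and that nothing else limits throughput.
    """
    return bandwidth_gbps / weights_gb


def bandwidth_ratio(baseline: str, other: str) -> float:
    """How many times more bandwidth `other` has than `baseline`."""
    return BANDWIDTH_GBPS[other] / BANDWIDTH_GBPS[baseline]


WEIGHTS_GB = 0.1  # hypothetical model size, not taken from the post

for name, bandwidth in BANDWIDTH_GBPS.items():
    bound = max_passes_per_second(bandwidth, WEIGHTS_GB)
    ratio = bandwidth_ratio("845", name)
    print(f"{name}: <= {bound:.0f} passes/s, {ratio:.1f}x the 845")
```

Under this model the ratio between two devices does not depend on the model size, which is why the post can compare the chips by bandwidth alone.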

comma@comma_ai
FYI tearing down your comma four doesn't void the warranty :)

@nath_simard @dhh @AMD @AnushElangovan Nope, actually needed to disable it in BIOS to make deep sleep work. I only ever use webcam at my desk and I have a USB one (single Thunderbolt plug for charging+monitor+USB btw, another must have for a laptop)

@__tinygrad__ @dhh @AMD @AnushElangovan Did you manage to get the webcam working on Linux? That's the only drawback for the HP imo. Otherwise great laptop, performance mode is great when plugged in, and battery's quite good when unplugged.

@dhh @AMD @AnushElangovan @HP Oh and @dhh while I have you here, this is what I'm looking for in a theme. Maybe Vantablack is moving things in this direction? github.com/geohot/omarchy…

@dhh @AMD @AnushElangovan Oh also @HP why does the stupid keyboard backlight draw 2W! Like did anyone care about this? github.com/geohot/ztop for the measurement tool so you can replicate this.


@dhh It took tuning, but I got it down to 5W at screen on idle and almost perfect sleep. With 74 Wh battery it's not too bad. But yea the idle power issue is fixable in software if AMD would improve the SMU firmware. The GPU and RAM bandwidth are almost double Panther Lake.

@shikharontwt Of course not, we are just an indicator. It went up because AMD made good decisions, and how you do one thing is how you do everything. geohot.github.io/blog/jekyll/up…

@__tinygrad__ Come on, the stock didn't go up because of you🤣

This megakernel is using a 3090. Stock tinygrad beats this (420 tok/sec!) using a cheaper 7900XTX. With our custom driver AMD hardware can really shine.

mrciffa@davideciffa
I love tinygrad, but with our megakernel you can go to 415 tok/s in decoding speed 🚄

@davideciffa On a 3090, not an M3 Max! That's like saying I love Lance Armstrong, but with my Ferrari I can go 211 mph 🏎️

the tiny corp@__tinygrad__
We set out to replicate Kimi's 193 tok/s Qwen3.5-0.8B on M3 Max. Our baseline is already 178 tok/s, beating LMStudio (160) and llama.cpp (140) out of the box, but with tinygrad's custom kernel feature Claude cranked it to 195.7!
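
For a sense of scale, the relative gaps between the decode speeds quoted in that post can be worked out directly. This is only arithmetic on the numbers as posted; the dictionary labels and the helper name are illustrative, not from the source.

```python
# Decode speeds quoted in the post, in tokens per second.
TOK_PER_SEC = {
    "Kimi target": 193.0,
    "tinygrad baseline": 178.0,
    "LMStudio": 160.0,
    "llama.cpp": 140.0,
    "tinygrad custom kernel": 195.7,
}


def percent_faster(new: str, old: str) -> float:
    """Percentage by which `new` exceeds `old` in tokens per second."""
    return (TOK_PER_SEC[new] - TOK_PER_SEC[old]) / TOK_PER_SEC[old] * 100.0


comparisons = [
    ("tinygrad baseline", "llama.cpp"),
    ("tinygrad custom kernel", "tinygrad baseline"),
    ("tinygrad custom kernel", "Kimi target"),
]
for new, old in comparisons:
    print(f"{new} vs {old}: {percent_faster(new, old):+.1f}%")
```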