罗杰斯

490 posts

@dhbrojas

AGI @ https://t.co/vrJX6VOASs, 清华大学

Paris, France · Joined April 2020
1.1K Following · 333 Followers
Jakub@JCzarlinski·
@dhbrojas @__tinygrad__ was thinking of getting the p150a to tinker with myself - curious how it goes!
罗杰斯@dhbrojas·
I will definitely regret this but the @__tinygrad__ backend is not going to write itself...
[image]
Igor Michalak@igorjmichalak·
I need a @tenstorrent accelerator with this form factor and 2-4 W power draw. I have a Google Coral TPU, but I'm not too keen on TensorFlow Lite and abandoned proprietary software.
[image]
罗杰斯@dhbrojas·
@__tinygrad__ I don’t plan on using anything from @tenstorrent besides the driver. But yeah, the first part will be documenting everything. I’ll see how much can be extracted from the TT codebases. It will probably (P=0.95) fail, but hey, it’s fun to try!
the tiny corp@__tinygrad__·
@dhbrojas Agents will not be able to do anything close to mergeable. You'll end up with hacks, the tenstorrent stack isn't properly abstracted at all. The first task should be to completely document the chip to the level of the RDNA ISA manuals. This is probably a 2 year project to do right
罗杰斯 retweeted
the tiny corp@__tinygrad__·
@dhbrojas Looks like it's still missing a lot of Blackhole. It's crazy to me that $10M+ tapeouts are done without a full spec of each instruction + cycle accurate simulator.
罗杰斯@dhbrojas·
More seriously I think writing a software stack (runtime + compiler) for @tenstorrent is the perfect long-horizon task to evaluate coding agents on. I think I can probably spin it into a benchmark with different milestones, etc. Will report back!
gnosisfan@gnosisfanxyz·
@__tinygrad__ @tenstorrent Isn’t the TT arch meaningfully different to the multithreaded simd gpu cores TG currently targets so effectively? I think frameworks embracing the spatial dataflow possibilities of TT’s hardware is the right approach
罗杰斯 retweeted
the tiny corp@__tinygrad__·
.@tenstorrent when you are ready, we'll get you on MLPerf for $10M. Ground up stack, one 1MB pip install, zero C++20 (pure Python). From how many people I see on X trying to rewrite it, your current software approach isn't working.
罗杰斯@dhbrojas·
Give us your money and we'll give it back to you as discounted World Cup tickets for a headline, hurray!
罗杰斯@dhbrojas·
Mamdani's $50 World Cup tickets are distasteful. Ideological attempt at buying the sympathy of his voter base. Any marginal taxpayer dollar should go to essential functions of the city, the sick or the homeless.
will brown@willccbb·
this skill could’ve been a specialized subagent
Sumit Behal@sumitkbehal·
never ask a woman her age, a man his salary and a value investor his semiconductor exposure
罗杰斯@dhbrojas·
@remilouf That's far too much. Fundraising rounds should be taxed.
罗杰斯@dhbrojas·
@xeophon Finally we're having this conversation
Florian Brand@xeophon·
Imagine SF but European (drinking alcohol before/during/after work)
罗杰斯 retweeted
elie@eliebakouch·
separating infra and science for long context doesn't make sense, most long context science is about making computation and memory (capacity and bandwidth) feasible at scale. today's infra wouldn't support MHA on a 1T model at 1M context
Jack Morris@jxmnop

it is endlessly fascinating to me that we still don't have a true 1M-context model. it's an unusual case where the infra is far ahead of the science. Claude discontinued 1M+ context bc it didn't really work past ~200k. we don't have the right data? training techniques? not sure

Eiso Kant@eisokant·
@zhijianliu_ fan of what you're doing at z lab! Would love to chat about getting Laguna XS.2 dflashed (is that a term 😅)