grant pal

8K posts

@itsgrantpal

a little bit of this, and a little bit of that

Chicago, IL · Joined November 2007
570 Following · 456 Followers
Pinned Tweet
grant pal
grant pal@itsgrantpal·
A reply personified in a structure jsonified from tokens classified by an algorithm amplified with hardware commodified in a world forever unsatisfied
English
0
0
3
2.6K
Sam
Sam@SamSullivan·
Old nuclear site on the left. Indian Point. Also, happy 15 years on X to me. Kinda miss the old days. Road tripping upstate this weekend. ☀️☀️☀️
Sam tweet media
English
3
1
8
208
gabriel
gabriel@gabriel1·
this took so long for me to understand: the bottleneck to more innovation is not more high-intelligence people, but more people having an interest in hard problems. it's impossible to create new useful things if you don't get immense happiness from making that thing
English
68
105
1.4K
150.7K
grant pal
grant pal@itsgrantpal·
@OrdinaryInds people who use hand sanitizer end up with these keyboards 3-5x as fast btw
English
0
0
1
88
Max Weinbach
Max Weinbach@mweinbach·
You can look at the math to complete the operation and the memory bandwidth to generate a token. Both of these are set in hardware as peak performance. You can make the math less intensive (generally helps prefill), but decode is bound by memory bandwidth. You can speed this up with smaller models for speculative decoding (a small model generates a token and the larger model approves or denies it), but you still have a compute cost that’s limited and that you’re able to calculate. You could MAYBE do 30 tok/s, but this doesn’t meaningfully change it. There are bottlenecks everywhere.
English
2
0
0
57
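A rough sketch of the memory-bandwidth bound described in the tweet above, assuming each decoded token has to stream every model weight from memory once. The function name and the example numbers (1000 GB/s, 70B parameters) are illustrative assumptions, not figures from the thread, and the estimate ignores KV-cache traffic, batching, and sparse or mixture-of-experts models.

```python
def decode_tokens_per_second(bandwidth_gb_s, params_billions, bytes_per_param):
    """Upper bound on decode speed for a memory-bound model.

    Assumes (hypothetically) that generating one token reads all weights once,
    so time per token is at least model_size / memory_bandwidth.
    """
    # billions of parameters * bytes per parameter = model size in GB
    model_gb = params_billions * bytes_per_param
    return bandwidth_gb_s / model_gb


# Illustrative numbers only: a 70B-parameter model on 1000 GB/s of bandwidth.
fp16_bound = decode_tokens_per_second(1000, 70, 2.0)   # 16-bit weights
int4_bound = decode_tokens_per_second(1000, 70, 0.5)   # 4-bit weights
print(f"fp16 bound: {fp16_bound:.1f} tok/s, 4-bit bound: {int4_bound:.1f} tok/s")
```

Under these assumptions, shrinking the bytes read per token (quantization, a smaller model) raises the ceiling, while software tuning can only move real throughput closer to the ceiling for a given model and hardware.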
Roy
Roy@usr_bin_roygbiv·
@wolfie_ rent some blackwells
English
1
0
4
247
wolfie
wolfie@wolfie_·
glm 5.2 at glm 5.1 speeds would be so OP
San Francisco, CA 🇺🇸 English
2
0
10
507
CXCarroll
CXCarroll@CXCarroll·
@itsgrantpal @mweinbach The software primitives for mathematical operations are written in Assembly and are highly optimized for each hardware stack (Intel OneMKL, AMD BLIS, etc.). The engineers know what the theoretical limits are (because it's math) and they're basically at the limit.
English
1
0
0
63
grant pal
grant pal@itsgrantpal·
@hotschmoe 2-3 years earlier than I was dabbling with deltas! and even then ('17) it was like spinning 200 plates to yield a successful print. couldn't imagine..
English
0
0
1
13
StrongEngineer_
StrongEngineer_@hotschmoe·
even though we had a $10,000+ printer in the office, it was still genuinely faster to build models with foam board (circa 2015)
StrongEngineer_ tweet media (2 images)
grant pal@itsgrantpal

@hotschmoe early 3d print days were ROUGH

English
1
0
1
113
grant pal
grant pal@itsgrantpal·
@CXCarroll @mweinbach i still believe optimizing the stack yields benefits beyond calling hardware an immovable metric
English
1
0
1
141
CXCarroll
CXCarroll@CXCarroll·
@itsgrantpal @mweinbach Inference is basically a ton of linear algebra. From a Comp Sci standpoint, math like that is "solved" in terms of how much can theoretically be done on a given piece of hardware in a given period of time. The software primitives for math are highly optimized. Max is correct.
English
1
0
1
148
ham mike
ham mike@griffraff97·
$55 estate sale loot crate
ham mike tweet media (2 images)
Romanian
6
3
107
5K
Alex Kravetz
Alex Kravetz@adkravetz·
Sealed the countertop, conditioned the butcher block AND changed my HVAC filter in one day. pls clap
Alex Kravetz tweet media
English
2
0
16
1.8K
OldestZoomer
OldestZoomer@Mhenderson550·
Me and dookers vs the world
OldestZoomer tweet media
English
5
0
22
396
grant pal
grant pal@itsgrantpal·
i have very few regrets in life but malort is one of them
English
0
0
0
19
grant pal
grant pal@itsgrantpal·
@mweinbach i guess i don't think I follow and i don't have the qualities to state my position well enough
English
1
0
0
160
Max Weinbach
Max Weinbach@mweinbach·
@itsgrantpal This is at peak hardware performance. You can calculate what the theoretical best is pretty well. That’s the theoretical best.
English
1
0
5
178