Dry

72 posts

@DryLdn

Martinique · Joined October 2012
495 Following · 42 Followers
Dry retweeted
Paul Xue (@pxue)
Remember Devin? Apparently demo's fake. Paul is sad. 😥
Paul Xue tweet media
English
133
354
4.2K
894.1K
Dry retweeted
Andrej Karpathy (@karpathy)
A few new CUDA hacker friends joined the effort and now llm.c is only 2X slower than PyTorch (fp32, forward pass) compared to 4 days ago, when it was at 4.2X slower 📈
The biggest improvements were:
- turn on TF32 (NVIDIA TensorFloat-32) instead of FP32 for matmuls. This is a new math mode in GPUs starting with Ampere+. This is a very nice, ~free optimization that sacrifices a little bit of precision for a large increase in performance, by running the matmuls on tensor cores, while chopping off the mantissa to only 10 bits (the least significant 13 of the 23 fp32 mantissa bits get lost, leaving a 19-bit value). So the inputs, outputs and internal accumulates remain in fp32, but the multiplies are lower precision. Equivalent to PyTorch `torch.set_float32_matmul_precision('high')`
- call cuBLASLt API instead of cuBLAS for the sGEMM (fp32 matrix multiply), as this allows you to also fuse the bias into the matmul and deletes the need for a separate add_bias kernel, which caused a silly round trip to global memory for one addition.
- a more efficient attention kernel that uses 1) cooperative_groups reductions that look much cleaner and I only just learned about (they are not covered by the CUDA PMP book...), 2) the online softmax algorithm used in flash attention, 3) fused attention scaling factor multiply, 4) "built in" autoregressive mask bounds.
(big thanks to ademeure, ngc92, lancerts on GitHub for writing / helping with these kernels!)
Finally, ChatGPT created this amazing chart to illustrate our progress. 4 days ago we were 4.6X slower, today we are 2X slower. So we are going to beat PyTorch imminently 😂
Now (personally) going to focus on the backward pass, so we have the full training loop in CUDA.
Andrej Karpathy tweet media
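The TF32 point in the post above can be illustrated with a rough sketch in plain Python. It is not llm.c or NVIDIA code. The helper names `truncate_to_tf32` and `tf32_dot` are hypothetical, and the sketch assumes the low mantissa bits are simply cleared (the "chopping off" described in the post), whereas real hardware may round instead.

```python
import struct


def truncate_to_tf32(x: float) -> float:
    """Emulate TF32 precision by keeping only the top 10 mantissa bits of an fp32 value.

    Assumption: plain truncation (bits are cleared, not rounded).
    """
    # Reinterpret the value as its 32-bit IEEE-754 single-precision pattern.
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # fp32 has 23 mantissa bits; TF32 keeps 10, so clear the low 13.
    bits &= 0xFFFFE000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def tf32_dot(a, b):
    """Dot product with reduced-precision multiplies and full-precision accumulation.

    Note: Python floats are fp64, so the accumulator here is wider than the
    fp32 accumulator the post describes.
    """
    return sum(truncate_to_tf32(x) * truncate_to_tf32(y) for x, y in zip(a, b))


if __name__ == "__main__":
    x = 1.0 + 2.0 ** -11  # needs the 11th mantissa bit, which TF32 drops
    print(x, "->", truncate_to_tf32(x))
```

Values that fit in 10 mantissa bits pass through unchanged; anything finer is lost before the multiply, which is the precision trade-off the post mentions.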
English
109
350
4.2K
1.4M
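The "online softmax" mentioned in the llm.c post above computes the running maximum and the normalizer in a single pass over the inputs, rescaling the partial sum whenever a new maximum appears. Below is a minimal scalar sketch in Python, not the CUDA kernel from the post. The function name `online_softmax` is hypothetical, and the inputs are assumed to be finite numbers.

```python
import math


def online_softmax(xs):
    """Softmax using a single pass to find both the max and the normalizer.

    Assumption: all inputs are finite floats.
    """
    running_max = float("-inf")
    running_sum = 0.0
    for x in xs:
        new_max = max(running_max, x)
        # Rescale the sum accumulated so far to the new maximum,
        # then add the current element's contribution.
        running_sum = running_sum * math.exp(running_max - new_max) + math.exp(x - new_max)
        running_max = new_max
    # Second loop only normalizes; the max and the sum are already known.
    return [math.exp(x - running_max) / running_sum for x in xs]


if __name__ == "__main__":
    print(online_softmax([1.0, 2.0, 3.0]))
```

Because every exponent is taken relative to the running maximum, large inputs such as 1000.0 do not overflow, which is the same numerical-stability property the usual two-pass softmax has.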
Dry retweeted
Yohei (@yoheinakajima)
Seems like a busy game of hungry hungry hippo happened this weekend
English
4
3
52
6.2K
Dry retweeted
Tech Burrito (@TechBurritoUno)
Human evolution process
English
56
201
692
102.3K
Dry retweeted
🌴Le Média des Antilles🌴
On May 8, 1902, the town of Saint-Pierre, former capital of Martinique, was wiped out by a volcanic eruption 🌋🙏🏾
🌴Le Média des Antilles🌴 tweet media
French
0
239
551
29.3K
Dry retweeted
Outremers360 (@outremers360)
#Aérien (air travel): @CorsairFr has partnered with Guadeloupean chef Jimmy Bibrac for its business-class meals on flights departing from the #Antilles 🍽️🛫 A way to showcase "Creole #gastronomie" and the "Antillean terroir" ➡️bit.ly/3AJ5neO
Outremers360 tweet media
French
1
45
95
11.6K
Dry retweeted
Watcher.Guru (@WatcherGuru)
JUST IN: #Bitcoin whitepaper is hidden on every Apple MacBook computer running recent versions of macOS software.
English
597
3.1K
13.1K
2.9M
Dry retweeted
Elon Musk (@elonmusk)
Old joke about agnostic technologists building artificial super intelligence to find out if there’s a God. They finally finish & ask the question. AI replies: “There is now, mfs!!”
English
14.6K
19.1K
199.4K
57.9M
Dry retweeted
Prof. Feynman (@ProfFeynman)
There are two rules in life: 1) Never give out all the information.
English
339
3.8K
25.6K
3.6M
Dry retweeted
Ouragans.com (@ouragans)
The international tsunami preparedness exercise #CaribeWave2023 takes place today across the Caribbean starting at 10 a.m., with a scenario in which a flank of Mount Pelée (Montagne Pelée) collapses and causes a destructive wave. But do you know what to do in the event of a tsunami?
Ouragans.com tweet media
French
0
22
17
2.3K
Dry retweeted
Mediavenir (@Mediavenir)
🇫🇷 FLASH - "If the French were really angry, I would not have been re-elected a year ago," says Emmanuel #Macron. (France 2) #Macron13h #reformedesretraites
French
2.3K
3.4K
26.9K
5.3M
Dry retweeted
Tech Burrito (@TechBurritoUno)
Are you?
Tech Burrito tweet media
English
1K
1.8K
21.6K
1.6M
Dry retweeted
Elon Musk (@elonmusk)
Elon Musk tweet media
No linguistic content (zxx)
24.7K
43.8K
388.3K
53.6M
Dry retweeted
Elon Musk (@elonmusk)
“I used to be in crypto, but now I got interested in AI"
English
28.9K
19.3K
241K
50.9M
Dry retweeted
POOJA!!! (@PoojaMedia)
There was no VAR check for that penalty call. Wow
English
1.2K
2.4K
21.4K
1.8M
Dry retweeted
(fan) Trey (@UTDTrey)
Never seen them give one man so many penalties in one competition, this shit is rigged
English
2.4K
7.4K
56.1K
4M