Dry

72 posts

Dry

@DryLdn

Martinique Katılım Ekim 2012

495 Takip Edilen42 Takipçiler

Dry retweetledi

Massimo@Rainmaker1973·17 Nis

Using an hologram fan to visualize 360° ancient relics. twitter.com/i/status/17806…

English

556

13K

127.5K

12.6M

Dry retweetledi

Paul Xue@pxue·13 Nis

Remember Devin? Apparently demo's fake. Paul is sad. 😥

English

133

354

4.2K

894.1K

Dry retweetledi

Andrej Karpathy@karpathy·13 Nis

A few new CUDA hacker friends joined the effort and now llm.c is only 2X slower than PyTorch (fp32, forward pass) compared to 4 days ago, when it was at 4.2X slower 📈 The biggest improvements were: - turn on TF32 (NVIDIA TensorFLoat-32) instead of FP32 for matmuls. This is a new mathmode in GPUs starting with Ampere+. This is a very nice, ~free optimization that sacrifices a little bit of precision for a large increase in performance, by running the matmuls on tensor cores, while chopping off the mantissa to only 10 bits (the least significant 19 bits of the float get lost). So the inputs, outputs and internal accumulates remain in fp32, but the multiplies are lower precision. Equivalent to PyTorch `torch.set_float32_matmul_precision('high')` - call cuBLASLt API instead of cuBLAS for the sGEMM (fp32 matrix multiply), as this allows you to also fuse the bias into the matmul and deletes the need for a separate add_bias kernel, which caused a silly round trip to global memory for one addition. - a more efficient attention kernel that uses 1) cooperative_groups reductions that look much cleaner and I only just learned about (they are not covered by the CUDA PMP book...), 2) the online softmax algorithm used in flash attention, 3) fused attention scaling factor multiply, 4) "built in" autoregressive mask bounds. (big thanks to ademeure, ngc92, lancerts on GitHub for writing / helping with these kernels!) Finally, ChatGPT created this amazing chart to illustrate our progress. 4 days ago we were 4.6X slower, today we are 2X slower. So we are going to beat PyTorch imminently 😂 Now (personally) going to focus on the backward pass, so we have the full training loop in CUDA.

English

109

350

4.2K

1.4M

Dry retweetledi

Yohei@yoheinakajima·20 Kas

Seems like a busy game of hungry hungry hippo happened this weekend

English

6.2K

Dry retweetledi

Tech Burrito@TechBurritoUno·13 Eki

Human evolution process

English

201

692

102.3K

Dry retweetledi

🌴Le Média des Antilles🌴@antilles97x·8 May

Le 8 Mai 1902, la Ville de Saint-Pierre, ancienne capitale de la Martinique, disparaissait suite à une éruption Volcanique 🌋🙏🏾

Français

239

551

29.3K

Dry retweetledi

Outremers360@outremers360·1 May

#Aérien : @CorsairFr s’est associé au chef guadeloupéen Jimmy Bibrac pour ses repas en business au départ des #Antilles 🍽️🛫Une façon de mettre en avant "la #gastronomie créole" et le "terroir antillais" ➡️bit.ly/3AJ5neO

Français

11.6K

Dry retweetledi

Watcher.Guru@WatcherGuru·6 Nis

JUST IN: #Bitcoin whitepaper is hidden on every Apple MacBook computer running recent versions of macOS software.

English

597

3.1K

13.1K

2.9M

Dry retweetledi

Elon Musk@elonmusk·30 Mar

Old joke about agnostic technologists building artificial super intelligence to find out if there’s a God. They finally finish & ask the question. AI replies: “There is now, mfs!!”

English

14.6K

19.1K

199.4K

57.9M

Dry retweetledi

Prof. Feynman@ProfFeynman·29 Mar

There are two rules in life: 1) Never give out all the information.

English

339

3.8K

25.6K

3.6M

Dry retweetledi

Ouragans.com@ouragans·23 Mar

L'exercice international de préparation aux tsunamis #CaribeWave2023 se déroule aujourd'hui dans toute la Caraïbe à partir de 10h, avec comme scénario, l'effondrement d'un pan de la Montagne Pelée causant une vague destructrice. Mais savez-vous quoi faire en cas de tsunami ?

Français

2.3K

Dry retweetledi

Mediavenir@Mediavenir·22 Mar

🇫🇷 FLASH - "Si les Français étaient vraiment en colère, je n'aurais pas été réélu il y a un an", affirme Emmanuel #Macron. (France 2) #Macron13h #reformedesretraites

Français

2.3K

3.4K

26.9K

5.3M

Dry retweetledi

Tech Burrito@TechBurritoUno·20 Mar

Are you?