kyle yu
@brrrkyle

53 posts

building gpu visualizations

Joined April 2026
31 Following · 163 Followers

Pinned Tweet
kyle yu @brrrkyle
this is how i wish i learned GPU fundamentals: not a lengthy textbook, not a static image. every concept is an interactive visualization, covering the SM architecture, memory coalescing, synchronization, and more. what concepts do you want to see next? brrrviz.com
3 replies · 8 reposts · 142 likes · 23.7K views
Pramod Goyal @goyal__pramod
It's a crime that more people have not read these beautiful blogs! Beautiful visuals, simple explanations, code anyone can understand. I have a new bar for my future blogs now...
[image]
9 replies · 75 reposts · 902 likes · 37K views
kyle yu @brrrkyle
Chapter 9 of BrrrViz walks you through both scenarios. brrrviz.com
0 replies · 0 reposts · 0 likes · 93 views
kyle yu @brrrkyle
The cost: serialization. Threads queue at the address one at a time. The more threads contend for the same location, the more your parallelism collapses into a bottleneck. This is why real GPU kernels accumulate locally in registers first, then do a single atomicAdd at the end.
1 reply · 0 reposts · 1 like · 105 views
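A minimal sketch of that pattern (the kernel and names are illustrative, not taken from BrrrViz): each thread folds its share of the input into a private register with a grid-stride loop, then issues exactly one atomicAdd, so contention scales with the number of threads rather than the number of elements.

```cuda
#include <cuda_runtime.h>

// Sum n floats into *out. Each thread accumulates privately, then
// contends for the shared address only once at the very end.
__global__ void sum_kernel(const float* in, float* out, int n) {
    int idx    = blockIdx.x * blockDim.x + threadIdx.x;
    int stride = gridDim.x * blockDim.x;

    float local = 0.0f;                  // per-thread accumulator in a register
    for (int i = idx; i < n; i += stride)
        local += in[i];                  // no contention: private accumulation

    atomicAdd(out, local);               // one serialized update per thread
}
```

A block-level reduction in shared memory before the atomic cuts the contended updates further, to one per block.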
kyle yu @brrrkyle
Most GPU bugs don't crash your program. They just give you the wrong answer. Silently. When thousands of threads try to update the same memory address simultaneously, each one does three things:
📖 read the current value
⚡ execute their computation
✍ write back the result
[image]
1 reply · 1 repost · 2 likes · 184 views
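A minimal sketch of that failure mode (an assumed example, not code from the thread): 65,536 threads each run the three steps above on one counter without atomics, the interleaved reads and writes clobber each other, and the result comes back far short of 65,536 with no error raised.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Every thread does a non-atomic read-modify-write on the same address.
__global__ void racy_count(int* counter) {
    int v = *counter;   // 1. read the current value
    v = v + 1;          // 2. execute the computation
    *counter = v;       // 3. write back the result (may overwrite other threads)
}

int main() {
    int* counter;
    cudaMallocManaged(&counter, sizeof(int));
    *counter = 0;
    racy_count<<<256, 256>>>(counter);   // 65,536 threads, one address
    cudaDeviceSynchronize();
    printf("expected 65536, got %d\n", *counter);  // silently wrong
    cudaFree(counter);
    return 0;
}
```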
Zak 🦈 (e/acc) @ZakShark
Train yourself in inference/kernel engineering. Knowing how to properly optimize GPU kernels in inference workloads is worth its weight in gold. Mastering CUDA or Triton, vLLM, SGLang, TensorRT-LLM is a real plus if you want to stand out as an AI/ML Engineer in 2026-2027.
11 replies · 47 reposts · 499 likes · 21.1K views
Banshee @Banshee2507
Started learning CUDA today. Parallel computing feels like a whole new mindset. Any tips, resources, or beginner pitfalls I should know about? #CUDA #GPU #Learning
[image]
20 replies · 8 reposts · 144 likes · 8.6K views
himanshu @retr0sushi_
always a beginner :) ps: if you have resources or roadmaps, don't be shy to share them with me pls!
[images]
6 replies · 1 repost · 42 likes · 3K views
kyle yu @brrrkyle
Chasing utilization without this perspective often means optimizing the wrong thing. Understanding where your kernel sits on this diagram helps you choose the right optimizations. Find it in chapter 3 of BrrrViz 👉 brrrviz.com
0 replies · 0 reposts · 0 likes · 58 views
kyle yu @brrrkyle
Memory-bound means your hardware is waiting on data. Fix data movement, locality, and reuse. Compute-bound means the data is there, but the math is slow on the hardware. Fix precision, use tensor cores, or change the instruction path.
[image]
1 reply · 0 reposts · 0 likes · 63 views
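A quick worked check of which regime a kernel is in (the hardware numbers below are illustrative assumptions, not a specific card): compare its arithmetic intensity, FLOPs per byte moved, against the machine's ridge point, peak FLOP/s divided by memory bandwidth.

```latex
% SAXPY (y_i <- a * x_i + y_i) does 2 FLOPs per element and moves 12 bytes:
% read x_i, read y_i, write y_i, at 4 bytes each.
\[
  I_{\mathrm{SAXPY}} = \frac{2\ \mathrm{FLOPs}}{12\ \mathrm{bytes}}
                     \approx 0.17\ \mathrm{FLOP/byte},
  \qquad
  I_{\mathrm{ridge}} = \frac{60\ \mathrm{TFLOP/s}}{3\ \mathrm{TB/s}}
                     = 20\ \mathrm{FLOP/byte}
\]
% 0.17 << 20, so SAXPY sits under the bandwidth roof: memory-bound.
% A large matmul with intensity well above 20 would instead be compute-bound.
```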
kyle yu @brrrkyle
Stop tuning the wrong bottleneck. GPU optimization isn't one ceiling; it's memory bandwidth vs peak compute. The roofline plots both, so you see which one limits your kernel.
[image]
1 reply · 0 reposts · 2 likes · 88 views
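The model behind that plot, in one line (the standard roofline formulation, with symbols defined here rather than in the tweet): attainable throughput at a given arithmetic intensity is capped by whichever roof is lower.

```latex
% P_max: peak compute (FLOP/s), B: memory bandwidth (byte/s),
% I: arithmetic intensity of the kernel (FLOP/byte).
\[
  P(I) = \min\bigl(P_{\max},\; B \cdot I\bigr),
  \qquad
  I_{\mathrm{ridge}} = \frac{P_{\max}}{B}
\]
% Left of the ridge point, the bandwidth roof binds (memory-bound);
% right of it, the compute roof binds (compute-bound).
```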