alvin

422 posts

alvin banner
alvin

alvin

@e3he0

19, data eng

mars Katılım Eylül 2024
264 Takip Edilen76 Takipçiler
alvin
alvin@e3he0·
anyone at yc??? wld love to meet you!
alvin tweet mediaalvin tweet media
English
1
0
2
41
alvin
alvin@e3he0·
which can be loaded into cache somehow but even if we managed to do that we could only do that for one token after activation vector is multiplied with weights its useless and should be disregarded because its in cache and you need it again when you do the same ops another token
English
0
0
1
28
alvin
alvin@e3he0·
Modern Computer architecture works on assumption that if you access X, then you'll prob access X again sooner and it works like magic fuck that was genius but when itcomes to llm for the the weights size is in megabytes
English
1
0
1
35
alvin
alvin@e3he0·
Ran a 1B parameter LLM on my CPU and profiled it. 30 seconds total. 0.058 seconds of actual compute. 39.76% cache miss rate. The CPU spent 99.8% of the time waiting for data. Inference isn't slow because your CPU is weak. It's slow because weights can't move fast enough from RAM.
alvin tweet media
English
2
1
2
57
alvin
alvin@e3he0·
wrote forward propagation by hand on paper today just to actually understand it. not gonna lie derivatives and the idea of slope finally clicked. building toward making inference cheaper on my RX 7600. Maybe JUST maybe its a pipedream but whatever...
English
0
0
2
24
alvin retweetledi
Atlas Press
Atlas Press@realAtlasPress·
Nassim Taleb, damn
Atlas Press tweet media
English
107
1.7K
11.8K
357.8K
alvin
alvin@e3he0·
been so longg
English
1
0
1
36
alvin
alvin@e3he0·
i wanna redesign the whole internet
English
1
0
3
58
alvin
alvin@e3he0·
kinda got employed latelyyy
English
0
0
2
61
alvin
alvin@e3he0·
@Hi_Mrinal we all know that you’ll get it 🤌
English
0
0
1
79
Mrinal
Mrinal@Hi_Mrinal·
Applied to Anthropic as a SDE
Mrinal tweet mediaMrinal tweet media
English
44
29
2.1K
105.5K
Mike Chong
Mike Chong@realMikeChong·
@mazeincoding capcut: - expensive - stealing data / copyright - slow and buggy opencut: - free - opensource - founder is a great ** poster on X!
English
2
0
15
1.9K
Maze
Maze@mazeincoding·
claude: - expensive - doesn't follow instructions - is lazy gpt 5: - cheaper - follows instructions - is not lazy GPT 5 IS BETTER THAN CLAUDE
English
307
59
1.7K
143.2K
alvin
alvin@e3he0·
@RubenVeidt lately i’ve been loving your posts a bit too muchh
English
1
0
1
156
Ruben Veidt
Ruben Veidt@RubenVeidt·
to appreciate css, one must write vulkan , to use vulkan to the max, one must understand css this 2 line of css .container { height: 100vh; } to do the same in vulkan, you need to Initialize the vulkan library, select a physical gpu, create a logical device and command queues, create a window surface and a swapchain to present images to it, setup up a render pass that describes the structure of a rendering operation, define the entire graphics pipeline, including vertex shaders, fragment shaders, rasterization state, blending state, etc...., create framebuffers, allocate and manage memory for the vertex data, create command buffers and record the exact drawing commands, use fences and semaphores to synchronize the cpu and gpu to make things happen in the correct order this is not even recreating the css stuff, its just setting up vulkan to even start doing that the more you understand what's happening under the hood, the more you appreciate when and what you don't have to think about it
English
6
0
82
3.6K
alvin
alvin@e3he0·
I seee…
alvin tweet media
0
0
0
71
alvin
alvin@e3he0·
@theo its another way of procrastination lol
English
0
0
0
67
Theo - t3.gg
Theo - t3.gg@theo·
I really hate the “market your product before building it” trend.
English
327
147
4.1K
447K