alvin

422 posts

@e3he0

19, data eng

mars · Joined September 2024

264 Following · 76 Followers

alvin@e3he0·18 Apr

anyone at yc??? wld love to meet you!


alvin@e3he0·8 Apr

yhhh okayy we’ll be at @ycombinator soonn


alvin@e3he0·15 Mar

which could be loaded into cache somehow, but even if we managed that, it would only help for one token. after the activation vector is multiplied with the weights, the cache treats them as used up and drops them, yet you need them again when you do the same ops for the next token
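A rough sketch of the point in this thread, assuming a toy LRU cache (real CPU caches are set-associative and use other replacement policies, so treat this as an illustration only). The function name `lru_hit_rate` and the block counts are made up for the example. It streams the same "weight blocks" once per token and counts how often a block is still in cache.

```python
from collections import OrderedDict


def lru_hit_rate(num_blocks, cache_blocks, tokens):
    """Stream blocks 0..num_blocks-1 once per token through a toy LRU cache.

    Returns the fraction of accesses that hit the cache.
    """
    cache = OrderedDict()  # keys are block ids, ordered oldest -> newest
    hits = 0
    accesses = 0
    for _ in range(tokens):
        for block in range(num_blocks):
            accesses += 1
            if block in cache:
                hits += 1
                cache.move_to_end(block)  # mark as most recently used
            else:
                cache[block] = True
                if len(cache) > cache_blocks:
                    cache.popitem(last=False)  # evict least recently used
    return hits / accesses


# Hypothetical sizes: weights 10x larger than the cache vs. weights that fit.
print(lru_hit_rate(num_blocks=1000, cache_blocks=100, tokens=4))  # 0.0
print(lru_hit_rate(num_blocks=50, cache_blocks=100, tokens=4))    # 0.75
```

When the weights are bigger than the cache, every block is evicted before the next token comes back for it, so reuse never pays off. When they fit, only the first token misses.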


alvin@e3he0·15 Mar

Modern computer architecture works on the assumption that if you access X, you'll prob access X again soon, and it works like magic. fuck, that was genius. but when it comes to LLMs, the weights are megabytes in size


alvin@e3he0·15 Mar

Ran a 1B parameter LLM on my CPU and profiled it. 30 seconds total. 0.058 seconds of actual compute. 39.76% cache miss rate. The CPU spent 99.8% of the time waiting for data. Inference isn't slow because your CPU is weak. It's slow because weights can't move fast enough from RAM.
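The 99.8% figure follows directly from the two timings in the post. A minimal sketch of that arithmetic is below, plus a back-of-envelope lower bound on time per token when every weight has to be read from RAM once. The helper names are invented for illustration, and the 2 bytes per parameter and 20 GB/s bandwidth are assumptions, not measurements from the post.

```python
def waiting_fraction(total_s, compute_s):
    """Fraction of wall-clock time not spent on actual compute."""
    return 1.0 - compute_s / total_s


def min_seconds_per_token(weight_bytes, bandwidth_bytes_per_s):
    """Lower bound on time per token if all weights stream from RAM once."""
    return weight_bytes / bandwidth_bytes_per_s


# Numbers from the post: 30 s total, 0.058 s of compute.
print(f"{waiting_fraction(30.0, 0.058):.1%}")  # 99.8%

# Assumed, not measured: 1B params at 2 bytes each, 20 GB/s memory bandwidth.
assumed_weight_bytes = 1_000_000_000 * 2
assumed_bandwidth = 20e9
print(min_seconds_per_token(assumed_weight_bytes, assumed_bandwidth))
```

Under those assumed numbers the bandwidth bound alone is about a tenth of a second per token, regardless of how fast the cores are.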


alvin@e3he0·10 Mar

wrote forward propagation by hand on paper today just to actually understand it. not gonna lie, derivatives and the idea of slope finally clicked. building toward making inference cheaper on my RX 7600. Maybe JUST maybe it's a pipe dream but whatever...


alvin reposted

Jamieson O'Reilly@theonejvo·25 Jan

x.com/i/article/2015…


alvin@e3he0·9 Jan

@portal2urmom @grok whats the context


gordon freeman from black mesa@portal2urmom·8 Jan

Everyone! Free up like, 61.67GB on your drive! Now!


alvin@e3he0·9 Jan

probably the cleanest context engineering guide that i've read in a while... github.com/humanlayer/adv…


alvin reposted

Atlas Press@realAtlasPress·3 Jan

Nassim Taleb, damn


alvin@e3he0·28 Dec

been so longg


alvin@e3he0·28 Sep

i wanna redesign the whole internet


alvin@e3he0·26 Sep

kinda got employed latelyyy


alvin@e3he0·18 Sep

@Hi_Mrinal we all know that you’ll get it 🤌


Mrinal@Hi_Mrinal·17 Sep

Applied to Anthropic as a SDE


alvin@e3he0·17 Sep

@WildCat_io @mazeincoding @grok wtf does this mean

Mike Chong@realMikeChong·16 Sep

@mazeincoding
capcut:
- expensive
- stealing data / copyright
- slow and buggy
opencut:
- free
- opensource
- founder is a great ** poster on X!

Maze@mazeincoding·16 Sep

claude:
- expensive
- doesn't follow instructions
- is lazy
gpt 5:
- cheaper
- follows instructions
- is not lazy
GPT 5 IS BETTER THAN CLAUDE


alvin@e3he0·15 Sep

@RubenVeidt lately i’ve been loving your posts a bit too muchh

Ruben Veidt@RubenVeidt·14 Sep

to appreciate css, one must write vulkan. to use vulkan to the max, one must understand css
these 2 lines of css:
.container { height: 100vh; }
to do the same in vulkan, you need to:
- initialize the vulkan library
- select a physical gpu
- create a logical device and command queues
- create a window surface and a swapchain to present images to it
- set up a render pass that describes the structure of a rendering operation
- define the entire graphics pipeline, including vertex shaders, fragment shaders, rasterization state, blending state, etc.
- create framebuffers
- allocate and manage memory for the vertex data
- create command buffers and record the exact drawing commands
- use fences and semaphores to synchronize the cpu and gpu to make things happen in the correct order
this is not even recreating the css stuff, it's just setting up vulkan to even start doing that
the more you understand what's happening under the hood, the more you appreciate when and what you don't have to think about

alvin@e3he0·6 Sep

I seee…