Zeke
@meep414

5 posts
Joined September 2025
47 Following · 0 Followers
Zeke @meep414 ·
@clattner_llvm Does the solver recover the exact same schedule as flash attention? It would be cool if there were multiple optimal solutions or if it found something faster.
1 reply · 0 reposts · 0 likes · 113 views
Chris Lattner @clattner_llvm ·
Pipelining AI kernels is required to get full perf/utilization out of modern chips. However, no one has been able to crack "full control over the hardware" without "having to micromanage it". Let's crack this open: kernel authors deserve a powerful scheduler they can control. 💪
Modular @Modular

FA4 on Blackwell: 14 ops, 5 hardware units, 28 dependency edges. One wrong sync = a race condition sanitizers won't catch. We built a constraint solver that derives the pipeline schedule automatically, in Mojo 🔥 Part 1 of our series is out → modular.com/blog/software-…

9 replies · 16 reposts · 255 likes · 24.6K views
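The core idea in the thread above — feed a dependency DAG plus hardware-unit constraints to a solver and let it derive the pipeline schedule — can be sketched in miniature. This is not Modular's solver; the op names, units, and the brute-force search are all made up for illustration, and a real tool would use a proper constraint solver instead of enumerating start times.

```python
# Toy schedule derivation: given ops with a hardware unit and dependencies,
# find start cycles such that (a) every dependency finishes before its
# consumer starts and (b) no unit issues two ops in the same cycle.
# Brute force over start times; a real solver searches this space smartly.
from itertools import product

# Hypothetical mini attention kernel: op -> (hardware unit, dependencies).
ops = {
    "load_q":  ("tma",    []),
    "load_k":  ("tma",    []),
    "qk_mma":  ("tensor", ["load_q", "load_k"]),
    "softmax": ("simt",   ["qk_mma"]),
    "pv_mma":  ("tensor", ["softmax"]),
}

def valid(schedule):
    # Each op takes one cycle; a consumer starts after its deps finish.
    for op, (_unit, deps) in ops.items():
        if any(schedule[d] + 1 > schedule[op] for d in deps):
            return False
    # A hardware unit can issue at most one op per cycle.
    busy = set()
    for op, (unit, _deps) in ops.items():
        key = (unit, schedule[op])
        if key in busy:
            return False
        busy.add(key)
    return True

names = list(ops)
horizon = range(len(ops))
best = min(
    (dict(zip(names, starts))
     for starts in product(horizon, repeat=len(names))
     if valid(dict(zip(names, starts)))),
    key=lambda s: max(s.values()),  # minimize the latest start cycle
)
print(best)
```

Note that `load_q` and `load_k` both want the TMA unit, so the solver is forced to stagger them — exactly the kind of resource conflict that, resolved by hand with manual syncs, invites the race conditions the tweet warns about. It also answers Zeke's question in spirit: multiple optimal schedules can exist (the two loads can swap), and the solver simply returns one of them.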
Zeke @meep414 ·
@_xjdr What do you like about them?
0 replies · 0 reposts · 0 likes · 10 views
xjdr @_xjdr ·
gpt-oss and gemma-3n are such good architectures. i didn't really appreciate them until i really put them to the test through rigorous ablations
7 replies · 7 reposts · 186 likes · 12.2K views
Elliot Arledge @elliotarledge ·
After diving head first into the deep end of squeezing every last drop out of inference megakernels, I decided to write a book about my journey, as well as how others like @AlpinDale and @HazyResearch architect their megakernels on hopper and blackwell. This assumes comfort with CUDA and LLM inference. elliotarledge.gumroad.com/l/grokking-meg…
4 replies · 7 reposts · 129 likes · 4.9K views
Zeke @meep414 ·
@alz_zyd_ the normie's confusion with calculus is totally understandable. suppose i told you in 1650 that, in 1660, i could produce a closed form equation for the area under any continuous function between two points. you'd obviously think i'm a complete nutcase
0 replies · 0 reposts · 1 like · 7 views
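The "closed form for the area under any continuous function" in Zeke's tweet is the fundamental theorem of calculus: the area from a to b is just F(b) − F(a) for any antiderivative F. A quick numeric sanity check for f(x) = x² on [0, 1], comparing the closed form against a midpoint Riemann sum:

```python
# Fundamental theorem of calculus, checked numerically:
# area under f(x) = x^2 on [0, 1] equals F(1) - F(0) with F(x) = x^3 / 3.
f = lambda x: x * x
F = lambda x: x ** 3 / 3  # antiderivative of x^2

n = 100_000
riemann = sum(f((i + 0.5) / n) / n for i in range(n))  # midpoint rule
closed = F(1) - F(0)                                    # = 1/3

assert abs(riemann - closed) < 1e-6
print(closed, riemann)
```

The Riemann sum needs 100,000 evaluations to approximate what the antiderivative delivers in one subtraction — which is why the claim would have sounded like magic in 1650.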
alz @alz_zyd_ ·
The normies' confusion with AI is totally understandable. Suppose I told you in 2015 that, in 2025, gradient descent would produce black magic billion-parameter bots which could think and talk and solve math questions. You'd obviously think I'm a complete nutcase
86 replies · 66 reposts · 1.2K likes · 69.6K views
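The gradient descent the tweet refers to fits in a few lines. A minimal sketch with one parameter instead of a billion — recovering the slope of y = 2x from data with the same update rule that trains the "black magic" bots:

```python
# One-parameter gradient descent on mean squared error:
# fit w in y = w * x to data generated with true slope 2.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2 * x for x in xs]

w, lr = 0.0, 0.02
for _ in range(200):
    # Gradient of mean((w*x - y)^2) with respect to w.
    grad = sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)
    w -= lr * grad

print(round(w, 3))  # → 2.0
```

Scaling this loop from one parameter to billions, and from a line fit to next-token prediction, is (conceptually) the entire jump from 2015 to 2025.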
Greg Kamradt @GregKamradt ·
Open questions with Gemini Deep Think and @arcprize:
- Why doesn't it get 100% on ARC-AGI? We're trying to understand failure modes, need to inspect tasks more
- Why a slight jump in ARC-AGI-1, but a 2x SOTA in ARC-AGI-2?
Here are tasks that stood out in our early testing:
[image attached]
Greg Kamradt @GregKamradt

After we got early access to Gemini 3 Pro, it was SOTA, we were impressed and then Google told us, "there's one more thing..." Deep Think sets the new high water mark on ARC-AGI-2

11 replies · 25 reposts · 490 likes · 103K views