Alex Zhurkevich

339 posts

@cudagdb

Select difficulty 👁️‍🗨️

United States · Joined December 2017
540 Following · 578 Followers
Alex Zhurkevich@cudagdb·
@Simon_Vt I think the flashinfer repo is Apache 2. Pls consult with your lawyer.
Alex Zhurkevich@cudagdb·
Trtllmgen kernels are now open. Fastest prefill and decode kernels for our target workloads. We wrote these to win InferenceX, MLPerf, other benchmarks. Powering some of today’s top served models. Dive in, learn, use them, or level up your own. Enjoy. github.com/flashinfer-ai/…
Alex Zhurkevich@cudagdb·
Give Meta 5 years of life, they'll call you a pokemon.
Soumith Chintala@soumithchintala

@marksaroufim You build communities that last, and that’s rare. GPU Mode, PyTorch… they worked because you showed up with honesty and zero ego. This makes you a rare pokemon. Thanks for your service. Excited for what’s next. I'm sure you’re definitely not done leveling up.

Alex Zhurkevich@cudagdb·
@Leik0w0 Trtllmgen predates the CuTe DSL. We prefer writing CUDA+PTX kernels, though CUTLASS and CuTe are also excellent. It’s just a matter of style - our code even uses some CUTLASS components like pipelines.
Léo@Leik0w0·
@cudagdb Insane drop! I’m curious though, why not CuTe DSL? Is this performance not yet achievable with CuTe DSL?
bone@boneGPT·
Elon won. Sora is defeated. Long live Grok Imagine.
Alex Zhurkevich@cudagdb·
Pay attention to this one. Anne and folks are cooking smth special 👩🏻‍🍳
Anne Ouyang@anneouyang

Excited to share @Standard_Kernel's seed round and some reflections on what we’ve learned about kernel generation and what we believe is next. Grateful to our amazing team, supporters, and the broader community pushing this space forward.

Alex Zhurkevich retweeted
Kion@OKfallah·
In-context learning is a hack to remind your model. CLaaS uses self-distillation to move that knowledge into weights, freeing up context.
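The self-distillation idea in that tweet can be illustrated with a toy sketch. This is a hypothetical illustration, not CLaaS's actual method: treat the model conditioned on in-context examples as a "teacher" distribution, and the bare model (no context) as a "student", then move the teacher's knowledge into the student's weights by gradient descent on the KL divergence. Here both are reduced to a single-logit two-class softmax so the mechanics are visible.

```python
import math

def softmax2(logit):
    """Two-class softmax parameterized by a single logit."""
    p = 1.0 / (1.0 + math.exp(-logit))
    return [p, 1.0 - p]

def kl(p, q):
    """KL(p || q) for two discrete distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

# "Teacher" = model + in-context examples: strongly prefers class 0
# (logit 2.0 is an arbitrary toy value).
teacher = softmax2(2.0)

# "Student" = bare model without the context, starting near uniform.
w = 0.0
lr = 0.5
for _ in range(200):
    # d/dw KL(teacher || softmax2(w)) simplifies to student_p0 - teacher_p0.
    grad = softmax2(w)[0] - teacher[0]
    w -= lr * grad

student = softmax2(w)
# After distillation the student reproduces the teacher's prediction
# from its weights alone - the context is no longer needed at inference.
```

In a real system the teacher and student share the full model's weights, the distributions are next-token softmaxes over the vocabulary, and the loss is summed over sampled prompts; the gradient identity above is the same one that makes KL-to-softmax distillation cheap to compute.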