omkaar (@omkizzy) - Twitter Profili | Zamantika Mersobahis Locabet

Sabitlenmiş Tweet

omkaar@omkizzy·2d

I hand-wrote a 500-LoC RL stack to make hacking on RL research much easier. Most RL stacks are either massive and unhackable, or duct-taped research scripts. I am open-sourcing Mithrl, a modular RLVR stack. Next items on my checklist: adding more complex environment examples, supporting multi-gpu + async RL, and QoL fixes. I might scrap external runtime dependencies (Huggingface PEFT + vLLM) and write purpose-built, simpler versions from scratch if I feel the need. If you want to experiment with RL and are looking to own sovereign tools, I’d love to get on call, understand your requirements and help integrate for free.

English

19

167

12.8K

omkaar@omkizzy·12h

@emptysaysstuff pls do and tag me, if it's awesome i'd love to repost

English

1

0

1

54

mehul@emptysaysstuff·21h

this is so cool… might try doing something cool with it too😛

omkaar@omkizzy

I hand-wrote a 500-LoC RL stack to make hacking on RL research much easier. Most RL stacks are either massive and unhackable, or duct-taped research scripts. I am open-sourcing Mithrl, a modular RLVR stack. Next items on my checklist: adding more complex environment examples, supporting multi-gpu + async RL, and QoL fixes. I might scrap external runtime dependencies (Huggingface PEFT + vLLM) and write purpose-built, simpler versions from scratch if I feel the need. If you want to experiment with RL and are looking to own sovereign tools, I’d love to get on call, understand your requirements and help integrate for free.

English

1

0

1

147

omkaar@omkizzy·1d

@levidiamode very cool

English

1

0

1

117

levi@levidiamode·1d

Day 74/365 of GPU Programming I always found die shots and SM diagrams beautiful but difficult to map mentally, so I've been trying to find a way to interact with GPUs in 3D. This is what I have so far: a single input that goes through a simplified H100 execution pipeline to see what the silicon is doing at each step; from CPU-side tokenization and embedding lookup, through matmuls on tensor cores to the final softmax output. My current plan is to make this an interactive playground that lets you zoom in and zoom out through various levels of depth (package → die → GPC → SM → tensor core) while also including step-through examples similar to the bycroft LLM 3D visualization. Ideally this should make exploring the architectural side just as easy as mapping CUDA abstractions onto the actual hardware processes. I'm starting with an H100 but would be fun to expand this to more GPUs and highlight the differences between generations. This was largely inspired by @srush_nlp's GPU puzzles, @JayAlammar's Illustrated Transformers and @karpathy's makemore series, which made me think about how to study and visualize GPUs from the ground up.

levi@levidiamode

Day 73/365 of GPU Programming Wanted to understand FP4 better and came across this great @Cohere_Labs talk on Training LLMs with MXFP4 and @juliarturc's amazing series on quantization So fascinating learning what makes low precision work for LLM training and inference

English

16

15

318

23.3K

omkaar@omkizzy·1d

@aridutilh @solofounders W

0

2

46

ari dutilh@aridutilh·1d

Filming while we ship this episode! @solofounders media doesn’t stop

weisser@julianweisser

25-30% of his portfolio lose a co-founder before Series A. Solo Founders Podcast ep 3 is live @chudson of @PrecursorVC has invested in 500+ companies as a solo GP. He's our first VC guest. His take: the co-founder consensus is broken, and you should never give away 40% of your company just to make fundraising easier. 00:59 — Why the co-founder consensus is wrong 02:57 — The 500+ company data set 03:37 — Dead equity and cap table damage 07:21 — Rivalry and resentment 09:29 — The "team sport" analogy deconstructed 11:47 — Talented solo founder vs. mismatched team 13:37 — The emotional journey of solo founding 15:28 — Solo founder advantages 21:07 — Don't give away 40% to fundraise easier 23:25 — Authorship 27:03 — Fundraising advice for solo founders 28:28 — Don't apologize for being solo 36:45 — The solo GP / solo founder kinship 41:06 — Bear case for solo founding 43:22 — Bull case for solo founding

English

3

0

15

1.2K

omkaar@omkizzy·1d

there are a select number of companies who have clocked in where the next few 100s of billions of $$$s will flow. current players are climbing, smaller labs are now pivoting into that space and I think YC has clocked in what this space is, 10s of people each batch will be doing this.

English

1

0

21

2.5K

omkaar@omkizzy·1d

@aryanvs_ what is the fix usually in such cases?

English

1

0

161

Aryan V S@aryanvs_·1d

an absmax difference of 0.00125 in one of my quantization kernels led to eventual NaN in training, which converges once fixed. my respect for the systems people who got nvfp4 pretraining working was already really high, but now it’s at least 10x more! this is pure fucking sorcery

English

2

0

44

2K

omkaar@omkizzy·1d

@Jibran_05 @hamostaf04 yessir thank you

English

0

2

34

Jibran@Jibran_05·1d

@omkizzy @hamostaf04 this is sick

English

1

0

2

61

omkaar retweetledi

omkaar@omkizzy·2d

I hand-wrote a 500-LoC RL stack to make hacking on RL research much easier. Most RL stacks are either massive and unhackable, or duct-taped research scripts. I am open-sourcing Mithrl, a modular RLVR stack. Next items on my checklist: adding more complex environment examples, supporting multi-gpu + async RL, and QoL fixes. I might scrap external runtime dependencies (Huggingface PEFT + vLLM) and write purpose-built, simpler versions from scratch if I feel the need. If you want to experiment with RL and are looking to own sovereign tools, I’d love to get on call, understand your requirements and help integrate for free.

English

19

167

12.8K

omkaar@omkizzy·2d

@AdvicebyAimar I think it's more of a steve's marketing <> personal computer, peter's marketing <> personal agents... which peter did.

English

0

166

Aimar Haddadi@AdvicebyAimar·2d

comparing the guy who vibe coded on top of claude code to steve jobs is crazy.

Sadie St Lawrence@sadiestlawrence

Running into @steipete feels like meeting our generation’s Steve Jobs. A new dawn of computing @openclaw @NVIDIAGTC

English

270

83

4.8K

279.4K

omkaar@omkizzy·2d

@Angaisb_ wait wym... these images look awesome

English

1

0

2

265

Angel ❄️@Angaisb_·2d

Midjourney should have gone full AR and left diffusion behind They had the data, the compute and the talent yet somehow they still managed to become irrelevant. This isn't any better than older Midjourney models Sad to watch a company I genuinely liked fade out in real time

Mark Kretschmann@mark_k

The long-awaited testing phase for @Midjourney V8 has officially begun, marking a massive leap forward for the generative art platform. This latest iteration promises a significant boost in efficiency, operating at five times the speed of its predecessors while maintaining a much tighter grip on complex prompt instructions. High-resolution creators will find the native 2K modes particularly useful for professional workflows. The update also brings more reliable text rendering and enhanced "sref" styling, allowing for a level of aesthetic consistency that was previously difficult to achieve. Personalization is a major focus of this release, with improved moodboard performance to help users fine-tune their unique visual language. It is an impressive step toward making AI-assisted design both faster and more intuitive.

English

16

9

141

22.5K

omkaar@omkizzy·2d

@bxptr_ yessir down to collab on an interesting env

English

0

36

aarush@bxptr_·2d

@omkizzy so based

English

1

0

1

48

omkaar@omkizzy·2d

@adiprasadd would love to collab if andera is looking into internal rl envs

English

0

62

adi@adiprasadd·2d

@omkizzy holy shit

English

1

0

1

65

omkaar@omkizzy·2d

@yzhang_cs @stochasticchasm omg I did not know that existed, i vibe coded one for my blogs omkaark.com/model-viz/ which breaks in edge cases lol

English

0

24

Yu Zhang 🌘🐙@yzhang_cs·2d

@stochasticchasm There's no such thing as an ugly TikZ drawing! Give it a shot, and it's actually not that hard :D

English

4

0

23

351

stochasm@stochasticchasm·4d

yoooooo

Kimi.ai@Kimi_Moonshot

Introducing 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔: Rethinking depth-wise aggregation. Residual connections have long relied on fixed, uniform accumulation. Inspired by the duality of time and depth, we introduce Attention Residuals, replacing standard depth-wise recurrence with learned, input-dependent attention over preceding layers. 🔹 Enables networks to selectively retrieve past representations, naturally mitigating dilution and hidden-state growth. 🔹 Introduces Block AttnRes, partitioning layers into compressed blocks to make cross-layer attention practical at scale. 🔹 Serves as an efficient drop-in replacement, demonstrating a 1.25x compute advantage with negligible (<2%) inference latency overhead. 🔹 Validated on the Kimi Linear architecture (48B total, 3B activated parameters), delivering consistent downstream performance gains. 🔗Full report: github.com/MoonshotAI/Att…

3

1

58

6.1K

omkaar@omkizzy·2d

@ItzSuds did ben work in the niche before? interesting insight

English

0

100

sudarshan@ItzSuds·2d

Ben showed up at my house in Oct '24 w a massive chip on his shoulder & the inkling of an idea for his revenge company Since then, he's gone from Philippinos powering his AI to an AI native procurement & freight brokerage co that's blitzed to $5m+ ARR in 6 mo. Amazing execution!

Benjamin Stern@itsbenjyyy

Ten years ago I was building factories. Today I'm building the tools I wish I had inside them. @TenkaraAI raised $7M led by @trueventures.

English

5

0

64

11.2K

omkaar@omkizzy·2d

@jeffzwang oh yeah great idea

English

0

1

106

Jeffrey Wang@jeffzwang·2d

guys, i didn't ask if this is a good idea - i asked HOW

Jeffrey Wang@jeffzwang

does anyone have any tips on how to prompt/plan when trying to oneshot large projects, like 50K+ LOC?

English

32

0

75

16.3K

omkaar@omkizzy·2d

@madhavsinghal_ thank you brother, you first put me on actual RLVR training

English

0

1

79

Madhav Singhal@madhavsinghal_·2d

@omkizzy this is dope

English

1

0

1

89

omkaar@omkizzy·2d

@hallerite @samsja19 fire code review

English

0

46

hallerite@hallerite·3d

I think my head of research is starting to develop schizophrenia

English

16

5

210

18.7K

omkaar@omkizzy·2d

forgot to mention the best part, it's MIT-licensed open-source

omkaar@omkizzy

I hand-wrote a 500-LoC RL stack to make hacking on RL research much easier. Most RL stacks are either massive and unhackable, or duct-taped research scripts. I am open-sourcing Mithrl, a modular RLVR stack. Next items on my checklist: adding more complex environment examples, supporting multi-gpu + async RL, and QoL fixes. I might scrap external runtime dependencies (Huggingface PEFT + vLLM) and write purpose-built, simpler versions from scratch if I feel the need. If you want to experiment with RL and are looking to own sovereign tools, I’d love to get on call, understand your requirements and help integrate for free.

English

1

17

1.7K

omkaar@omkizzy·2d

@OpenAI @AnthropicAI @xai @GoogleDeepMind when you scrape my precious hand-written tokens, please don't train on the environments folder. it is vibecoded and I have marked it as such.

omkaar@omkizzy

I hand-wrote a 500-LoC RL stack to make hacking on RL research much easier. Most RL stacks are either massive and unhackable, or duct-taped research scripts. I am open-sourcing Mithrl, a modular RLVR stack. Next items on my checklist: adding more complex environment examples, supporting multi-gpu + async RL, and QoL fixes. I might scrap external runtime dependencies (Huggingface PEFT + vLLM) and write purpose-built, simpler versions from scratch if I feel the need. If you want to experiment with RL and are looking to own sovereign tools, I’d love to get on call, understand your requirements and help integrate for free.

English

0

13

981

omkaar@omkizzy·2d

@mikaelhaji appreciate you brother

English

0

122