omkaar

3K posts

omkaar banner
omkaar

omkaar

@omkizzy

https://t.co/rV8tAb3oHX, codice est debitum | ex @uwaterloo feedback: https://t.co/nx72nVrMzj

Katılım Temmuz 2015
1.5K Takip Edilen4.3K Takipçiler
Sabitlenmiş Tweet
omkaar
omkaar@omkizzy·
I hand-wrote a 500-LoC RL stack to make hacking on RL research much easier. Most RL stacks are either massive and unhackable, or duct-taped research scripts. I am open-sourcing Mithrl, a modular RLVR stack. Next items on my checklist: adding more complex environment examples, supporting multi-gpu + async RL, and QoL fixes. I might scrap external runtime dependencies (Huggingface PEFT + vLLM) and write purpose-built, simpler versions from scratch if I feel the need. If you want to experiment with RL and are looking to own sovereign tools, I’d love to get on call, understand your requirements and help integrate for free.
English
19
19
167
12.8K
levi
levi@levidiamode·
Day 74/365 of GPU Programming I always found die shots and SM diagrams beautiful but difficult to map mentally, so I've been trying to find a way to interact with GPUs in 3D. This is what I have so far: a single input that goes through a simplified H100 execution pipeline to see what the silicon is doing at each step; from CPU-side tokenization and embedding lookup, through matmuls on tensor cores to the final softmax output. My current plan is to make this an interactive playground that lets you zoom in and zoom out through various levels of depth (package → die → GPC → SM → tensor core) while also including step-through examples similar to the bycroft LLM 3D visualization. Ideally this should make exploring the architectural side just as easy as mapping CUDA abstractions onto the actual hardware processes. I'm starting with an H100 but would be fun to expand this to more GPUs and highlight the differences between generations. This was largely inspired by @srush_nlp's GPU puzzles, @JayAlammar's Illustrated Transformers and @karpathy's makemore series, which made me think about how to study and visualize GPUs from the ground up.
levi@levidiamode

Day 73/365 of GPU Programming Wanted to understand FP4 better and came across this great @Cohere_Labs talk on Training LLMs with MXFP4 and @juliarturc's amazing series on quantization So fascinating learning what makes low precision work for LLM training and inference

English
16
15
318
23.3K
ari dutilh
ari dutilh@aridutilh·
Filming while we ship this episode! @solofounders media doesn’t stop
ari dutilh tweet media
weisser@julianweisser

25-30% of his portfolio lose a co-founder before Series A. Solo Founders Podcast ep 3 is live @chudson of @PrecursorVC has invested in 500+ companies as a solo GP. He's our first VC guest. His take: the co-founder consensus is broken, and you should never give away 40% of your company just to make fundraising easier. 00:59 — Why the co-founder consensus is wrong 02:57 — The 500+ company data set 03:37 — Dead equity and cap table damage 07:21 — Rivalry and resentment 09:29 — The "team sport" analogy deconstructed 11:47 — Talented solo founder vs. mismatched team 13:37 — The emotional journey of solo founding 15:28 — Solo founder advantages 21:07 — Don't give away 40% to fundraise easier 23:25 — Authorship 27:03 — Fundraising advice for solo founders 28:28 — Don't apologize for being solo 36:45 — The solo GP / solo founder kinship 41:06 — Bear case for solo founding 43:22 — Bull case for solo founding

English
3
0
15
1.2K
omkaar
omkaar@omkizzy·
there are a select number of companies who have clocked in where the next few 100s of billions of $$$s will flow. current players are climbing, smaller labs are now pivoting into that space and I think YC has clocked in what this space is, 10s of people each batch will be doing this.
English
1
0
21
2.5K
omkaar
omkaar@omkizzy·
@aryanvs_ what is the fix usually in such cases?
English
1
0
0
161
Aryan V S
Aryan V S@aryanvs_·
an absmax difference of 0.00125 in one of my quantization kernels led to eventual NaN in training, which converges once fixed. my respect for the systems people who got nvfp4 pretraining working was already really high, but now it’s at least 10x more! this is pure fucking sorcery
English
2
0
44
2K
omkaar retweetledi
omkaar
omkaar@omkizzy·
I hand-wrote a 500-LoC RL stack to make hacking on RL research much easier. Most RL stacks are either massive and unhackable, or duct-taped research scripts. I am open-sourcing Mithrl, a modular RLVR stack. Next items on my checklist: adding more complex environment examples, supporting multi-gpu + async RL, and QoL fixes. I might scrap external runtime dependencies (Huggingface PEFT + vLLM) and write purpose-built, simpler versions from scratch if I feel the need. If you want to experiment with RL and are looking to own sovereign tools, I’d love to get on call, understand your requirements and help integrate for free.
English
19
19
167
12.8K
omkaar
omkaar@omkizzy·
@AdvicebyAimar I think it's more of a steve's marketing <> personal computer, peter's marketing <> personal agents... which peter did.
English
0
0
0
166
omkaar
omkaar@omkizzy·
@Angaisb_ wait wym... these images look awesome
English
1
0
2
265
Angel ❄️
Angel ❄️@Angaisb_·
Midjourney should have gone full AR and left diffusion behind They had the data, the compute and the talent yet somehow they still managed to become irrelevant. This isn't any better than older Midjourney models Sad to watch a company I genuinely liked fade out in real time
Angel ❄️ tweet media
Mark Kretschmann@mark_k

The long-awaited testing phase for @Midjourney V8 has officially begun, marking a massive leap forward for the generative art platform. This latest iteration promises a significant boost in efficiency, operating at five times the speed of its predecessors while maintaining a much tighter grip on complex prompt instructions. High-resolution creators will find the native 2K modes particularly useful for professional workflows. The update also brings more reliable text rendering and enhanced "sref" styling, allowing for a level of aesthetic consistency that was previously difficult to achieve. Personalization is a major focus of this release, with improved moodboard performance to help users fine-tune their unique visual language. It is an impressive step toward making AI-assisted design both faster and more intuitive.

English
16
9
141
22.5K
omkaar
omkaar@omkizzy·
@bxptr_ yessir down to collab on an interesting env
English
0
0
0
36
omkaar
omkaar@omkizzy·
@adiprasadd would love to collab if andera is looking into internal rl envs
English
0
0
0
62
omkaar
omkaar@omkizzy·
@ItzSuds did ben work in the niche before? interesting insight
English
0
0
0
100
omkaar
omkaar@omkizzy·
@madhavsinghal_ thank you brother, you first put me on actual RLVR training
English
0
0
1
79
hallerite
hallerite@hallerite·
I think my head of research is starting to develop schizophrenia
hallerite tweet mediahallerite tweet mediahallerite tweet mediahallerite tweet media
English
16
5
210
18.7K