Karthik Inbasekar

879 posts

Karthik Inbasekar

Karthik Inbasekar

@Karthik_Inb

I like to build things || Principal Researcher @moonmathai

Katılım Ocak 2022
516 Takip Edilen302 Takipçiler
Sabitlenmiş Tweet
Karthik Inbasekar
Karthik Inbasekar@Karthik_Inb·
Video AI benchmarks are broken. VBench requires 6,230 videos per model eval. Scores cluster near the ceiling. Yes-bias makes every model look good. Rankings don't match what humans prefer. We built WorldJen to fix this. 🧵
English
1
1
5
500
Karthik Inbasekar
Karthik Inbasekar@Karthik_Inb·
If you're building or evaluating generative video models, WorldJen is the benchmark you actually want. Human-preference-aligned. Scalable. Open. Built by @moonmathai
English
0
0
1
42
Karthik Inbasekar
Karthik Inbasekar@Karthik_Inb·
Video AI benchmarks are broken. VBench requires 6,230 videos per model eval. Scores cluster near the ceiling. Yes-bias makes every model look good. Rankings don't match what humans prefer. We built WorldJen to fix this. 🧵
English
1
1
5
500
Karthik Inbasekar retweetledi
Ynet Global
Ynet Global@ynetnews·
Farewell to Prof. Michael Rabin, godfather of Israeli computer science A pioneer of modern computing, Rabin shaped algorithms, cryptography and AI foundations, mentored generations and remains the only Israeli to wi... ynetnews.com/health_science…
Ynet Global tweet media
English
0
3
10
1.2K
Karthik Inbasekar retweetledi
MoonMath.ai
MoonMath.ai@moonmathai·
The community invested enormous efforts in optimizing attention, but the large `nn.Linear` layers that surround attention? Largely untouched! Introducing LiteLinear: a drop-in video DiT acceleration that compress nn.Linear layers via calibration-aware low-rank decomposition + quantization. Targets both FFN and attention projection linears (Q/K/V/O) without retraining We are releasing LiteLinear support for both @nvidia Hopper and @AMD Instinct, together with a proof of concept on @Lightricks LTX-2 FFN: 22.5% faster transformer compute 11.5% peak memory reduction 7.6% faster end-to-end inference Blog: moonmath.ai/posts/liteline… Code: github.com/moonmath-ai/Li…
English
0
2
7
658
Karthik Inbasekar retweetledi
MoonMath.ai
MoonMath.ai@moonmathai·
🧑‍🏭 LiteRunner 🧑‍🏭 MLOps-Style Tracking Without Touching the Code (New Tool) TL;DR: LiteRunner adds lightweight tracking to any CLI command without changing the model, saving params, outputs, and metrics locally and in W&B so every run stays reproducible and organized. Code (open source!): github.com/moonmath-ai/Li… Blog: moonmath.ai/posts/literunn… Contributions are welcome 🙌 More background: When running video generation experiments with diffusion models, the workflow quickly turns into bookkeeping. Every run starts with hand-editing long CLI commands, quoting paths, swapping flags manually, and each run produces a different combination of config, output videos, metrics, and debug data. Output files end up scattered across multiple folders and machines with no central record, sometimes even overwriting each other. Moving those files and recording runs becomes tedious, and inevitably the one run that wasn’t properly recorded turns out to be the one that matters. Revisiting an old experiment often means digging through notes just to figure out whether it used seed 10 or 42. When you own the code, you can wire in an MLOps tool to solve this. But often you’re just a user of someone else’s model, and modifying their source just to get proper tracking isn’t practical. That’s when the idea comes up: instead of changing the model code, bring MLOps-style logging to arbitrary CLI commands, so experiments can be tracked without touching the original implementation.
English
0
5
5
1.2K
Karthik Inbasekar retweetledi
Omer Shlomovits
Omer Shlomovits@OmerShlomovits·
Introducing BackLite: Attention Backpropagation Acceleration Using Dynamic Sparsity 👀 👊Blog post: moonmath.ai/posts/introduc… 👊Code (open source!) : github.com/moonmath-ai/Ba… 👊Integration example to nanochat: github.com/karpathy/nanoc… It is well known that the attention matrix is highly sparse. Several works have used this sparsity to speed up the forward pass. What if we could also use it to speed up the backward pass? BackLite is a novel algorithm designed to dynamically discover and exploit the sparsity inherent in attention to skip computation while mathematically approximating the gradients through the attention layer. Our idea: Simply track the sparsity in the attention matrix during the forward pass and use it to skip computation during the backward pass. Under the hood: 🌊 Uses the forward pass to track attention matrix tile weights at negligible overhead 🌊 Builds a mask by skipping tiles with cumulative weight less than a threshold 🌊 Skips masked tiles during backward 👉 Same forward, same model, fewer backward FLOPs Drop-in kernel replacement, tested on LLMs and video diffusion models, especially good for long sequence lengths 💪 Disclaimers: Image shows nanochat leaderboard *IF* @karpathy/ @OxyKodit will merge our PR. Yes, there's still much work to do on the code and tests to run. Contributions/questions are welcome.
Omer Shlomovits tweet media
English
3
9
23
2.7K
Karthik Inbasekar retweetledi
Israel Defense Forces
On this day, we honor and salute former Israeli Prime Minister and IDF Chief of the General Staff Yitzhak Rabin, who was brutally assassinated 30 years ago. Today, we commemorate his dedication to the defense and livelihood of the State of Israel. May his memory be a blessing 🕯️
Israel Defense Forces tweet media
English
220
499
3.4K
108.9K
Karthik Inbasekar retweetledi
ארץ נהדרת
ארץ נהדרת@Eretz_Nehederet·
לפני שמונה חודשים ביצענו כאן את השיר ״שמש״ בתפילה לשחרורו של אלון אהל - הערב סגרנו מעגל עם אלון, שחזר להיות איתנו כאן, מתחת לשמיים. לקמפיין מימון ההמונים של אלון אהל> charidy.com/alonohel
עברית
116
418
3K
202.1K