
Otsar
120 posts

Otsar
@godstar28
Father, Husband, Marketing man for @ingo_zk.




Introducing BackLite: Attention Backpropagation Acceleration Using Dynamic Sparsity 👀 👊Blog post: moonmath.ai/posts/introduc… 👊Code (open source!) : github.com/moonmath-ai/Ba… 👊Integration example to nanochat: github.com/karpathy/nanoc… It is well known that the attention matrix is highly sparse. Several works have used this sparsity to speed up the forward pass. What if we could also use it to speed up the backward pass? BackLite is a novel algorithm designed to dynamically discover and exploit the sparsity inherent in attention to skip computation while mathematically approximating the gradients through the attention layer. Our idea: Simply track the sparsity in the attention matrix during the forward pass and use it to skip computation during the backward pass. Under the hood: 🌊 Uses the forward pass to track attention matrix tile weights at negligible overhead 🌊 Builds a mask by skipping tiles with cumulative weight less than a threshold 🌊 Skips masked tiles during backward 👉 Same forward, same model, fewer backward FLOPs Drop-in kernel replacement, tested on LLMs and video diffusion models, especially good for long sequence lengths 💪 Disclaimers: Image shows nanochat leaderboard *IF* @karpathy/ @OxyKodit will merge our PR. Yes, there's still much work to do on the code and tests to run. Contributions/questions are welcome.








We're excited to announce the release of ICICLE v3! ICICLE is a library for cryptographers, designed to accelerate ZK provers. Our latest version includes a powerful CPU backend, extending ICICLE’s exceptional performance and user-friendly polynomial API to standard processors, enabling greater flexibility and accessibility. Full announcement: @ingonyama/icicle-v3-1c9cc2f94402" target="_blank" rel="nofollow noopener">medium.com/@ingonyama/ici…
Docs: dev.ingonyama.com/icicle/overview








