A.G
236 posts

A.G
@AyGhriTweets
ML Research https://t.co/klG0EgFJSm
Boulder, CO Joined January 2026
33 Following 9 Followers
A.G retweeted

@bremen79 Consider zero-shot pruning or quantization of a NN: the objective is of the form tr((W-Z)^T H (W-Z))... so it is close to convex, but with non-convex constraints (binary entries in W or projection onto discrete values).
W is usually a large matrix, so an online approach might help.
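
A minimal sketch of the objective mentioned above, under assumptions that are not in the post: `H` is built as `X @ X.T` from hypothetical calibration inputs `X` (which makes it positive semidefinite, so the unconstrained objective is convex in `W`), and the discrete constraint is illustrated with plain nearest-value rounding onto a hypothetical grid. Rounding is only a baseline here, not a method proposed in the thread.

```python
import numpy as np

def quant_objective(W, Z, H):
    """Return tr((W - Z)^T H (W - Z)) for candidate W against reference Z."""
    D = W - Z
    return float(np.trace(D.T @ H @ D))

def round_to_grid(Z, grid):
    """Project each entry of Z onto the nearest value in a discrete grid."""
    grid = np.asarray(grid)
    idx = np.abs(Z[..., None] - grid).argmin(axis=-1)
    return grid[idx]

rng = np.random.default_rng(0)
X = rng.normal(size=(8, 32))      # hypothetical calibration inputs
H = X @ X.T                       # assumed PSD curvature matrix
Z = rng.normal(size=(8, 4))       # hypothetical dense weights
grid = np.linspace(-2.0, 2.0, 5)  # hypothetical quantization levels

W = round_to_grid(Z, grid)        # feasible point of the non-convex constraint set
loss = quant_objective(W, Z, H)   # convex quadratic evaluated at that point
```

The quadratic itself is easy; the difficulty sits entirely in the constraint that entries of `W` come from a discrete set, which is what makes the overall problem non-convex.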

My personal view is that this result is mainly interesting from a theoretical standpoint. To be fair, I feel the same about much of the current research on optimization for deep learning: the gap between the assumptions and the practical problem is often too large to draw any real conclusion.
The better approach would be to first construct a model that captures the structure of the real problem. Only then does solving that model become practically useful. It strikes me as designing an airplane while assuming it is spherical: it might still result in interesting theory, but it is unlikely to be practical.

New blog post: From Online Learning to Non-convex Non-smooth optimization
This is the last post in my series to show that online learning is more than just online learning.
This reduction is surprising, but the proof is simple 🙂
Feedback is welcome!
parameterfree.com/2026/04/06/fro…

At this point, the most unlikely outcome on @Polymarket is the most likely one in US politics.

Anirban’s new book perused by @JosephJacks_ ‘Sillicon’ apparent typo (for ‘Sillycon’), will revert to Silicon. Over 100 hand drawn illustrations by Anirban

@lumpialogic It's common in Muslim societies and especially in Shi'a preaching
x.com/MHozeh/status/…
مصطفى@MHozeh
Whoever comes to wage war on us should first read the story of Karbala.

It’s interesting how Iranian men openly cry in public during funerals. Masculinity that has no insecurity with showing emotions projects a very confident and balanced man.
Islam atrees 🇪🇬🇵🇸📚@AtreesIslam
Iranian Foreign Minister Abbas Araghchi weeps before the body of the Revolutionary Guard commander and his dear lifelong friend, the martyr Hossein Salami.

@JamesTate121 Abu Musa narrated:
"I visited the Prophet ﷺ with two men. One of them said, 'O Messenger of Allah, appoint us to a position of leadership,' and the other followed suit. The Prophet replied: 'Truly, we do not entrust this office to those who seek it, nor to those who crave it.'"

@bozavlado I got that. In this case (attention), the sqrt(d) scaling can change the function you target from convex to non-convex. It doesn't only affect the conditioning of the problem, but its convexity as well.

@AyGhriTweets It should not matter whether you do QK/sqrt(d) or do QK with Q and K having much smaller init. The training dynamics should be the same, but with current optimizers they are not.
And a better optimizer should pull you out of a sloppy init too.
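
A small sketch of the equivalence claimed in this reply, with hypothetical shapes and random data: dividing the logits by sqrt(d) gives the same function as folding a factor of d^(-1/4) into each of Q and K. The gradient with respect to the query parameters is not the same in the two parameterizations, which is one way to see why a plain gradient step can behave differently. The linear loss `sum(G * logits)` is an assumption chosen only to make the gradients easy to write by hand.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 5, 64                      # hypothetical sequence length and head dimension
Q = rng.normal(size=(n, d))
K = rng.normal(size=(n, d))

# Parameterization A: explicit 1/sqrt(d) scaling of the logits.
logits_scaled = (Q @ K.T) / np.sqrt(d)

# Parameterization B: fold d**(-1/4) into each factor ("smaller init").
s = d ** -0.25
Qs, Ks = s * Q, s * K
logits_folded = Qs @ Ks.T

max_gap = float(np.abs(logits_scaled - logits_folded).max())

# Gradients of the assumed loss sum(G * logits) w.r.t. the query parameters.
G = rng.normal(size=(n, n))       # hypothetical upstream gradient
grad_scaled = (G @ K) / np.sqrt(d)  # d loss / d Q  in parameterization A
grad_folded = G @ Ks                # d loss / d Qs in parameterization B
```

Here the forward pass agrees up to floating-point error, while `grad_folded` equals `d ** 0.25` times `grad_scaled`, so the same learning rate produces different updates in the two cases.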


@wildiris19 @amahury0 The point I'm trying to make is that when dealing with highly technical and computational claims, one needs to formulate concepts and steps rigorously. Handwavy arguments lead nowhere other than an endless loop of unsubstantiated claims.

@wildiris19 @amahury0 I see many leaps. Let's start with the basics: laws of physics are purely descriptive and don't explain anything. Have you seen a "proof" of the law of gravity?
We know it correlates perfectly with observed phenomena (so far), but that explains nothing. Subsequent claims fail accordingly.

Too late, 35 years too late.
Robert Rosen did it in 1991.
Prof. Lee Cronin@leecronin
Today I’m trying to write the framework that explains why biology is not Turing complete.
A.G retweeted

@wildiris19 @amahury0 "The computational theory of finite state automata is all you need to do everything that any living system can do." Is there a proof for this?

No, I meant Turing complete. I’m completely frustrated with constant references to “Turing completeness.” You are correct, no finite system can be Turing complete. Which means it’s a completely useless metric when approaching any finite computational system; whether biological or digital. I only wish people would just stop using the word and move on. Every computational system that can be physically built will be nothing more than finite state machines, random-access memory elements, and sufficient combinational logic to glue it all together. The computational theory of finite state automata is all you need to do everything that any living system can do. Sigh!
