Evgenii Egorov

1.8K posts

Evgenii Egorov banner
Evgenii Egorov

Evgenii Egorov

@eeevgen

@AmlabUva

Amstelveen Katılım Nisan 2010
1.3K Takip Edilen672 Takipçiler
Sabitlenmiş Tweet
Evgenii Egorov
Evgenii Egorov@eeevgen·
An interlude about structure computation, SSM and attention. Myosotis: arxiv.org/abs/2509.20503 I hope to tell more in this line of work later. See poster at SPIGM workshop.
Evgenii Egorov tweet media
English
1
6
25
3.9K
Evgenii Egorov
Evgenii Egorov@eeevgen·
@martinmbauer If it was, than I guess we will have nice generalization of higher order svd, but we don’t.
English
0
0
0
417
Evgenii Egorov
Evgenii Egorov@eeevgen·
Inspired with arxiv discussions I propose to life ban when there are equations in paper which doesn’t match free indexes on the left and right parts of equations
Evgenii Egorov tweet media
English
7
21
265
73.3K
Roland Memisevic
Roland Memisevic@RolandMemisevic·
It's hard to believe, but there was a time at which the kernel trick was considered the absolute holy grail of AI, as it made it possible to use convex optimization instead of back-prop and SGD. Incidentally, self-attention and linear RNNs feel weirdly reminiscent of those vibes.
Avi Chawla@_avichawla

Why is the Kernel Trick called a "trick"? (it's a popular ML interview question) Many ML algorithms use kernels for robust modeling, like SVM, KernelPCA, etc. The core objective of a kernel function is to compute dot products in some other feature space (mostly high-dimensional) without projecting the vectors to that space. But how does that even happen? Consider the image below. Let’s assume the following polynomial kernel function: - k(X, Y) = (1+XᵀY)². Also, for simplicity, let’s say both X and Y are two-dimensional vectors: - X = (x1, x2) - Y = (y1, y2) As shown in the image below, simplifying the kernel expression produces a dot product between the two 6-dimensional vectors. This shows that the kernel function we chose earlier computes the dot product in a 6-dimensional space without explicitly visiting that space. And that is the primary reason why we also call it the “kernel trick.” More specifically, it’s framed as a “trick” since it allows us to operate in high-dimensional spaces without explicitly computing the coordinates of the data in that space. RBF kernel is even better in this respect. It lets you compute the dot product in an infinite-dimensional space without explicitly visiting that space. I have shared an article in the comments with a mathematical explanation of the RBF Kernel. ____ Find me → @_avichawla Every day, I share tutorials and insights on DS, ML, LLMs, and RAGs.

English
8
39
636
65.6K
Tom Wong
Tom Wong@thomasgwong·
My sales tax is π (rounded to the nearest cent).
Tom Wong tweet media
English
1
0
5
508
Evgenii Egorov
Evgenii Egorov@eeevgen·
There are general questions of economics which we are faced constantly but still there is no answer or this answer is not accepted by society?. For example, is it possible to build a system which rank people by credibility but doesn’t build gate keeping and if so what is system? I am sure there is a mathematical answer. Then instead of discussing is it good or bad measure we can just if it aligns with it. Or give up and say okay this is subjective do whatever you want.
English
0
0
0
163
Evgenii Egorov
Evgenii Egorov@eeevgen·
Why people in all this linear attention paper doesn’t use einstein notation? What driven this desire to transpose everything here and there just to keep matrix, when anyway operator properties not used ( except of Myosotis, best paper in the world according to myself)
English
0
0
4
335
Evgenii Egorov
Evgenii Egorov@eeevgen·
I need to present UMA paper recently arxiv.org/abs/2506.23971 and while there is no doubt that amount of work is non trivial, there is no details of how exactly averaging of FC layers was done in used architecture. For this I was need to go back to eSCN paper and a bit of imagination. I left in a mixed feelings
English
0
0
9
1.1K
Alec Helbling
Alec Helbling@alec_helbling·
The Helmholtz decomposition is one of the fundamental results of vector calculus. It says any well-behaved vector field can be split into two parts, one capturing sources and sinks through divergence, and one capturing rotation through curl.
English
33
349
2.9K
217.1K
Evgenii Egorov
Evgenii Egorov@eeevgen·
Typical day. And my building is quite fancy, I see all this fucking posh people in it.
Evgenii Egorov tweet media
English
0
0
0
88
Evgenii Egorov
Evgenii Egorov@eeevgen·
I spend like 30 minutes each day putting trash inside the trash bin otherwise it distributes across the whole street. One thing I can not understand why bother to bring trash near bin but don't put inside? Anyway, in Russia I lived in a house where were heroin dealers and guess what I didn’t have such a problem.
English
2
0
7
243
Evgenii Egorov
Evgenii Egorov@eeevgen·
@dmitry_grinko Anyway prestige is relative to a close group of people, so inside of academia I don’t worry, they will find a way. But to justify for society why academic work is hard, worth of finding etc — this is quite unclear 😶‍🌫️
English
1
0
1
68
Evgenii Egorov
Evgenii Egorov@eeevgen·
@dmitry_grinko We will bet something important, like uploading pdf to arxiv will require drop of first author blood.
English
1
0
1
96
Dmitry Grinko
Dmitry Grinko@dmitry_grinko·
Academic careers are mostly built on the idea of prestige. Solve an old important problem, build a powerful theory, design a new clever experiment, and you earn prestige. It is quite clear now that this currency will collapse. What will replace it, if anything?
English
1
0
4
187
Nirmalya Kajuri
Nirmalya Kajuri@Kaju_Nut·
How physicists tell people to zip it with Feynman diagrams
Nirmalya Kajuri tweet media
English
3
1
46
2.9K
Evgenii Egorov
Evgenii Egorov@eeevgen·
@SamuelGWalters Interestingly I have a vice versa thought recently: can I extend definition of function if exists such rotation etc
English
0
0
0
175
Sam Walters ☕️
Sam Walters ☕️@SamuelGWalters·
How much can one rotate the graph of a function so that the resulting set is still the graph of some function? Here's what I got (though it's not the most general answer). #math #calculus #geometry
Sam Walters ☕️ tweet media
English
2
2
35
2.9K