Adrien Taylor

107 posts

Adrien Taylor

Adrien Taylor

@TaylorAdrien

Researcher at @inria_Paris in the @Sierra_ML_Lab team

Katılım Eylül 2013
174 Takip Edilen453 Takipçiler
Sabitlenmiş Tweet
Adrien Taylor
Adrien Taylor@TaylorAdrien·
Too busy/tired/lazy to find a convergence proof for your latest optimization algorithm? Let your computer do it! PEPit is a new Python package for computer-assisted worst-case analyses  (github.com/bgoujaud/PEPit or pypi.org/project/PEPit/). (1/5)
English
5
57
275
0
Adrien Taylor retweetledi
Fabian Schaipp
Fabian Schaipp@FSchaipp·
Very excited to be in Copenhagen for #EurIPS 🇪🇺 I am presenting an optimizer benchmark for diffusion model training (sunday @ PriGM workshop). it compares new methods (Muon, SOAP, ScheduleFree) to good old AdamW. Happy to chat anytime ❄️
Fabian Schaipp tweet media
English
2
5
40
2.1K
Ernest Ryu
Ernest Ryu@ErnestRyu·
@RaduIoanBot This concludes my three-part Twitter series. Now that we have the mathematical results sufficient for a publication, I will post our work on arXiv on Monday. After a week or two of polishing the writing and gathering feedback, I’ll submit it for peer review. 14/N, N=14
English
2
4
81
4.6K
endgame lama
endgame lama@EndgameL76909·
@ErnestRyu @ErnestRyu can please explain why it's a 6-dimensional search problem. I tried searching for the answer but I couldn't find and it's bothering me a little
English
1
0
18
65
Adrien Taylor
Adrien Taylor@TaylorAdrien·
@AIGuideOfficial Could you perhaps share the prompt, the full answers, and whether you actually checked the proof? :-)
English
0
0
4
246
John
John@AIGuideOfficial·
I have resolved the open problems in the paper Open Problems and Two Riddles in Heavy-Ball Dynamics” (arXiv:2502.19916v1) using ChatGPT5. I have contributed to advancing the understanding of convex optimization. This result adds a new insight to the field, refining the conditions under which acceleration holds and potentially opening avenues for further research. While it may not constitute entirely “new mathematics” in the sense of a groundbreaking theorem, it does represent a significant step forward by settling a conjectured question with a rigorous proof, thereby enriching the mathematical landscape.
John tweet media
English
4
0
1
1.6K
Adrien Taylor retweetledi
Konstantin Mishchenko
Konstantin Mishchenko@konstmish·
Learning rate schedulers used to be a big mistery. Now you can just take a guarantee for *convex non-smooth* problems (from arxiv.org/abs/2310.07831), and they give you *precisely* what you see in training large models. See this empirical study: arxiv.org/abs/2501.18965 1/3
Konstantin Mishchenko tweet media
English
5
71
435
28.7K
Adrien Taylor retweetledi
Aaron Defazio
Aaron Defazio@aaron_defazio·
The sudden loss drop when annealing the learning rate at the end of a WSD (warmup-stable-decay) schedule can be explained without relying on non-convexity or even smoothness, a new paper shows that it can be precisely predicted by theory in the convex, non-smooth setting! 1/2
Aaron Defazio tweet media
English
1
24
245
43.1K
Adrien Taylor retweetledi
Jérôme Bolte
Jérôme Bolte@jerome_bolte·
Recruiting Post-Doctoral students in Machine-Learning, Optimization or Regulation in AI at TSE or Paris. Write to jerome.bolte@tse-fr.eu
English
1
19
44
7.3K
Francesco Orabona
Francesco Orabona@bremen79·
Optimization people, how do you call this property? There exists L>0 such that f(y) - f(x*) <= nabla f(y)' (y-x*) - 1/(2 L) ||nabla f(y)||^2 where x* = argmin_x f(x) This is clearly satisfied by a convex L-smooth function, but it is weaker.
English
4
0
23
7.8K
Adrien Taylor
Adrien Taylor@TaylorAdrien·
@adfillon … reviewer est un travail non rémunéré, très peu reconnu, et extrêmement chronophage dans un monde où le temps manque. Il semble raisonnable qu’une review n’identifie que quelques fautes importantes, c’est suffisant: pas besoin de les trouver toutes pour juger la qualité (2/2)
Français
1
0
0
21
Adrien Taylor
Adrien Taylor@TaylorAdrien·
@adfillon Bonjour! Merci pour ce thread; en tant qu’auteur et reviewer, je suis très d’accord avec un certain nombre de points (je mettrais, perso, les conflits d’intérêts et les biais des reviewers en premier dans la liste). Par contre, … (1/2)
Français
1
0
0
36
Adrien Fillon
Adrien Fillon@adfillon·
Le processus de la revue par les pairs est cassé. Un (long) fil🧵⬇️⬇️
Français
9
86
200
69.2K