Christophe Roux
@chrisrx13

PhD Student in Optimization/ML, @ZuseInstitute and @TUBerlin

Joined April 2020
230 Following · 43 Followers

22 posts
Christophe Roux retweeted
Sebastian Pokutta @spokutta
For a decade it was open whether Frank-Wolfe's O(1/√ε) rate on strongly convex sets is tight. We show it is: Ω(1/√ε), even for a simple quadratic on a unit ball. With J. Halbey, D. Deza, @maxzimmerberlin, @chrisrx13, @b_stellato. 1/2
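For readers who want to poke at the setting, here is a minimal NumPy sketch of vanilla Frank-Wolfe on the Euclidean unit ball (a strongly convex set) with a simple quadratic. The objective, step size, and dimensions are illustrative assumptions; this is not the paper's lower-bound construction.

```python
import numpy as np

# Vanilla Frank-Wolfe over the Euclidean unit ball, minimizing
# f(x) = 0.5 * ||x - b||^2 with b outside the ball, so the optimum
# sits on the boundary. Toy setup, not the hard instance from the paper.

def frank_wolfe(b, steps=1000):
    x = np.zeros_like(b)
    for t in range(steps):
        grad = x - b
        s = -grad / np.linalg.norm(grad)   # LMO over the unit ball
        gamma = 2.0 / (t + 2.0)            # standard open-loop step size
        x = x + gamma * (s - x)
    return x

rng = np.random.default_rng(0)
b = rng.normal(size=50)
b = 2.0 * b / np.linalg.norm(b)            # optimizer is b / ||b||
x = frank_wolfe(b)
f_star = 0.5 * (np.linalg.norm(b) - 1.0) ** 2
print("primal gap:", 0.5 * np.linalg.norm(x - b) ** 2 - f_star)
```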
Christophe Roux retweeted
Ben Grimmer @prof_grimmer
There is (at least one) strange thing in accelerated convex optimization theory: Since the 80s, in unconstrained minimization by gradient methods, smoothness is known to allow a fast O(1/T^2) convergence rate by Nesterov. Nemirovski and Yudin give matching lower bounds.
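As a quick reference for the rate being discussed, here is a minimal sketch of Nesterov's accelerated gradient method for L-smooth convex minimization, which attains the O(1/T^2) rate; the quadratic test function below is an arbitrary stand-in.

```python
import numpy as np

# Nesterov's accelerated gradient method for L-smooth convex f,
# achieving O(1/T^2) versus O(1/T) for plain gradient descent.

def nesterov(grad_f, x0, L, steps):
    x, y, t = x0.copy(), x0.copy(), 1.0
    for _ in range(steps):
        x_next = y - grad_f(y) / L                        # gradient step at y
        t_next = (1.0 + np.sqrt(1.0 + 4.0 * t * t)) / 2.0
        y = x_next + ((t - 1.0) / t_next) * (x_next - x)  # momentum extrapolation
        x, t = x_next, t_next
    return x

# Toy ill-conditioned quadratic: f(x) = 0.5 x^T A x - b^T x, L = 1.
d = 100
A = np.diag(np.linspace(1e-3, 1.0, d))
b = np.ones(d)
x = nesterov(lambda z: A @ z - b, np.zeros(d), L=1.0, steps=500)
print("residual:", np.linalg.norm(A @ x - b))
```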
Christophe Roux retweeted
Louis Schiekiera @LJS_Berlin
🚨 New preprint available! 🚨 We test how much of an LLM's internal semantic geometry can be recovered from behavior alone. Across 8 LLMs and 17.5M trials, forced-choice tasks align with hidden-state structure much better than free association. Preprint: arxiv.org/pdf/2602.00628
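One simple way to frame such a comparison (purely my sketch; the preprint's actual protocol across 8 LLMs and 17.5M trials is far richer) is representational similarity analysis: correlate a pairwise similarity matrix derived from behavior with one derived from hidden states.

```python
import numpy as np
from scipy.stats import spearmanr

# RSA-style alignment: Spearman correlation between the upper triangles of
# a behavioral similarity matrix and a hidden-state cosine-similarity matrix.
# All data below is synthetic; this is not the preprint's pipeline.

def rsa_alignment(behavior_sim, hidden_states):
    h = hidden_states / np.linalg.norm(hidden_states, axis=1, keepdims=True)
    hidden_sim = h @ h.T
    iu = np.triu_indices_from(hidden_sim, k=1)
    return spearmanr(behavior_sim[iu], hidden_sim[iu]).correlation

rng = np.random.default_rng(0)
states = rng.normal(size=(30, 64))                      # stand-in hidden states
noisy = states + rng.normal(scale=2.0, size=(30, 64))   # noisy behavioral proxy
nb = noisy / np.linalg.norm(noisy, axis=1, keepdims=True)
print(rsa_alignment(nb @ nb.T, states))
```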
Christophe Roux retweeted
Berkant Turan @BerkantTuran_
Join our poster session *today* at #ICML2024 in the TF2M Workshop. Looking forward to the inspiring discussions.
Quoting Berkant Turan @BerkantTuran_:

Excited to be at #ICML2024! Grzegorz Gluch, @SaiGaneshNagar1, and I will present our paper "Unified Taxonomy in AI Safety: Watermarks, Adversarial Defenses, and Transferable Attacks" at the Workshop on Theoretical Foundations of Foundation Models (TF2M). DM me to grab a coffee!☕️

Christophe Roux retweeted
Max Zimmer @maxzimmerberlin
A good time to share our #ICLR2023 paper: How I Learned to Stop Worrying and Love Retraining We explore sparsity-adaptive LR schedules and show that with proper LR care, simple pruning can outperform complex methods that 'learn' the sparsity. 📜 arxiv.org/abs/2111.00843 🧵1/n
Quoting Lucas Beyer (bl16) @giffmana:

I'm not really an expert on sparsity, but I enjoy using this template, and reminding about learning-rate, whenever I can. So I will:

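A bare-bones illustration of the prune-then-retrain loop with explicit learning-rate care (here a fresh cosine schedule after pruning); this is my toy reading of the theme, not the paper's exact schedule or pruning criterion.

```python
import torch

# Global magnitude pruning followed by retraining with a fresh LR schedule.
# The mask is reapplied after every step so pruned weights stay at zero.

def magnitude_prune(model, sparsity=0.9):
    scores = torch.cat([p.detach().abs().flatten() for p in model.parameters()])
    thresh = torch.quantile(scores, sparsity)
    masks = [(p.detach().abs() > thresh).float() for p in model.parameters()]
    with torch.no_grad():
        for p, m in zip(model.parameters(), masks):
            p.mul_(m)
    return masks

def retrain(model, masks, loader, epochs=10, lr=0.1):
    opt = torch.optim.SGD(model.parameters(), lr=lr, momentum=0.9)
    sched = torch.optim.lr_scheduler.CosineAnnealingLR(opt, T_max=epochs)
    for _ in range(epochs):
        for x, y in loader:
            opt.zero_grad()
            torch.nn.functional.cross_entropy(model(x), y).backward()
            opt.step()
            with torch.no_grad():          # re-zero the pruned weights
                for p, m in zip(model.parameters(), masks):
                    p.mul_(m)
        sched.step()
```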
Christophe Roux @chrisrx13
Removing the assumption of bounded iterates uncovers a complex landscape of tradeoffs between oracle complexity, bounds on D, efficient computability of updates, and whether prior knowledge of the initial distance to the optimizer is needed. 7/8
Christophe Roux @chrisrx13
In our paper "Convergence and Trade-Offs in Riemannian Gradient Descent and Riemannian Proximal Point" at #ICML2024, we examine a blind spot in the Riemannian opt literature: Most works simply 𝘢𝘴𝘴𝘶𝘮𝘦 that the iterates stay in a bounded set. This is a problem because 1/8
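To make the setting concrete, here is a minimal Riemannian gradient descent sketch on the unit sphere (the objective and step size are my assumptions, not from the paper); whether such iterates remain in a bounded region is precisely the assumption the thread questions.

```python
import numpy as np

# Riemannian gradient descent on the unit sphere via the exponential map.
# Toy objective: f(x) = x^T A x on S^{d-1}, minimized by the eigenvector
# of A with smallest eigenvalue.

def exp_map(x, v):
    # Exponential map on the sphere: move from x along tangent vector v.
    n = np.linalg.norm(v)
    if n < 1e-12:
        return x
    return np.cos(n) * x + np.sin(n) * (v / n)

def riemannian_gd(A, x0, lr=0.1, steps=500):
    x = x0 / np.linalg.norm(x0)
    for _ in range(steps):
        egrad = 2.0 * A @ x                  # Euclidean gradient
        rgrad = egrad - (x @ egrad) * x      # project onto tangent space at x
        x = exp_map(x, -lr * rgrad)
    return x

rng = np.random.default_rng(0)
M = rng.normal(size=(20, 20))
A = M @ M.T
A /= np.linalg.norm(A, 2)                    # normalize spectrum to [0, 1]
x = riemannian_gd(A, rng.normal(size=20))
print("f(x) =", x @ A @ x, "vs lambda_min =", np.linalg.eigvalsh(A)[0])
```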
Christophe Roux retweeted
Max Zimmer @maxzimmerberlin
🌟 Join our Team in Berlin 🌟 We are seeking highly motivated PhD students to work on (efficient) Deep Learning, preferably with strong math/CS background and PyTorch experience. Happy to answer questions here, via DM or at #icml2024! Apply at iol.zib.de/openings! Please RT
Christophe Roux retweeted
Max Zimmer @maxzimmerberlin
On my way to Vienna for #ICLR2024 with our paper "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging". We address the challenge of creating Model Soups from Sparse Neural Networks while preserving their sparsity patterns! arXiv: arxiv.org/abs/2306.16788 🧵1/n
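In code, the core idea might look like this hypothetical helper: average checkpoints that share one sparsity mask, then reapply the mask so the soup keeps the sparsity pattern (my simplified reading; see arxiv.org/abs/2306.16788 for the actual recipe).

```python
import torch

# Hypothetical "sparse soup" helper: average several fine-tuned checkpoints
# that were pruned with the same mask, then reapply the mask so the averaged
# model preserves the shared sparsity pattern.

def sparse_soup(state_dicts, masks):
    soup = {}
    for name in state_dicts[0]:
        avg = torch.stack([sd[name].float() for sd in state_dicts]).mean(dim=0)
        soup[name] = avg * masks[name] if name in masks else avg
    return soup
```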
Yen-Huan Li @yenhuan_li
Convergence and Trade-Offs in Riemannian Gradient Descent and Riemannian Proximal Point
arxiv.org/abs/2403.10429
Reverse em-problem based on Bregman divergence and its application to classical and quantum information theory
arxiv.org/abs/2403.09252 (2/n)