

someone analyzed all 5000+ accepted papers at ICLR 2026, and it's a good signal who's pushing the research of AI:
> China has surpassed the US with 43.7% of the papers
> Europe's contribution is surprisingly small (5.3% including UK)
Konstantin Dobler ✈️ ICLR

@konstantdobler
ELLIS PhD student @hpi_de, prev intern @apple @instadeepai @sap | Multilingual LLMs, tokenization, embeddings







@Hesamation Version using the same unique counting scheme (i.e. each institution is counted once per paper even if there's 50 institutions on a single paper). China's share grows. Maybe more cross-institution collaboration or larger projects involving many labs + industry?
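A minimal sketch of that unique counting scheme, assuming a hypothetical input where each paper is a list of (institution, country) affiliations. The toy data and the `country_shares` helper are made up for illustration and are not the original analysis code:

```python
from collections import Counter

# Hypothetical toy data: one inner list per paper, one
# (institution, country) pair per author affiliation.
papers = [
    [("Univ A", "CN"), ("Univ A", "CN"), ("Lab B", "CN"), ("Univ C", "US")],
    [("Univ C", "US"), ("Univ C", "US")],
    [("Univ D", "DE"), ("Lab B", "CN")],
]

def country_shares(papers):
    """Count each institution once per paper, then aggregate by country."""
    counts = Counter()
    for affiliations in papers:
        # set() drops repeated affiliations within one paper, so several
        # authors from the same institution add only one count.
        for _institution, country in set(affiliations):
            counts[country] += 1
    total = sum(counts.values())
    return {country: n / total for country, n in counts.items()}

shares = country_shares(papers)
```

Under this scheme a paper with many distinct institutions adds one count per institution, so heavy cross-institution collaboration raises a country's share.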












Add tokens to an LLM without retraining the whole model. We introduce Token Distillation: attention-aware input embeddings for new tokens that match the model’s original behavior. How does it work? Check out the thread!


here's @JeffDean talking about how labs will do multi-epoch pretraining with heavy regularization to keep scaling even with limited data. no wonder slowrun gets so much attention from pretraining teams at big labs. pretraining is about to look very very different.



