Matteo Tiezzi

257 posts

Matteo Tiezzi banner
Matteo Tiezzi

Matteo Tiezzi

@TiezziMatteo

PostDoctoral Researcher @Pavis_iit @IITalk | Continual/Lifelong Learning and Graph Representation Learning | @CoLLAs_conf Local Chair

Genova, Italy Katılım Mayıs 2020
465 Takip Edilen292 Takipçiler
Sabitlenmiş Tweet
Matteo Tiezzi
Matteo Tiezzi@TiezziMatteo·
🧐Interested in the details of the latest SOTA architectures (#RWKV , #RetNet, #Mamba, #Griffin) for long sequences? 📢 "State-space Modeling in Long Sequence Processing: A Survey on Recurrence in the Transformer Era" arxiv.org/abs/2406.09062
English
1
8
12
1.1K
Matteo Tiezzi
Matteo Tiezzi@TiezziMatteo·
📣"State-space Modeling in Long Sequence Processing: A Survey on Recurrence in the Transformer Era" is available open-access in Elsevier Neural Networks! Updated version with insights on #Mamba2, #RWKV 5/6/7, #DeltaNet, #GLA etc. 📜Check it out: sciencedirect.com/science/articl…
Matteo Tiezzi@TiezziMatteo

🧐Interested in the details of the latest SOTA architectures (#RWKV , #RetNet, #Mamba, #Griffin) for long sequences? 📢 "State-space Modeling in Long Sequence Processing: A Survey on Recurrence in the Transformer Era" arxiv.org/abs/2406.09062

English
0
0
2
134
Matteo Tiezzi retweetledi
Riccardo Grazzi
Riccardo Grazzi@riccardograzzi·
@julien_siems @leloykun @jyo_pari In our DeltaProduct work we also add a bit of theory to DeltaNet, showing that it can solve Dihedral groups, which are the groups of symmetries of regular polygons, with only two layers. This includes S3 (symmetries of the equilateral triangle).
Riccardo Grazzi tweet media
English
1
5
22
2.3K
Matteo Tiezzi retweetledi
Julien Siems @ ICLR 2026
Julien Siems @ ICLR 2026@julien_siems·
1/9 There is a fundamental tradeoff between parallelizability and expressivity of Large Language Models. We propose a new linear RNN architecture, DeltaProduct, that can effectively navigate this tradeoff. Here's how!
Julien Siems @ ICLR 2026 tweet media
English
4
36
189
35K
Matteo Tiezzi retweetledi
Andrea Cossu
Andrea Cossu@Cossu94·
Art, nature and Artificial Intelligence at LoT Spring school which just started. With 50+ students, we are fully booked and eager to learn (over time). @StefanoMelacci @v_lomonaco @TiezziMatteo Alessandro Betti, Marco Gori
Andrea Cossu tweet mediaAndrea Cossu tweet mediaAndrea Cossu tweet mediaAndrea Cossu tweet media
English
0
3
4
394
Matteo Tiezzi retweetledi
Sarath Chandar
Sarath Chandar@apsarathchandar·
If you want to achieve AGI, you need to solve continual learning! @CoLLAs_Conf is the premier venue for continual learning research! Check this blog post to read more about CoLLAs 2025!
CoLLAs 2026@CoLLAs_Conf

🚀 Why Lifelong Learning Matters 🚀 Modern ML systems struggle in non-stationary environments, while humans adapt seamlessly. How do we bridge this gap? 📖 Read our latest blog on the vision behind #CoLLAs2025 and the future of lifelong learning research: 🔗 lifelong-ml.cc/blogs/1 #MachineLearning #ContinualLearning #AI #LifelongLearning

English
0
3
21
1.9K
Matteo Tiezzi retweetledi
Ali Behrouz
Ali Behrouz@behrouz_ali·
Attention has been the key component for most advances in LLMs, but it can’t scale to long context. Does this mean we need to find an alternative? Presenting Titans: a new architecture with attention and a meta in-context memory that learns how to memorize at test time. Titans are more effective than Transformers and modern linear RNNs, and can effectively scale to larger than 2M context window, with better performance than ultra-large models (e.g., GPT4, Llama3-80B).
Ali Behrouz tweet media
English
82
573
3.4K
639.3K
Matteo Tiezzi
Matteo Tiezzi@TiezziMatteo·
🚀 Learning Over Time (LOT) Spring School 📅 24-27 March 2025 | 📍 Siena, Italy 💡 Are you considering models that continuously adapt over time instead of learning "offline" from pre-designed-huge collections of data? 🚀 🔗sites.google.com/unisi.it/lot-s…
English
1
1
4
165