Alberto Cazzaniga @ NeurIPS2025

45 posts

@albecazzaniga

Geometry and Deep Learning @areasciencepark

Trieste · Joined March 2013
211 Following · 122 Followers
Alberto Cazzaniga @ NeurIPS2025 (@albecazzaniga)
Come find us at NeurIPS Poster Session 6 #1114 4PM - 7PM to hear how multimodal models develop localized image-text communication in their internal representations. #NeurIPS2025 @alexserra1998 @francescortu @lorebasile @DiegoDoimo
Alessandro Pietro Serra (@aleserra1998)

Excited to present "The Narrow Gate: Localized Image-Text Communication in Native Multimodal Models" at #NeurIPS2025 and #EurIPS2025! 📍NeurIPS: Poster Session 6 #1114 (Fri Dec 5 4pm-7pm) 📍EurIPS: Poster Session D3 #88 (Wed Dec 3 10:30am-12:30pm) 🧵👇

Alberto Cazzaniga @ NeurIPS2025 (@albecazzaniga)
Pop by to find out how we can detect head specialization in generative multimodal models, and how we can use it to improve them! @lorebasile @ValeMaiorca @DiegoDoimo @FrancescoLocat8
Valentino Maiorca (@ValeMaiorca)

Interested in the semantic specialisation of attention heads in generative models (LLMs/VLMs)? Stop by our spotlight “Head Pursuit” poster at #NeurIPS2025 ! 🗓️ Dec 3 • 11am–2pm PST 📍 Exhibit Hall C/D/E #1013

Alberto Cazzaniga @ NeurIPS2025 reposted
Alessandro Pietro Serra (@aleserra1998)
Excited to present "The Narrow Gate: Localized Image-Text Communication in Native Multimodal Models" at #NeurIPS2025 and #EurIPS2025! 📍NeurIPS: Poster Session 6 #1114 (Fri Dec 5 4pm-7pm) 📍EurIPS: Poster Session D3 #88 (Wed Dec 3 10:30am-12:30pm) 🧵👇
Alberto Cazzaniga @ NeurIPS2025 reposted
Lorenzo Basile (@lorebasile)
Just landed in San Diego to present "Head Pursuit: Probing Attention Specialization in Multimodal Transformers", our spotlight paper @NeurIPSConf! Don't miss poster #1013 on Wednesday, Dec 3 at 11AM. [1/6]
Alberto Cazzaniga @ NeurIPS2025 (@albecazzaniga)
Looking forward to welcoming (virtually :-)) @GiorgosNik02 and @tommaso_mncttn from @EPFL and @GladiaLab at our Laboratory of Data Engineering @AreaSciencePark
Area Science Park (@AreaSciencePark)

🤖Seminar Series Next week, Giorgos Nikolaou and Tommaso Mencattini (@EPFL) will present "Language Models are Injective and Hence Invertible". @GiorgosNik02 @tommaso_mncttn 📅Nov 27, 11 CET (Online) 👉Register to attend: bit.ly/4ieKRYq

Siva Reddy (@sivareddyg)
Luke Zettlemoyer (@LukeZettlemoyer) plenary talk on scalable architectures for multimodal language modeling #COLM2025

Chameleon: autoregressive multimodal language models
-- treat image as tokens
-- works but harder to scale
-- modality gap seems to be a big problem

Transfusion
-- integrate diffusion models into transformer models
-- still a lot of interaction in parameter space
-- images are generated using diffusion
-- easier to scale
-- impressive image generation that preserves faithfulness across multiple generations

Mixture-of-Transformers
-- separate transformers for each modality
-- self-attention spans across transformers
-- scaling behavior seems smoother
Alberto Cazzaniga @ NeurIPS2025 (@albecazzaniga)
💡 Our interests include (but are not limited to):
- AI for biological systems
- Reasoning and planning
- Interpretability
📍 Location: Trieste, Italy
🚨 🚨 🚨 Deadline: June 6th, 2025
Alberto Cazzaniga @ NeurIPS2025 (@albecazzaniga)
🔥 Two PhD positions open @UniTrieste funded by @AreaSciencePark! 🔥 Join the Laboratory of Data Engineering to advance research in AI and its scientific applications. We’re looking for motivated students ready to dive into interdisciplinary research in deep learning and AI.
Alberto Cazzaniga @ NeurIPS2025 (@albecazzaniga)
Really excited to share our latest interpretability work on multimodal models! The communication between image and text is localised in a single token in multimodal-output vision-language models. Paper: arxiv.org/html/2412.0664… Happy to discuss it at #NeurIPS2024 More below 👇
francesco ortu (@francescortu)

🚨🚨 Excited to share our latest paper, now on @arxiv! 🖼️ We studied how unified VLMs, trained to generate both text and images (e.g., @MetaAI's Chameleon), exchange information between modalities, comparing them to standard VLMs. Deep dive:👇

Alberto Cazzaniga @ NeurIPS2025 (@albecazzaniga)
Presenting our work on "The representation landscape of few-shot learning and fine-tuning in large language models" @NeurIPSConf Poster Session East 1 Wednesday 1 h. 11-14 #3303 Great work with @DiegoDoimo @aleserra1998 @ansuin @AreaSciencePark More below
Diego (@DiegoDoimo)

I just landed in Vancouver to present @NeurIPSConf the findings of our new work! Few-shot learning and fine-tuning change the hidden layers of LLMs in a dramatically different way, even when they perform equally well on multiple-choice question-answering tasks. 🧵1/6

Alberto Cazzaniga @ NeurIPS2025 (@albecazzaniga)
If you missed @francescortu and @ZhijingJin's presentation at #ACL2024 today, you can still catch up on our work on the Competition of Mechanisms in LLMs at the YouTube or arXiv links below :-)
francesco ortu (@francescortu)

Excited to be in Bangkok for my first #ACL2024! 🇹🇭 If you're interested in LLM interpretability, join @ZhijingJin and me to discuss our recent work on the Competition of Mechanisms in LLMs. We'll be presenting at Poster Session N.2 this afternoon from 2:00 to 3:30 pm. 👋

Alberto Cazzaniga @ NeurIPS2025 reposted
Giuditta De Lorenzo (@serdelor)
📢Are you a scientist whose work can be applied to reach new heights in the study of pathogens? 👩‍🔬👨‍💻 The International Conference on Pandemic Preparedness is your unmissable chance to connect in a multidisciplinary network 🤝 pathogen-ri.eu/conference/#ov… 1/4 #prpconference2024
Alberto Cazzaniga @ NeurIPS2025 (@albecazzaniga)
How do Language Models process facts and counterfactuals? In our latest ACL 2024 contribution (arxiv.org/abs/2402.11655) we study the information flow of these mechanisms in LLMs, identifying the components that promote and suppress factual responses. #ACL2024NLP @aclmeeting
francesco ortu (@francescortu)

🎉🎉 I'm thrilled to announce that our paper, "Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals", has been accepted at the ACL 2024 main conference! 🚨 Check it out here: arxiv.org/abs/2402.11655 👇 #NLProc #ACL2024NLP
