francesco ortu

34 posts

francesco ortu

francesco ortu

@francescortu

NLP & Interpretability | PhD Student @UniTrieste & Data Engineering Lab @AreaSciencePark | Prev @MPI_IS intern https://t.co/RM03tgnbJP

Trieste, Friuli-Venezia Giulia Katılım Mart 2011
1.2K Takip Edilen267 Takipçiler
francesco ortu
francesco ortu@francescortu·
I'm really excited to be in Paris for the second annual conference of the @IASEAIorg, where yesterday I presented our newly released paper: “Preserving Historical Truth: Detecting Historical Revisionism in Large Language Models” 🇫🇷 Paper link below ⬇️
francesco ortu tweet mediafrancesco ortu tweet media
English
1
1
17
3.8K
francesco ortu retweetledi
Zhijing Jin
Zhijing Jin@ZhijingJin·
First day at UNESCO: We presented our Detecting LLM Historical Revisionism paper by @FrancescOrtu @JoeunYk05 @psyonp @KeenanSamway @BSchoelkopf @AlbeCazzaniga @RadaMihalcea @ZhijingJin and will present Accidental Vulnerability by @psyonp @SimkoSamuel @KellinPelrine @ZhijingJin!
Zhijing Jin tweet mediaZhijing Jin tweet mediaZhijing Jin tweet media
Zhijing Jin@ZhijingJin

Excited to have 3 accepted papers & 9 members of our @JinesisLab at #IASEAI2026, held at UNESCO, Paris🇫🇷! We reveal hidden authoritarian biases in #LLMs, and that fine-tuning can quietly erode model safety, exploring the risks we don't always see in AI 🔍🛡️ 🧵👇

English
2
6
54
6.2K
francesco ortu
francesco ortu@francescortu·
@benno_krojer Thanks for sharing! We were honestly pretty surprised when we first saw the narrow gate effect and how strongly it shows up in the models
English
1
0
1
49
Benno Krojer
Benno Krojer@benno_krojer·
A paper that should get more attention, for those interested in building truly multimodal models (aka not just plugging stuff post hoc into LLMs): arxiv.org/abs/2412.06646 TLDR: Counterintuitively, native multimodal modes seem *less* unified internally (interp)
English
3
2
15
823
francesco ortu retweetledi
Zhijing Jin
Zhijing Jin@ZhijingJin·
We're at #NeurIPS2025 with papers, posters, and talks across many workshops. Come learn about our latest research and explore our newest breakthroughs in #LLMs, #Causality, #AIforScience, and many others!
Zhijing Jin tweet media
English
4
8
42
3.4K
francesco ortu retweetledi
Alessandro Pietro Serra
Alessandro Pietro Serra@aleserra1998·
Excited to present "The Narrow Gate: Localized Image-Text Communication in Native Multimodal Models" at #NeurIPS2025 and #EurIPS2025! 📍NeurIPS: Poster Session 6 #1114 (Fri Dec 5 4pm-7pm) 📍EurIPS: Poster Session D3 #88 (Wed Dec 3 10:30am-12:30pm) 🧵👇
English
1
2
7
843
francesco ortu
francesco ortu@francescortu·
Really excited to see this work recognized! I'm so thankful to have been involved alongside such brilliant minds 🎉
Zhijing Jin@ZhijingJin

Honored to receive the Best Paper Award (the top prize!) at the #NeurIPS2024 Workshop on Pluralistic Alignment @pluralistic_ai! Many thx to my wonderful coauthors, who taught me so much about this interdisciplinary field of #LLMs and moral reasoning: @maxhkw @giorgiopiatti @sydneymlevine @Jiarui_Liu_ @fer_adauto @francescortu Andras Strausz @mrinmayasachan @radamihalcea @YejinChoinka @bschoelkopf. Also thank you so much to all the co-organizers and especially @Ruyuan_Wan for the wonderful photo capturing this memorable moment! 🎉 Check out our "Language Model Alignment in Multilingual Trolley Problems" at arxiv.org/pdf/2407.02273!

English
0
0
2
280
francesco ortu
francesco ortu@francescortu·
Thanks, @DiegoDoimo and @albecazzaniga, for the fantastic mentorship and support! 🙏🎉 They are also attending #NeurIPS, so feel free to reach out to them to discuss our results. I’m excited to keep pushing forward on these topics! 🚀
English
0
0
0
181
francesco ortu
francesco ortu@francescortu·
🚨🚨 Excited to share our latest paper, now on @arxiv! 🖼️ We studied how unified VLMs, trained to generate both text and images (e.g., @MetaAI's Chameleon), exchange information between modalities, comparing them to standard VLMs. Deep dive:👇
francesco ortu tweet media
English
1
4
16
2K