Frederick Riemenschneider

43 posts

Frederick Riemenschneider

Frederick Riemenschneider

@bowpis

Heidelberg, Germany Katılım Eylül 2022
193 Takip Edilen68 Takipçiler
Sabitlenmiş Tweet
Frederick Riemenschneider
How and when do multilingual LMs achieve cross-lingual generalization during pre-training? And why do later, supposedly more advanced checkpoints, lose some language identification abilities in the process? Our #ACL2025 paper investigates.
Frederick Riemenschneider tweet media
English
1
4
8
2.1K
Frederick Riemenschneider retweetledi
AI Coffee Break with Letitia
AI Coffee Break with Letitia@AICoffeeBreak·
The world’s largest NLP conference with almost 2,000 papers presented, ACL 2025 just took place in Vienna! 🎓✨Here is a quick snapshot of the event via a short interview with one of the authors whose work caught my attention. 🎥 Watch: youtu.be/GBISWggsQOA #acl2025NLP #acl2025
YouTube video
YouTube
AI Coffee Break with Letitia tweet media
English
1
4
12
1K
Frederick Riemenschneider
I am honored to receive the 2025 #GSCL Best Thesis Award at #KONVENS in Hildesheim for my Master’s thesis, which investigates multilinguality and develops language models for Ancient Greek and Latin. Thank you to my mentors and collaborators. I look forward to what comes next.
English
1
1
5
471
Frederick Riemenschneider
Looking at Bruegel's Tower of Babel in Vienna makes you wonder: How can multilingual language models overcome the language barriers? Find out tomorrow! 📍 Level 1 (ironic, right?), Room 1.15-1 🕐 2 PM #ACL2025NLP
Frederick Riemenschneider tweet media
Frederick Riemenschneider@bowpis

How and when do multilingual LMs achieve cross-lingual generalization during pre-training? And why do later, supposedly more advanced checkpoints, lose some language identification abilities in the process? Our #ACL2025 paper investigates.

English
0
3
13
1.6K
Frederick Riemenschneider
How and when do multilingual LMs achieve cross-lingual generalization during pre-training? And why do later, supposedly more advanced checkpoints, lose some language identification abilities in the process? Our #ACL2025 paper investigates.
Frederick Riemenschneider tweet media
English
1
4
8
2.1K
Frederick Riemenschneider
@crestonbrooks As for loss patterns: Unfortunately, detailed loss analyses are challenging with BLOOM since only 6 checkpoints were published. I'm currently investigating potential grokking phenomena in more controlled toy settings where we can track loss curves more comprehensively.
English
0
0
1
17
Frederick Riemenschneider
@crestonbrooks Thanks for the interesting questions! Regarding concept curriculum: We examined relatively "general" concepts and couldn't identify a clear pattern in which concepts get "translated" first. However, we did observe that different languages follow distinct patterns.
English
1
0
1
20
Frederick Riemenschneider
This phenomenon has a visible effect on text generation: In BLOOM-560m, activating 'earthquake' neurons derived from Spanish data at checkpoint 10,000 generates Spanish text. At checkpoint 400,000, the same method yields English text!
Frederick Riemenschneider tweet media
English
1
0
3
112
Frederick Riemenschneider
Our work brings new computational methods to a field traditionally dominated by manual scholarship, potentially accelerating the discovery of textual errors that have remained hidden for centuries.
English
1
0
2
48
Frederick Riemenschneider
What did Aristotle actually write? We think we know, but reality is messy. As Ancient Greek texts traveled through history, they were copied and recopied countless times, accumulating subtle errors with each generation. Our new #NAACL2025 findings paper tackles this challenge.
English
1
2
7
550