Frederick Riemenschneider

43 posts

Frederick Riemenschneider

@bowpis

Heidelberg, Germany Katılım Eylül 2022

193 Takip Edilen68 Takipçiler

Sabitlenmiş Tweet

Frederick Riemenschneider@bowpis·6 Haz

How and when do multilingual LMs achieve cross-lingual generalization during pre-training? And why do later, supposedly more advanced checkpoints, lose some language identification abilities in the process? Our #ACL2025 paper investigates.

English

2.1K

Frederick Riemenschneider retweetledi

AI Coffee Break with Letitia@AICoffeeBreak·14 Eyl

The world’s largest NLP conference with almost 2,000 papers presented, ACL 2025 just took place in Vienna! 🎓✨Here is a quick snapshot of the event via a short interview with one of the authors whose work caught my attention. 🎥 Watch: youtu.be/GBISWggsQOA #acl2025NLP #acl2025

YouTube

AI Coffee Break with Letitia tweet media

English

Frederick Riemenschneider@bowpis·14 Eyl

ACL paper: aclanthology.org/2023.acl-long.… Models: #language-models" target="_blank" rel="nofollow noopener">github.com/Heidelberg-NLP… Read more: cl.uni-heidelberg.de/nlpgroup/news/… Morphological Analysis Demo: huggingface.co/spaces/bowphs/… Machine Translation Demo: huggingface.co/spaces/bowphs/ Best Thesis Award: gscl.org/en/activities/…

English

110

Frederick Riemenschneider@bowpis·14 Eyl

I am honored to receive the 2025 #GSCL Best Thesis Award at #KONVENS in Hildesheim for my Master’s thesis, which investigates multilinguality and develops language models for Ancient Greek and Latin. Thank you to my mentors and collaborators. I look forward to what comes next.

English

471

Frederick Riemenschneider@bowpis·28 Tem

Looking at Bruegel's Tower of Babel in Vienna makes you wonder: How can multilingual language models overcome the language barriers? Find out tomorrow! 📍 Level 1 (ironic, right?), Room 1.15-1 🕐 2 PM #ACL2025NLP

Frederick Riemenschneider@bowpis

English

1.6K

Frederick Riemenschneider@bowpis·6 Haz

English

2.1K

Frederick Riemenschneider@bowpis·7 Haz

@crestonbrooks As for loss patterns: Unfortunately, detailed loss analyses are challenging with BLOOM since only 6 checkpoints were published. I'm currently investigating potential grokking phenomena in more controlled toy settings where we can track loss curves more comprehensively.

English

Frederick Riemenschneider@bowpis·7 Haz

@crestonbrooks Thanks for the interesting questions! Regarding concept curriculum: We examined relatively "general" concepts and couldn't identify a clear pattern in which concepts get "translated" first. However, we did observe that different languages follow distinct patterns.

English

Frederick Riemenschneider@bowpis·6 Haz

Read the full paper here: arxiv.org/pdf/2506.01629 Reach out if you have any questions or if you are attending ACL and want to say hi. 🙋

English

Frederick Riemenschneider@bowpis·6 Haz

This phenomenon has a visible effect on text generation: In BLOOM-560m, activating 'earthquake' neurons derived from Spanish data at checkpoint 10,000 generates Spanish text. At checkpoint 400,000, the same method yields English text!

English

112

Frederick Riemenschneider@bowpis·1 May

Read the full paper: aclanthology.org/2025.findings-… Work by @crestonbrooks, Johannes Haubold, Charlie Cowen-Breen, Jay White, Desmond DeVaul, me, Karthik Narasimhan, and Barbara Graziosi

English

Frederick Riemenschneider@bowpis·1 May

Our work brings new computational methods to a field traditionally dominated by manual scholarship, potentially accelerating the discovery of textual errors that have remained hidden for centuries.

English

Frederick Riemenschneider@bowpis·1 May

What did Aristotle actually write? We think we know, but reality is messy. As Ancient Greek texts traveled through history, they were copied and recopied countless times, accumulating subtle errors with each generation. Our new #NAACL2025 findings paper tackles this challenge.

English

550

Keşfet

@crestonbrooks @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA @nikifrancismediavine