XiulinYang

35 posts

XiulinYang banner
XiulinYang

XiulinYang

@xiulin_yang

PhC student in Computational Linguistics@GUCL

Katılım Ocak 2020
201 Takip Edilen63 Takipçiler
XiulinYang
XiulinYang@xiulin_yang·
@_Suresh2 @weGotlieb Thanks for your question! Based on our current results, it’s hard to interpret whether frequency or position matter more, but we do find that disrupting structural predictability makes the LMs drop the most (Table 2 & Figure 7)
English
0
0
0
7
XiulinYang
XiulinYang@xiulin_yang·
🎉 Happy to share that our paper on function words & language learning (w/ Heidi Getz & @weGotlieb) is accepted to #ACL2026! A little late to the party, but still worth celebrating 🥳 We ask: what statistical properties help a learner abstract grammatical knowledge from linear input? Turns out function words, though often overlooked, play an important role. Check out our updated preprint: arxiv.org/pdf/2601.21191 🧵 1/4
XiulinYang tweet media
English
2
13
34
2.7K
XiulinYang
XiulinYang@xiulin_yang·
One more interesting finding is that a 5-gram LM shows a different learning pattern as transformers in terms of function words. This suggests that transformers develop syntactic generalization beyond linear heuristic rules. 5/4
English
0
0
3
84
XiulinYang
XiulinYang@xiulin_yang·
Finally, we probe LM internals via attention analysis and ablation experiments to understand how much models actually *rely* on function words during processing. - When all three properties are intact -> models show the largest performance drop when function words are removed - When structural predictability is disrupted -> models shift away from function words and lean on other cues instead I'll be at ACL this summer and happy to chat with anyone interested!😀 4/4
English
1
1
4
110
XiulinYang retweetledi
Kanishka Misra 🌊
Kanishka Misra 🌊@kanishkamisra·
Announcing a new version of our 2024 paper on linguistic hypothesis generation from LMs! @najoungkim and I have systematized our hypothesis generation framework, added stringent criteria for model selection, 10x-ed our learning trials, and included an epigraph from Jeff Elman!
Kanishka Misra 🌊 tweet media
English
1
10
28
3.2K
XiulinYang retweetledi
Catherine Arnett
Catherine Arnett@linguist_cat·
I’m at #HSP2026 at MIT this week! I’ll be giving a talk Friday at 5:25pm entitled “Structural Priming Effects in Language Models are Less Human-like in Languages Other Than English”. Looking forward to chatting to everyone!
English
0
1
15
643
XiulinYang retweetledi
Cui Ding
Cui Ding@CuiDing_CL·
👀Ever wondered how visual information quality affects reading and language processing? Our new #EMNLP2025 paper, “Modeling Bottom-up Information Quality during Language Processing”, bridges psycholinguistics and multimodal LLMs. 🧠💡👇 arxiv.org/pdf/2509.17047
Cui Ding tweet media
English
1
4
19
1.2K
XiulinYang retweetledi
Ethan Gotlieb Wilcox
Ethan Gotlieb Wilcox@weGotlieb·
I will be recruiting PhD students via Georgetown Linguistics this application cycle! Come join us in the PICoL (pronounced “pickle”) lab. We focus on psycholinguistics and cognitive modeling using LLMs. See the linked flyer for more details: bit.ly/3L3vcyA
Ethan Gotlieb Wilcox tweet media
English
1
7
33
3.5K
XiulinYang retweetledi
Catherine Arnett
Catherine Arnett@linguist_cat·
I’m in Vienna all week for @aclmeeting and I’ll be presenting this paper on Wednesday at 11am (Poster Session 4 in HALL X4 X5)! Reach out if you want to chat about multilingual NLP, tokenizers, and open models!
Catherine Arnett@linguist_cat

✨New pre-print✨ Crosslingual transfer allows models to leverage their representations for one language to improve performance on another language. We characterize the acquisition of shared representations in order to better understand how and when crosslingual transfer happens.

English
7
8
63
7.1K
XiulinYang
XiulinYang@xiulin_yang·
🥳Thrilled to present our #ACL2025 paper with @t_aoyam, @yuekun_yao, and @weGotlieb on language modeling! Can LMs’ learning dynamics distinguish typologically attested from unattested and from truly impossible languages? We test this across 12 languages and find: yes, but…👀
English
2
3
12
1.8K
XiulinYang retweetledi
Albert Gu
Albert Gu@_albertgu·
Tokenization is just a special case of "chunking" - building low-level data into high-level abstractions - which is in turn fundamental to intelligence. Our new architecture, which enables hierarchical *dynamic chunking*, is not only tokenizer-free, but simply scales better.
Albert Gu tweet media
Sukjun (June) Hwang@sukjun_hwang

Tokenization has been the final barrier to truly end-to-end language models. We developed the H-Net: a hierarchical network that replaces tokenization with a dynamic chunking process directly inside the model, automatically discovering and operating over meaningful units of data

English
61
192
1.2K
230.5K
XiulinYang retweetledi
Ahnaf Mozib Samin
Ahnaf Mozib Samin@im_samin·
@xiulin_yang Catastrophic forgetting of the language model? It happens when you sequentially finetune with new datasets.
English
1
0
0
45
XiulinYang retweetledi
Malvina Nissim
Malvina Nissim@MalvinaNissim·
📢 *Looking for a PhD student!* @hedderik and I are starting a super exciting project on using LLMs and adaptive methods to improve learning of complex knowledge, and address implicit biases. Want to do a PhD with us? Get in touch (asap)! @GroNlp @univgroningen @JTSchool_UG
English
3
23
46
6.4K
XiulinYang retweetledi
Andrew Akbashev
Andrew Akbashev@Andrew_Akbashev·
This is how I advise my #PhD students to write research manuscripts (in case someone finds it helpful). General points: 1. Research questions addressed by your manuscript are key and should guide you. 2. Don’t view your manuscript as an article. See it as a STORY. 3. Pick the writing style that is easily understood by a broader community. Make reading easy. 4. Most of data should get into the paper. If some doesn’t support the hypothesis, it still must be in the Suppl. Information. It must show the reproducibility limits. 5. Make the paper shorter, not longer. Cut out things that may sound like ‘bluff’ or ‘decoration’ of the story. Use well-defined terminology, don’t invent it unless clearly necessary. 6. Focus on reporting & explaining the numbers. Minimize discussions of qualitative outcomes and your imagination. ▫️ Specific steps: 1️⃣ First, formulate and polish the key questions that your study addresses. It may take hours or even days (even though you've been doing research in this area for years). A single study should address no more than 1-3 key questions. It’s your perfect start for writing. 2️⃣ Write down the structure of your STORY first: Sections and Subsections that will answer those questions. Into each subsection, put 1-2 sentences that formulate the message(s) from this subsection. It will hugely help you navigate the manuscript later and save a lot of time. 3️⃣ Write approximate messages in the conclusion section. Usually, no more than 1-4 sentences. ▫️ At this point, SHARE your [structure+questions+messages] document with your advisor for feedback. Toss it back and forth until you both converge. You can also include major collaborators if needed. ▫️ 4️⃣ Write the introduction part. Put down the paragraphs that introduce a reader into the key question(s) of the manuscript and the background of your story. 5️⃣ Write the main text for each section, smoothly and firmly. Each paragraph should add a separate value and end with a message-like sentence. Follow the “First… Second… Third…” structure for paragraphs when possible, it gives rigor and readability to your story. 6️⃣ Write the conclusions. Add a broader perspective that is justified and not generic. 7️⃣ Write the abstract. It must have simple terminology and clearly explain what readers can find inside the paper. It also should contain the key conclusions. 8️⃣ Write up 4-5 different titles and spend >30 mins with your team discussing which title sounds best. Finally, iterate on the resulting draft within your team. The number of drafts can easily exceed 20. ▫️ ❗In addition, I always emphasize that a high quality of your research paper: - sharpen your writing and analytical skills. - shapes your reputation. - shows who you are as a researcher and communicator. ▫️ p.s. Everyone has a different style of advising and writing. You can adopt only some specific steps if you find them helpful. p.p.s. Another way that we sometimes use is by starting with figures ('story in figures' style). #AcademicTwitter #AcademicChatter
English
62
1.3K
5.7K
859.4K