Roy Mayan

4 posts

Roy Mayan

Roy Mayan

@Roym4498

NLP and Computational Linguistics 💬

Katılım Mart 2024
37 Takip Edilen8 Takipçiler
Roy Mayan retweetledi
Yoav Gur Arieh
Yoav Gur Arieh@GurYoav·
🧠 To reason over text and track entities, we find that language models use three types of 'pointers'! They were thought to rely only on a positional one—but when many entities appear, that system breaks down. Our new paper shows what these pointers are and how they interact 👇
GIF
English
2
15
75
16.5K
Roy Mayan retweetledi
Yoav Gur Arieh
Yoav Gur Arieh@GurYoav·
Can we precisely erase conceptual knowledge from LLM parameters? Most methods are shallow, coarse, or overreach, adversely affecting related or general knowledge. We introduce🪝𝐏𝐈𝐒𝐂𝐄𝐒 — a general framework for Precise In-parameter Concept EraSure. 🧵 1/
Yoav Gur Arieh tweet media
English
2
9
71
8.1K
Roy Mayan retweetledi
Mor Geva
Mor Geva@megamor2·
How can we interpret LLM features at scale? 🤔 Current pipelines use activating inputs, which is costly and ignores how features causally affect model outputs! We propose efficient output-centric methods that better predict how steering a feature will affect model outputs. New preprint led by my student @GurYoav with dream team @Roym4498, Chen Agassy, and Atticus Geiger 🧵1/
GIF
English
6
25
114
7.4K