Roy Mayan

@Roym4498

NLP and Computational Linguistics 💬

Katılım Mart 2024

37 Takip Edilen8 Takipçiler

Roy Mayan retweetledi

Yoav Gur Arieh@GurYoav·8 Eki

🧠 To reason over text and track entities, we find that language models use three types of 'pointers'! They were thought to rely only on a positional one—but when many entities appear, that system breaks down. Our new paper shows what these pointers are and how they interact 👇

GIF

English

16.5K

Roy Mayan retweetledi

Yoav Gur Arieh@GurYoav·29 May

Can we precisely erase conceptual knowledge from LLM parameters? Most methods are shallow, coarse, or overreach, adversely affecting related or general knowledge. We introduce🪝𝐏𝐈𝐒𝐂𝐄𝐒 — a general framework for Precise In-parameter Concept EraSure. 🧵 1/

English

8.1K

Roy Mayan retweetledi

Mor Geva@megamor2·15 Oca

How can we interpret LLM features at scale? 🤔 Current pipelines use activating inputs, which is costly and ignores how features causally affect model outputs! We propose efficient output-centric methods that better predict how steering a feature will affect model outputs. New preprint led by my student @GurYoav with dream team @Roym4498, Chen Agassy, and Atticus Geiger 🧵1/

GIF

English

114

7.4K

Roy Mayan@Roym4498·20 Mar

These bracelets are awesome! Thank you Wiz 😎 @wiz_io #SecDevLove

English

Keşfet

@GurYoav @wiz_io @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA