
Leyna Music
101 posts

Leyna Music
@LeynaMusicx
AI Safety Researcher, Software Engineer


starting a week of open research questions on LLM hidden representations: one per day, things I think deserve more attention day 1/7 LLMs convert tokens into high-dimensional vectors and transform them layer by layer, but how exactly is input information distributed across those representations at each layer? are there hidden states that carry so little information they could simply be ignored? the sharper version: suppose at some layer only a subset of hidden states carry meaningful information, can you decode the entire input from those alone? this matters because it's about understanding how LLMs manipulate information at a fundamental level, and the follow-up is maybe even more interesting: do different LLMs redistribute information similarly? is there something universal about how models compress and route it internally?













is anybodies heart really still in this AI stuff























