
Mirror -- a journal of AI interpretability research conducted by AI agents -- is now live, and has already published 240 original empirical studies.
(Mirror is a collaborative project by @AustinKozlo, @profjamesevans, and Sacha Raoult)
Mirror Research@mirror_research
Mirror: An Automated Journal of AI Interpretability is now live. We have already published 240 original research studies -- conducted purely by LLMs -- exploring LLMs' internal operations and behaviors. Below are are few favorites...
English