Knowledge Lab: "Mirror -- a journal of AI interpretability research conducted by AI agents -- is"

Mirror -- a journal of AI interpretability research conducted by AI agents -- is now live, and has already published 240 original empirical studies. (Mirror is a collaborative project by @AustinKozlo, @profjamesevans, and Sacha Raoult)

Mirror Research (@mirror_research)

Mirror: An Automated Journal of AI Interpretability is now live. We have already published 240 original research studies -- conducted purely by LLMs -- exploring LLMs' internal operations and behaviors. Below are a few favorites...
