Chris Roth
@rothific
AI @ Mozilla. Focused on on-device AI, local-first data, agent architecture, and AI safety.

🚀 Applications are now open: Constellation's Astra Fellowship 🚀 Fully funded, 5-month fellowship at our Berkeley research institute. Pair with mentors across empirical AI safety research, strategy, and governance at @ConstellOrg! 📅 Apply by May 3rd (begins Sep 2026) 🔗 constellation.org/programs/astra…









@ylecun @nxthompson To be fair the original Anth post was just some cool mech interp expts. This particular tweet sensationalized it a bit much haha. PS: wonder what your thoughts are on the field of mech interp?




SAEs fail at OOD tasks. Why? Features in superposition are linearly representable but not linearly accessible. Instead of discarding sparse coding, we embrace the geometry of superposition and use methods equipped to handle the nonlinearity it induces.


GitHub just living the dream right now





LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM PyPI release 1.82.8 has been compromised: it contains litellm_init.pth with base64-encoded instructions to send all the credentials it can find to a remote server and self-replicate. Link below.
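
A minimal sketch for checking whether the file named in the post is present in a Python environment. It assumes the file would sit directly inside a site-packages directory, which is where Python's `site` module processes `.pth` files at interpreter startup. The helper names `find_suspect_pth` and `default_site_dirs` are hypothetical, and an empty result is not proof that an environment is clean.

```python
import site
from pathlib import Path

# File name taken from the post above; everything else here is illustrative.
SUSPECT_NAME = "litellm_init.pth"


def find_suspect_pth(directories, name=SUSPECT_NAME):
    """Return paths of files called `name` found directly in `directories`."""
    hits = []
    for directory in directories:
        candidate = Path(directory) / name
        if candidate.is_file():
            hits.append(candidate)
    return hits


def default_site_dirs():
    """Collect the site-packages directories of the running interpreter."""
    dirs = []
    # getsitepackages can be absent in some older virtualenv setups.
    if hasattr(site, "getsitepackages"):
        dirs.extend(site.getsitepackages())
    dirs.append(site.getusersitepackages())
    return dirs
```

Calling `find_suspect_pth(default_site_dirs())` with the interpreter of each environment you care about returns a list of matching paths. This only looks for the one file name mentioned in the post, so treat it as a quick first check rather than an audit.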





