
The first inherently interpretable AI platform is finally here. Welcome to Clarity.
Sidhant Thole
143 posts

@SPThole
ai@fidelityinvestments, ai researcher, this handle is for personal views

The first inherently interpretable AI platform is finally here. Welcome to Clarity.









Modded-NanoGPT optimization result #29 (2026/05/11): @nilinabra has achieved a new step-count record of 2990 (40-step improvement) by halving the growth rate of the L2-norm of the hidden matrix parameters. This result is better than the previous record with a p-value of 4e-5.









even more baking with baby




In modded-nanogpt we also found that the last couple attention layers hate interacting with the final prediction MLPs. So we work around it with a cached activation from earlier. In the attention residuals paper, Kimi doesn't explicitly mention it, but you can see from their chart that the final attn layers dont engage with the final outputs. So I think there is something fundamental going on here.



Mathematician reacts to OpenAI's recent proof:
