Clément Targe retweetledi

What is the true depth of an LLM?
Together with @DanielePaliotta, @MatPagliardini, M. Jaggi and @francoisfleuret we show that LLMs may have a smaller effective depth, and that it can be exploited to increase inference speeds on multi-GPU settings!
arxiv.org/abs/2502.02790
(1/N)

English
