
Miao LI
120 posts

Miao LI
@oaimli
Computers learn to reason. He/him 🤗



New blog: Do LLM benchmarks ignore NLG? I was very disappointed to realise that the evaluation suite for Amazon Nova has poor coverage of NLG tasks. Which is surprising since LLMs are largely used to generate texts ehudreiter.com/2024/12/26/do-…

feeling a bit under the weather this week … thus an increased level of activity on social media and blog: kyunghyuncho.me/i-sensed-anxie…




Intra-Document Causal Masking is one of the magic tricks behind LLaMA 3 and 3.1! It was proposed initially in @yuzhaouoe's ACL 2024 Oral "Analysing The Impact of Sequence Composition on Language Model Pre-Training" (arxiv.org/abs/2402.13991), and it makes a massive difference both in terms of pre-training dynamics and downstream accuracy on a wide array of downstream tasks 🚀🚀🚀








Represent!!! 🚀🚀🚀🚀🚀








