☔️
11.1K posts


米津玄師 - IRIS OUT (covered by なとり) #なとり #natori #나토리


Diffusion LLMs (DLLM) can do “any-order” generation, in principle, more flexible than left-to-right (L2R) LLM. Our main finding is uncomfortable: ➡️ In real language, this flexibility backfires: DLLMs become worse probabilistic models than the L2R / R2L AR LMs. This thread is about why “any order” turns into a curse. (Work with Xinyu Yang @Xinyu2ML , Min Lin @mavenlin , Chao Du @duchao0726 and the team.) Blog Link: #2af0ba07baa880c29fc4c8c198244cc8" target="_blank" rel="nofollow noopener">notion.so/Understanding-…

Ilya Sutskever: We are no longer in the age of scaling, we are back to the age of research

🚨BREAKING :A video documents a horrific Israeli crime: targeting ambulance and civil defense crews as they were rescuing victims and the wounded after Israel bombed Nasser Medical Complex in Khan Younis among them was journalist Hossam al-Masri.


“Israel has killed a classroom full of children every single day.” UNRWA Sam Rose tells BBC Radio 4. Children in #Gaza have been killed while sleeping, sheltering in schools, or queuing for water. Graphic by @TRTWorld
