Miao LI

120 posts

Miao LI banner
Miao LI

Miao LI

@oaimli

Computers learn to reason. He/him 🤗

Melbourne, Australia Katılım Haziran 2019
776 Takip Edilen130 Takipçiler
Miao LI retweetledi
Yann LeCun
Yann LeCun@ylecun·
Major difference in my mind: - an engineer, given a problem, invents and tries multiple solutions and stops when the solution is good enough. The goal is product innovation and shipping. - a scientist asks new questions, proposes various new solutions, compares them (sometimes with old ones), and writes about it. The methodology must be sound or else peers will sneer. The goal is scientific breakthroughs and technological progress. Both can be called "researchers". Many people can do both: these are activities, not identities. Importantly, most product innovations are built on scientific breakthroughs and technological innovations that happened 2, 5, 10, or 20 years earlier.
English
92
252
3K
197.7K
Miao LI
Miao LI@oaimli·
A seemingly better solution for peer-review: Stage-1: reviewers write comments, no scores; Stage-2: authors provide rebuttal/clarification for multi-round discussions; Stage-3: reviewers give scores with supporting evidence based on the discussions at least responses.
English
0
0
2
143
Miao LI retweetledi
Richard Socher
Richard Socher@RichardSocher·
I've talked about this in various panels, keynotes and forums: foundational model providers are likely going to be similar to large telecom providers. They provide crucial infrastructure, they're very expensive to build and maintain, they create a lot of value in the ecosystem but they might not capture that value long term. You can't build an Uber or Google maps or tiktok without good pervasive internet. But telcos don't get the majority of that value. Foundational model providers are likely similar and that's why we haven't invested in such companies at AIX Ventures. It's also why I'm very bullish on the future of ydc and the big partnerships we're working on with larger enterprises and publishers. We're helping them with accurate answers, agents and AGI over their own and public data. Just like the coal of Jevons paradox (im glad to see others have picked that up also) and internet bandwidth before it, more efficient intelligence will lead to us using it in more and unexpected places.
English
6
13
56
9.4K
Miao LI retweetledi
Marzena Karpinska
Marzena Karpinska@mar_kar_·
Can #LLMs truly reason over loooong context? 🤔 NoCha asks LLMs to verify claims about *NEW* fictional books 🪄 📚 ⛔ LLMs that solve needle-in-the-haystack (~100%) struggle on NoCha! ⛔ None of 11 tested LLMs reach human performance → 97%. The best, #GPT-4o, gets only 55.8%.
Marzena Karpinska tweet media
English
30
91
461
121.7K
Miao LI
Miao LI@oaimli·
This is one reason why I still keep working on language generation while people saying it is ‘solved’ by LLMs or not interesting anymore.
English
0
0
2
131
Miao LI retweetledi
Hamel Husain
Hamel Husain@HamelHusain·
My rule in meetings: “you are not allowed to say the word agents. Talk about the problem you are trying to solve”
English
14
10
303
15.3K
Miao LI retweetledi
Denny Zhou
Denny Zhou@denny_zhou·
Tree search, the key idea in classical AI, has little to do with true intelligence or reasoning, no matter which fun puzzle / games are well solved by search eg game 24. Search is just a tool usage. Surprised to see so many regard search as reasoning to pursue.
English
39
35
474
96.6K
Miao LI retweetledi
Ahmad Beirami
Ahmad Beirami@abeirami·
In today's publication culture, most authors are after being SOTA, showing tables with 𝐛𝐨𝐥𝐝 numbers, and writing the minimum viable paper! The goal of a scientific paper should be to push the field forward with new intuition/insights on how to think about solving a problem.
English
5
25
251
23K
Miao LI retweetledi
Yu Zhao
Yu Zhao@yuzhaouoe·
OLMo supports Intra-Document Causal Masking now 🤗
Yu Zhao tweet mediaYu Zhao tweet media
Pasquale Minervini@PMinervini

Intra-Document Causal Masking is one of the magic tricks behind LLaMA 3 and 3.1! It was proposed initially in @yuzhaouoe's ACL 2024 Oral "Analysing The Impact of Sequence Composition on Language Model Pre-Training" (arxiv.org/abs/2402.13991), and it makes a massive difference both in terms of pre-training dynamics and downstream accuracy on a wide array of downstream tasks 🚀🚀🚀

English
0
6
20
3.8K
Miao LI retweetledi
MIT CSAIL
MIT CSAIL@MIT_CSAIL·
How do black-box neural networks transform raw data into predictions? Inside these models are thousands of simple "components" working together. New MIT CSAIL research (bit.ly/473lcfE) introduces a method that helps us understand how these components compose to affect model behavior — a key step in making neural networks more interpretable. 🧵
English
4
59
248
23.3K
Miao LI retweetledi
Yu Zhao
Yu Zhao@yuzhaouoe·
I will present our poster "Analysing The Impact of Sequence Composition on Language Model Pre-Training" at 10:30am🚀🚀🚀 (Oral presentation had presented on Monday) Welcome to have a chat if you are interested in language model pre-training🤗 #ACL2024 #ACL2024NLP
Yu Zhao tweet mediaYu Zhao tweet media
English
0
12
39
3.7K
Miao LI
Miao LI@oaimli·
- A Sentiment Consolidation Framework for Meta-Review Generation, arxiv.org/abs/2402.18005 - NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism, arxiv.org/abs/2403.00862
English
0
0
1
84
Miao LI
Miao LI@oaimli·
Excited to be giving in-person presentations of two papers at ACL'24 on Tuesday next week in Bangkok. Look forward to meeting old friends and making new ones!
Miao LI tweet mediaMiao LI tweet media
English
1
3
12
1.2K