Yao

13 posts

Yao

Yao

@yaozhaoai

Researcher/engineer working on LLMs and agents. ex Google Deepmind.

California, USA Katılım Aralık 2019
386 Takip Edilen254 Takipçiler
Yao retweetledi
Peter J. Liu
Peter J. Liu@peterjliu·
One of the top machine learning conferences #ICLR2025 is this week. But there’s 3000+ accepted papers, which is a lot to sift through. Use RadPod to chat with them all and quickly hone in on your interests. Examples queries: “find papers with more than one OpenAI-affiliated author” “find papers that propose alternatives to Transformer architecture in LLM” “give an overview of all spotlight or oral papers with Yann Lecun as author” You can even get a link to the OpenReview reviews easily.
English
1
3
8
2.4K
Yao retweetledi
Peter J. Liu
Peter J. Liu@peterjliu·
Recently a huge new batch of files on the JFK Assassination was released by the National Archives as a result of a presidential executive order. A whopping ~80,000 pages of scanned PDFs -- available but not accessible. AI to the rescue! Except none of the AI apps can handle this type and amount of context ... until now. We built RadPod AI to enable highly-accurate, deep research on your (possibly huge) data.
English
3
2
22
3.8K
Yao retweetledi
Aran Komatsuzaki
Aran Komatsuzaki@arankomatsuzaki·
Transformer decoder with MoE and efficient attention has been available at tensor2tensor library since 2017 A paper that trained Transformer decoder with MoE, efficient attention and up to 11k context length was released in September 2017 (arxiv.org/abs/1801.10198).
English
2
40
193
26.4K
Denny Zhou
Denny Zhou@denny_zhou·
If letting you name one ml technique that was considered to be critical in building AGI but now you think it is irrelevant or at least not important, what is on top of your mind?
English
12
0
24
7.7K
Yao
Yao@yaozhaoai·
@GriffinAdams16 @peterjliu Thanks for sharing your paper, super interesting. Same scrutiny on HF data composition is much needed too!
English
0
0
2
80
Griffin Adams
Griffin Adams@GriffinAdams92·
@peterjliu @yaozhaoai Very cool and nice follow up to first SLiC! There’s also unexplored upside in how to construct these offline candidate sets. We show it has a large impact on performance in a new ACL preprint arxiv.org/abs/2305.07615. This line of work can be scaled to more diverse methods.
English
1
2
7
1.8K
Peter J. Liu
Peter J. Liu@peterjliu·
Here is our “slick” RLHF-alternative without RL: arxiv.org/abs/2305.10425 (SLiC-HF) TL;DR: Works as well as RLHF, but a lot simpler. About as easy and efficient as fine-tuning. Much better than simply fine-tuning on good examples. From great collaborators: @yaozhaoai, @rishabh_joshi4, Tianqi Liu, @khalman_m, @Mohamma78108419, @peterjliu.
Peter J. Liu tweet media
Peter J. Liu@peterjliu

The true star of RLHF is F=feedback. You may not need RL and you may not need humans.

English
10
157
799
214K
Yao
Yao@yaozhaoai·
Key of learning from feedback is a different signal than supervised fine-tuning: distinguish better/worse seqs vs generate plausible seqs Evidence: contrastive learning and RL learn from feedback equally well, both much better than fine-tune on only positive feedback.
Peter J. Liu@peterjliu

Here is our “slick” RLHF-alternative without RL: arxiv.org/abs/2305.10425 (SLiC-HF) TL;DR: Works as well as RLHF, but a lot simpler. About as easy and efficient as fine-tuning. Much better than simply fine-tuning on good examples. From great collaborators: @yaozhaoai, @rishabh_joshi4, Tianqi Liu, @khalman_m, @Mohamma78108419, @peterjliu.

English
0
0
8
1K
Peter J. Liu
Peter J. Liu@peterjliu·
Amazing how much progress in AI is due to two chain rules: one from calculus, the other from probability.
English
17
98
1.1K
188.9K
Yao retweetledi
Peter J. Liu
Peter J. Liu@peterjliu·
We are hiring for a full-time researcher/engineer in the Brain (Google Research) team who will focus on text generation research and its applications. A wide variety of backgrounds and experiences will be considered. DM if you're interested or have leads.
English
13
65
333
0
Yao retweetledi
Peter J. Liu
Peter J. Liu@peterjliu·
New SOTA results for abstractive summarization just posted to arxiv.org/abs/1912.08777! We have a new way to pre-train for summarization, and evaluated our PEGASUS model on 12 diverse downstream summarization tasks, achieving SOTA on all, in some cases by a significant margin.
Peter J. Liu tweet media
English
2
8
28
0