Jinfeng Rao

78 posts

Jinfeng Rao

@Jeffy_Sailing

TLM, Researcher at Google, work on LLM & NLP

San Francisco, CA Katılım Eylül 2012

598 Takip Edilen499 Takipçiler

Jinfeng Rao@Jeffy_Sailing·27 Haz

It’s our honor to have Prof. @mohitban47 to visit Pinterest to introduce his pioneering work on multi-modality LLMs!

Pinterest Engineering@PinterestEng

Join us tomorrow for an exclusive Pinterest Labs Talk Series event. Dr. Mohit Bansal will give a distinguished lecture titled "Multimodal Generative LLMs: Unification, Planning Agents, and Evaluation" RSVP here! 📌 …restlabstalkseriesjune.splashthat.com

English

Jinfeng Rao retweetledi

Tao(Thomas) Yu@tao_agi·21 May

🚀 Breaking news - Yi-Large LMSys Elo result has just surged to the top tier, almost on par with GPT-4-0125-preview! With improvement across all boards, especially reasoning & coding capabilities, we're excited to see what app can build on top of Yi-Large. Explore the API on platform.01.ai Huge congrats to my team from 01.AI for this incredible milestone! 🎉

Arena.ai@arena

Exciting leaderboard update🔥 We've added @01AI_Yi Yi-Large to Arena and collected 15K+ votes over the past week. Yi-Large's performance is super impressive, securing the #7 spot, almost on par with GPT-4-0125-preview! Huge congrats to 01.ai on this incredible achievement, as well as new Yi family launch! Additionally, we’ve added Zhipu AI’s GLM-4-0116 to the leaderboard at #15. Chinese LLMs are getting very competitive now!

English

11.1K

Jinfeng Rao@Jeffy_Sailing·9 Şub

@YiTayML 🤣🤣🤣

QME

1.3K

Yi Tay@YiTayML·9 Şub

glad that my perf rating this time sounds very much like a very famous architecture in deep learning.

English

38K

Jinfeng Rao@Jeffy_Sailing·25 May

Better prompting is all you need. 😂

Aran Komatsuzaki@arankomatsuzaki

Large Language Models are Zero-Shot Reasoners Simply adding “Let’s think step by step” before each answer increases the accuracy on MultiArith from 17.7% to 78.7% and GSM8K from 10.4% to 40.7% with GPT-3. arxiv.org/abs/2205.11916

English

Jinfeng Rao retweetledi

AK@_akhaliq·23 Kas

ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning abs: arxiv.org/abs/2111.10952 EXT5 outperforms strong T5 baselines on SuperGLUE, GEM, Rainbow, Closed-Book QA tasks, and several tasks outside of EXMIX

English

132

Jinfeng Rao retweetledi

Aran Komatsuzaki@arankomatsuzaki·6 Eki

Efficiently Modeling Long Sequences with Structured State Spaces Achieves dramatic performance improvement over the SotA on Long-range Arena and comparable ppl to vanilla Transformer on Wikitext-103 w/ 60x faster generation. openreview.net/pdf?id=uYLFoz1…

English

108

Jinfeng Rao@Jeffy_Sailing·16 Nis

@lintool @xueguang_ma A relevant paper from FB Search: arxiv.org/pdf/2006.11632…

English

Jimmy Lin@lintool·14 Nis

If you're interested in dense retrieval, you'll want to check out this DPR replication effort led by @xueguang_ma arxiv.org/abs/2104.05740 tl;dr - BM25 is better than the original authors made it out to be, and free QA boost with better evidence fusion!

English

Jinfeng Rao retweetledi

Sebastian Ruder@seb_ruder·19 Oca

ML and NLP Research Highlights of 2020 It's been inspiring to look back on all the exciting advances that happened despite such a tumultuous year. Here's a selection of my highlights. ruder.io/research-highl…

English

305

924

Jinfeng Rao@Jeffy_Sailing·11 Kas

Interested in knowing which efficient transformer to use for the best speed/effectiveness tradeoff? Check out our paper on long-range X-formers.

Yi Tay@YiTayML

As a companion to our recent efficient Transformer survey, we designed "Long Range Arena" a new challenging benchmark to help understand and analyze trade-offs between recent efficient Transformer models. Check out our paper at arxiv.org/abs/2011.04006. @GoogleAI @DeepMind

English

Jinfeng Rao retweetledi

Jimmy Lin@lintool·14 Eki

Happy to share an early draft of "Pretrained Transformers for Text Ranking: BERT and Beyond", our forthcoming book (tentatively, early 2021) by @lintool @rodrigfnogueira @andrewyates arxiv.org/abs/2010.06467

English

262

Jinfeng Rao retweetledi

Andrej Karpathy@karpathy·3 Eki

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale openreview.net/forum?id=YicbF… v cool. Further steps towards deprecating ConvNets with Transformers. Loving the increasing convergence of Vision/NLP and the much more efficient/flexible class of architectures.

English

459

1.9K

Jinfeng Rao retweetledi

Aran Komatsuzaki@arankomatsuzaki·3 Eki

Long Range Arena: A Benchmark for Efficient Transformers Q: Which Transformer variant to use? A: It's a bit complicated: openreview.net/forum?id=qVyeW…

English

201

Jinfeng Rao retweetledi

Paul Katsen@Katsen·21 Tem

=GPT3()... the spreadsheet function to rule them all. Impressed with how well it pattern matches from a few examples. The same function looked up state populations, peoples' twitter usernames and employers, and did some math.

English

138

1.7K

8.7K

Jinfeng Rao@Jeffy_Sailing·10 Nis

@ytay017 Happy birthday!

English

Yi Tay@YiTayML·9 Nis

Turned 30 today but completely forgot until my wife wished me Happy birthday. I guess the first thing I did was to check for ICML reviews when I woke up. 😂

English

Jinfeng Rao@Jeffy_Sailing·3 Eki

Glad to present our recent work on building natural language generation system from @facebookai at UC Berkeley NLP Seminar! Find our slides at: jinfengr.github.io/publications/j…

English

Jinfeng Rao@Jeffy_Sailing·5 Eyl

In non-BERT news, I have an #emnlp19 paper w/ @lintool @likicode @ytay017 @victoryang118 about bridging semantic matching and relevance matching approaches to NLP and IR tasks. Sure, doesn't beat BERT, but I think we win in terms of new insights! Paper: bit.ly/2lzRfkI

English

Jinfeng Rao retweetledi

Bruce Croft@wbc11·22 Tem

Here's the slides from my SIGIR keynote talk ciir.cs.umass.edu/downloads/sigi…

English

Jinfeng Rao retweetledi

ACL2019@ACL2019_Italy·12 Haz

The list of accepted papers is finally out! acl2019.org/EN/program/pap…

English

219

Jinfeng Rao retweetledi

Nick Craswell@nick_craswell·4 Haz

Announcing: Deep Learning Track at TREC-2019. Coordinators: Nick Craswell, Bhaskar Mitra, Emine Yilmaz, Daniel Campos. Large training data! Two datasets! Guidelines: microsoft.github.io/TREC-2019-Deep…

English

Jinfeng Rao retweetledi

Graham Neubig@gneubig·31 May

Cross-lingual transfer is a powerful tool for low-resource NLP. But when you build a system for a new language (say Bengali), what language do you transfer from? Our #ACL2019 paper "Choosing Transfer Languages for Cross-lingual Learning" asks this: arxiv.org/abs/1905.12688 1/7

English

246

Keşfet

@mohitban47 @YiTayML @lintool @xueguang_ma @rodrigfnogueira @andrewyates @facebookai @likicode