Yuning Mao

25 posts

@yuning_pro

TBD @AIatMeta, 🦙Post-training since Llama 2 https://t.co/OUjOWWA8kL

Moon · Joined June 2021
199 Following · 307 Followers
Yuning Mao reposted
Yiqing Xie @YiqingXieNLP
Training on issue-solving only does NOT guarantee transfer to other tasks. 🎨Introducing Hybrid-Gym - synthetic training tasks for generalization (hybrid-gym.github.io) +25.4% on SWE-Bench / +7.9% on SWT-Bench / +5.1% on Commit-0 with NO issue-solving / test-gen/... training
[image]
1 reply · 23 reposts · 102 likes · 15.7K views
Yuning Mao reposted
Xianjun Yang @xianjun_agi
I was laid off by Meta today. As a Research Scientist, my work was just cited by the legendary @johnschulman2 and Nicholas Carlini yesterday. I’m actively looking for new opportunities — please reach out if you have any openings!
[image]

Susan Zhang @suchenzang
👀

269 replies · 348 reposts · 4.5K likes · 1.8M views
Yuning Mao reposted
Tim Franzmeyer @frtimlive
What if LLMs knew when to stop? 🚧 HALT finetuning teaches LLMs to only generate content they’re confident is correct. 🔍 Insight: Post-training must be adjusted to the model’s capabilities. ⚖️ Tunable trade-off: Higher correctness 🔒 vs. More completeness 📝 with @AIatMeta 🧵
1 reply · 13 reposts · 65 likes · 9.5K views
Yuning Mao reposted
Yiqing Xie @YiqingXieNLP
How to construct repo-level coding environments in a scalable way? Check out RepoST: an automated framework to construct repo-level environments using Sandbox Testing (repost-code-gen.github.io). Models trained with RepoST data can generalize well to other datasets (e.g., RepoEval)
[image]
3 replies · 23 reposts · 88 likes · 10.6K views
Yuning Mao reposted
Xianjun Yang @xianjun_agi
📢My New Paper: Diversity-driven Data Selection for Language Model Tuning through Sparse Autoencoder TLDR: We proposed to use features from SAEs as a measure of data diversity & complexity and proved its effectiveness on data selection for LLM tuning. arxiv.org/pdf/2502.14050
[image]
7 replies · 37 reposts · 180 likes · 18.6K views
Yuning Mao reposted
Thomas Wolf @Thom_Wolf
Among the most impressive aspects of the Llama 3.1 release is the accompanying research paper! Close to 100 pages of deep knowledge-sharing on LLMs like we haven't seen very often recently. What a treat! It covers everything: pretraining data, filtering, annealing, synthetic data, scaling laws, infrastructure, parallelism, training recipes, post-training adaptation, tool-use, benchmarking, inference strategies, quantization, vision, speech, videos... Mind-blown! Maybe the single paper you can read today to join the field of LLMs from zero right to the frontier. Read it here and feel the open science: ai.meta.com/research/publi…
[image]
15 replies · 250 reposts · 1.1K likes · 76.1K views
Yuning Mao reposted
AI at Meta @AIatMeta
Introducing Meta Llama 3: the most capable openly available LLM to date. Today we’re releasing 8B & 70B models that deliver on new capabilities such as improved reasoning and set a new state-of-the-art for models of their sizes. Today's release includes the first two Llama 3 models — in the coming months we expect to introduce new capabilities, longer context windows, additional model sizes and enhanced performance + the Llama 3 research paper for the community to learn from our work. More details ➡️ go.fb.me/i2y41n Download Llama 3 ➡️ go.fb.me/ct2xko
340 replies · 1.4K reposts · 5.7K likes · 1.1M views
Yuning Mao reposted
Mikayel Samvelyan @_samvelyan
Introducing 🌈 Rainbow Teaming, a new method for generating diverse adversarial prompts for LLMs via LLMs It's a versatile tool 🛠️ for diagnosing model vulnerabilities across domains and creating data to enhance robustness & safety 🦺 Co-lead w/ @sharathraparthy & @_andreilupu
5 replies · 44 reposts · 179 likes · 56.4K views
Yuning Mao reposted
AI at Meta @AIatMeta
Announcing Purple Llama — A new project to help level the playing field for building safe & responsible generative AI experiences. Purple Llama includes permissively licensed tools, evals & models to enable both research & commercial use. More details ➡️ bit.ly/3ReRNHI
[image]
17 replies · 182 reposts · 843 likes · 351K views
Yuning Mao @yuning_pro
Honored to be a core contributor of Llama 2 release. Give it a try and I hope you'll be pleasantly surprised. Feel free to DM me for any feedback (especially on safety).
AI at Meta @AIatMeta

We believe an open approach is the right one for the development of today's AI models. Today, we're releasing Llama 2, the next generation of Meta's open source Large Language Model, available for free for research & commercial use. Details ➡️ bit.ly/3Dh9hNp

1 reply · 0 reposts · 36 likes · 5.3K views
Yuning Mao @yuning_pro
UniPELT often surpasses the upper bound when taking the best performance of all its submodules used individually on each task, indicating that a mixture of multiple PELT methods may be inherently more effective than single methods. 🧵[4/4]
[image]
0 replies · 0 reposts · 3 likes
Yuning Mao @yuning_pro
On the GLUE benchmark, UniPELT consistently achieves 1~3pt gains compared to the best individual PELT method that it incorporates and even outperforms fine-tuning under different setups, exhibiting superior model effectiveness and robustness. 🧵[3/4]
[image]
1 reply · 0 reposts · 3 likes
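The UniPELT thread above describes mixing several parameter-efficient tuning (PELT) submodules and letting the model choose among them. A minimal NumPy sketch of that gating idea, assuming (as an illustration only, not from the paper) a LoRA-style low-rank update and an adapter-style bottleneck, each scaled by a learned input-conditioned sigmoid gate:

```python
import numpy as np

rng = np.random.default_rng(0)
hidden, rank = 16, 4

# Stand-in for a frozen transformer sublayer's output (batch of 2 tokens)
x = rng.normal(size=(2, hidden))

# Illustrative PELT submodules (weights are random for the sketch):
# a LoRA-style low-rank update and an adapter-style bottleneck.
lora_a = rng.normal(size=(hidden, rank)) * 0.1
lora_b = rng.normal(size=(rank, hidden)) * 0.1
adp_down = rng.normal(size=(hidden, rank)) * 0.1
adp_up = rng.normal(size=(rank, hidden)) * 0.1

# One scalar gate per submodule, conditioned on the input
w_gate_lora = rng.normal(size=(hidden, 1))
w_gate_adp = rng.normal(size=(hidden, 1))

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

gate_l = sigmoid(x @ w_gate_lora)                # shape (2, 1), in (0, 1)
gate_a = sigmoid(x @ w_gate_adp)                 # shape (2, 1), in (0, 1)

lora_out = x @ lora_a @ lora_b                   # low-rank update
adp_out = np.maximum(x @ adp_down, 0) @ adp_up   # bottleneck with ReLU

# Gated combination: base output plus each submodule scaled by its gate,
# so training can learn which submodule helps on a given task/input.
out = x + gate_l * lora_out + gate_a * adp_out
print(out.shape)  # (2, 16)
```

The gates let effective weight shift between submodules per input, which is one way a mixture can beat any single PELT method on its own, as the thread reports.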