Yang Liu (@nlpyang) - Profil Twitter | Zamantika Mersobahis Locabet

Tweet Disematkan

Yang Liu@nlpyang·3 Mar

Missing coding data in your R1? 🔥 Introducing KodCode—the largest verified synthetic coding dataset for Code LLM training! • 447K question–solution–test triplets • 12 diverse subsets • 10-trial solution verification for rock-solid correctness kodcode-ai.github.io

English

3

18

1.6K

Yang Liu me-retweet

Liliang Ren@liliang_ren·31 Mar

Introducing HyperP, a scaling framework gives you better compute efficiency & transferable stability. At 6e21 FLOPs, HyperP reaches 1.58× compute efficiency over a strong Muon baseline; +MoE further gets 3.38× over dense. Gains even grow with scale🤯 📖: arxiv.org/abs/2603.28743

English

7

41

271

30.3K

Yang Liu me-retweet

Zhangchen Xu@zhangchen_xu·6 Mar

Missing coding data in your R1? Introducing KodCode 🐱: a diverse, challenging, and verifiable synthetic dataset for LLM coding! With 447K verified question-solution-test triplets, KodCode is designed for supervised fine-tuning (SFT) and reinforcement learning (RL). 💡Key Features ✨Diverse & Challenging: 5 synthesis methods, 12 subsets covering multiple domains (algorithms to package-specific knowledge) and difficulty levels (basic exercises to competitive programming tasks). ✨Verifiable Correctness: Question-solution-test triplets are systematically validated via a self-verification process with GPT-4o. ✨ Supports RL & SFT: Unit tests enable RL tuning, plus verified CoT responses generated by DeepSeek-R1 🐳 via reject sampling to support SFT. --------------- >> Project Page: kodcode-ai.github.io >> KodCode-V1 (for RL): huggingface.co/datasets/KodCo… >> KodCode-V1-SFT-R1 (for SFT): huggingface.co/datasets/KodCo… >> Paper: arxiv.org/abs/2503.02951 >> Codebase for creating this dataset: github.com/KodCode-AI/kod… Thanks to my great mentor @nlpyang for invaluable guidance and support on this project! 🤩 [1/5]

English

1

3

18

8.7K

Yang Liu@nlpyang·3 Mar

Great work from @zhangchen_xu

English

0

1

262

Yang Liu@nlpyang·3 Mar

Missing coding data in your R1? 🔥 Introducing KodCode—the largest verified synthetic coding dataset for Code LLM training! • 447K question–solution–test triplets • 12 diverse subsets • 10-trial solution verification for rock-solid correctness kodcode-ai.github.io

English

3

18

1.6K

Yang Liu@nlpyang·27 Oca

I don't really understand why people think RL takes less compute than pretrain.

English

2

0

5

622

Yang Liu@nlpyang·22 Ağu

Check our paper so you can really challenge an LLM

Yulong Chen@Yulongchen1010

Evaluating LLMs usually requires sophisticated human designs and with the continuous improvement of LLMs, it is difficult for humans to find their limitations. Can LLMs find their own limitations by proposing questions to themselves? Check our new paper: arxiv.org/abs/2408.08978

English

0

8

1.1K

Yang Liu@nlpyang·15 Haz

@kdcreer Ah, thank you for saving this post

English

0

1.3K

Kraus Crius@kdcrius·15 Haz

@nlpyang Your DMs are closed BTW.

English

1

0

1.4K

Yang Liu@nlpyang·14 Haz

Microsoft GenAI is looking for a summer intern to work on Sparse LLMs, if you are interested, please DM me or send a resume to yaliu10 at microsoft dot com

English

6

39

229

74.9K

Yang Liu@nlpyang·14 Haz

If you want to know our recent research efforts, check the great Samba model twitter.com/liliang_ren/st…

Liliang Ren@liliang_ren

Introducing Samba 3.8B, a simple Mamba+Sliding Window Attention architecture that outperforms Phi3-mini on major benchmarks (e.g., MMLU, GSM8K and HumanEval) by a large margin.😮 And it has an infinite context length with linear complexity.🤯 Paper: arxiv.org/abs/2406.07522 (1/6)

English

0

7

6.2K

Yang Liu@nlpyang·13 Haz

I truly believe now it is the time that you change all your local attentions to SSMs :)

Liliang Ren@liliang_ren

Introducing Samba 3.8B, a simple Mamba+Sliding Window Attention architecture that outperforms Phi3-mini on major benchmarks (e.g., MMLU, GSM8K and HumanEval) by a large margin.😮 And it has an infinite context length with linear complexity.🤯 Paper: arxiv.org/abs/2406.07522 (1/6)

English

1

0

12

2.3K

Yang Liu@nlpyang·14 May

not multilingual enough :)

Atty Eleti@athyuttamre

GPT-4o is also natively multilingual. We've improved our tokenizer for various languages, resulting in, for example, a 3-4x speed improvement for Indian languages.

English

0

2

1.3K

Yang Liu@nlpyang·15 Nis

@BorisMPower So proud we did this❤️

English

0

4

1.3K

Boris Power@BorisMPower·15 Nis

A GPT-4 optimized for Japanese + an office in Tokyo! ❤️🇯🇵 openai.com/blog/introduci…

English

8

9

124

14.2K

Yang Liu@nlpyang·13 Ara

If you are at NeurIPS, you could check out our poster about Efficient Transformer this evening.

English

0

20

2K

Yang Liu@nlpyang·6 Ara

spent some time looking for Gemini's "uncertainty routed" prompt. but still no clue

Boris Power@BorisMPower

The top line number for MMLU is a bit gamed - Gemini is actually worse than GPT-4 when compared on normal few shot or chain of thought

English

0

909

Yang Liu@nlpyang·6 Ara

Wow, google really cares a lot about MMLU 🧐

English

0

3

647

Yang Liu@nlpyang·6 Ara

I have to admit, writing ICLR meta reviews is much more fun than writing ARR ones. What happened here?

English

0

9

1.2K

Yang Liu@nlpyang·2 Ara

After 1 year birthday of ChatGPT, you think it is a good thing or a bad thing for NLP research?

English

0

641

Yang Liu@nlpyang·2 Ara

Check out this amazing work by Zineng! 🚀 We have a model that can interact with you to generate all modalities.

Zineng Tang@ZinengTang

🔥Excited to introduce CoDi-2! It follows complex multimodal-interleaved in-context instructions to generate any modalities (text, vision, audio) in zero/few-shot interactive way! codi-2.github.io huggingface.co/papers/2311.18… @yzy_ai @nlpyang @ChenguangZhu2 @mohitban47 🧵👇

English

0

4

9

3.1K

Yang Liu@nlpyang·21 Kas

Contact with AIG is only a symbol or a switch. Regardless of the content of the encounter, the results would be the same. The impact would be magnified by the lens of human mass psychology and culture until it resulted in substantive influences on the progress of civilization.

English

0

1

2

1.1K

Yang Liu@nlpyang·21 Kas

This madness should stop. And I hope that place is the same as it was last week.

English

0

1

452

Yang Liu

Jelajahi