
Junmo Kang (@JunmoKang)
PhD student @GeorgiaTech working on NLP | Research intern @MITIBMLab

Check out @mohit_rag18's recent work analyzing data annotation costs associated with SFT vs. Preference Fine-Tuning.

🚨 Just out! Can LLMs extract experimental data about themselves from scientific literature to improve our understanding of their behavior? We propose a semi-automated approach for large-scale, continuously updatable meta-analysis to uncover intriguing behaviors in frontier LLMs. 🧵
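(Not from the thread: a minimal Python sketch of the aggregation step such a continuously updatable meta-analysis relies on, assuming a hypothetical upstream extractor has already turned papers into (model, benchmark, score) records; every name and number below is made up.)

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records an upstream (LLM- or human-assisted) extractor
# might produce from published result tables; all values are made up.
records = [
    {"model": "model-A", "benchmark": "GSM8K", "score": 0.82},
    {"model": "model-A", "benchmark": "GSM8K", "score": 0.79},
    {"model": "model-B", "benchmark": "GSM8K", "score": 0.71},
    {"model": "model-A", "benchmark": "MMLU",  "score": 0.86},
]

# Aggregate repeated measurements of the same (model, benchmark) pair,
# so the analysis can be re-run as new papers are extracted.
by_key = defaultdict(list)
for r in records:
    by_key[(r["model"], r["benchmark"])].append(r["score"])

for (model, bench), scores in sorted(by_key.items()):
    print(f"{model} on {bench}: mean={mean(scores):.3f} (n={len(scores)})")
```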

🚨 Just out! Targeted data curation for SFT and RLHF is a significant cost factor 💰 for improving LLM performance during post-training. How should you allocate your data annotation budget between SFT and preference data? We ran 1000+ experiments to find out! 1/7
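(Not from the thread: a toy Python sketch of the budget question, assuming made-up per-example annotation costs and a purely hypothetical diminishing-returns score function; an illustration of the trade-off, not the paper's experimental setup.)

```python
import math

TOTAL_BUDGET = 10_000.0   # annotation budget in dollars (assumed)
COST_SFT = 2.0            # cost per SFT demonstration (assumed)
COST_PREF = 1.0           # cost per preference pair (assumed)

def toy_score(n_sft: int, n_pref: int) -> float:
    # Hypothetical diminishing-returns proxy for post-training quality;
    # the real answer comes from experiments, not this formula.
    return 0.6 * math.log1p(n_sft) + 0.4 * math.log1p(n_pref)

# Sweep the share of budget spent on SFT data and keep the best split.
best_frac, best_val = 0.0, float("-inf")
for i in range(11):
    frac = i / 10
    n_sft = int(frac * TOTAL_BUDGET / COST_SFT)
    n_pref = int((1 - frac) * TOTAL_BUDGET / COST_PREF)
    val = toy_score(n_sft, n_pref)
    if val > best_val:
        best_frac, best_val = frac, val

print(f"Best SFT share of budget under this toy model: {best_frac:.0%}")
```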

🚀 Introducing UniIR, a unified instruction-guided multimodal retriever that handles diverse tasks.
- 1️⃣ model for 8️⃣ retrieval tasks (SoTA w/ instruction tuning)
- Generalizes to unseen retrieval tasks
- M-BEIR: a multimodal retrieval benchmark w/ 10 datasets, 1.1M queries, 5.6M candidates
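(Not from the thread: UniIR itself uses learned multimodal encoders; as a text-only illustration of the instruction-guided idea, this minimal sketch prepends the task instruction to the query and ranks candidates by cosine similarity over a stand-in bag-of-words embedding.)

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Stand-in embedding: bag-of-words counts. UniIR uses learned
    # multimodal encoders; this only shows the scoring interface.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(instruction: str, query: str, candidates: list[str]) -> list[str]:
    # Key idea: the instruction is folded into the query representation,
    # so one model can serve different retrieval tasks.
    q = embed(instruction + " " + query)
    return sorted(candidates, key=lambda c: cosine(q, embed(c)), reverse=True)

# Made-up usage: the instruction steers which candidate ranks first.
docs = ["a photo of a golden retriever", "wikipedia article about dog breeds"]
print(retrieve("retrieve an image matching the caption", "golden retriever", docs)[0])
```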