Jack Lin

41 posts

Jack Lin

@jacklin_64

Katılım Mart 2020

152 Takip Edilen114 Takipçiler

Jack Lin retweetledi

Bryan Catanzaro@ctnzr·24 Mar

Thank you to everyone in the community who is testing and using Nemotron models. It's great to see Nemotron-Cascade-2, Nemotron-3-Super and Nemotron-3-Nano trending on HF. The Nemotron team is working hard to incorporate all your feedback into Nemotron 4. And yes, Nemotron 3 Ultra is still on track for release. huggingface.co/models?pipelin…

English

225

54.8K

Jack Lin retweetledi

AK@_akhaliq·20 Mar

Nvidia just released Nemotron-Cascade 2 on Hugging Face paper: huggingface.co/papers/2603.19… model: huggingface.co/collections/nv…

English

7.3K

Jack Lin retweetledi

DailyPapers@HuggingPapers·20 Mar

NVIDIA just released Nemotron-Cascade 2 on Hugging Face A 30B MoE model with 3B activated parameters that achieves gold medal performance at IMO and IOI 2025.

English

318

28.1K

Jack Lin retweetledi

Wei Ping@_weiping·20 Mar

🚀 Introducing Nemotron-Cascade 2 🚀 Just 3 months after Nemotron-Cascade 1, we’re releasing Nemotron-Cascade 2: an open 30B MoE with 3B active parameters, delivering best-in-class reasoning and strong agentic capabilities. 🥇 Gold Medal-level performance on IMO 2025, IOI 2025, and ICPC World Finals 2025: • Capabilities once thought achievable only by frontier proprietary models (e.g. Gemini Deep Think) or frontier-scale open models (i.e. DeepSeek-V3.2-Speciale-671B-A37B). • Remarkably high intelligence density with 20× fewer parameters. 🏆 Best-in-class across math, code reasoning, alignment, and instruction following: • Outperforms the latest Qwen3.5-35B-A3B (2026-02-24) and even larger Qwen3.5-122B-A10B (2026-03-11). 🧠 Powered by Cascade RL + multi-domain on-policy distillation: • Significantly expand Cascade RL across a much broader range of reasoning and agentic domains than Nemotron-Cascade 1, while distilling from the strongest intermediate teacher models throughout training to recover regressions and sustain gains. 🤗 Model + SFT + RL data: 👉 huggingface.co/collections/nv… 📄 Technical report: 👉 research.nvidia.com/labs/nemotron/…

English

143

897

160.9K

Jack Lin retweetledi

Yangyi Chen@YangyiChen6666·17 Ara

Super proud to introduce my first work at NVIDIA!! Nemotron-Cascade, our RL scaling efforts to build fully open-source general-purpose reasoning models that achieve SoTA performance on math, coding, and SWE. I am extremely honored to join this small but closely-connected team led by the wonderful @_weiping!

English

128

7.2K

Jack Lin@jacklin_64·17 Ara

Check out the first comprehensive study on cascade RL to build general-purpose reasoning models. We also release the training data and the strong 8B 14B General-purpose reasoning models.

Wei Ping@_weiping

🚀 Introducing Nemotron-Cascade! 🚀 We’re thrilled to release Nemotron-Cascade, a family of general-purpose reasoning models trained with cascaded, domain-wise reinforcement learning (Cascade RL), delivering best-in-class performance across a wide range of benchmarks. 💻 Coding powerhouse After RL, our 14B model: • Surpasses DeepSeek-R1-0528 (671B) on LiveCodeBench v5/v6/Pro. • Achieves silver-medal performance at IOI 2025 🥈. • Reaches a 43.1% pass @1 on SWE-Bench Verified, and 53.8% with test-time scaling. 🧠 What is Cascade RL? Instead of mixing heterogeneous prompts across domains, Cascade RL trains sequentially, domain by domain, which reduces engineering complexity, mitigates heterogeneous verification latencies, and enables domain-specific curricula and tailored hyperparameter tuning. ✨ Key insight Using RLHF for alignment as a pre-step dramatically boosts complex reasoning—far beyond preference optimization. Subsequent domain-wise RLVR stages rarely hurt the benchmark performance attained in earlier domains and may even improve it, as illustrated in the following figure. 🤗 Models & training data 🔥 👉 huggingface.co/collections/nv… 📄 Technical report with detailed training and data recipes 👉 arxiv.org/pdf/2512.13607

English

469

Jack Lin retweetledi

Jimmy Lin@lintool·14 Haz

@yupp_ai @UWaterloo Today marks the beginning of this journey for me, and I’m happy to share more details in the coming months! Until then, I hope you’ll try out yupp.ai and share your feedback. (9/9)

English

3.1K

Jack Lin retweetledi

Xueguang Ma@xueguang_ma·27 Şub

Introducing DRAMA🎭: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers. We propose to train a smaller dense retriever using a pruned LLM as the backbone, fine-tuned with diverse LLM data augmentations. With single-stage training, DRAMA achieves strong performance on both English and multilingual retrieval tasks—enabling smaller retrievers to benefit from ongoing LLM advancements.

English

11.4K

Jack Lin retweetledi

Xueguang Ma@xueguang_ma·30 Oca

In this work led by @ShengyaoZhuang , we explore various settings to attack recent document screenshot retrievers like DSE and ColPali. 🚨What you see might not be what you searched for.

Shengyao Zhuang@ShengyaoZhuang

Our new paper, which studies the vulnerability of document screenshot retrievers like DSE and ColPali to pixel poisoning attacks, is now available on Arxiv! arxiv.org/pdf/2501.16902 This work was done with @EkaterinaKhr, @xueguang_ma, @bevan_koopman, @lintool, @guidozuc.

English

619

Jack Lin retweetledi

Victoria X Lin@VictoriaLinML·12 Ara

#NeurIPS2024 I will present "Nearest Neighbor Speculative Decoding for LLM Generation and Attribution" led by @alexlimh23 at the poster session today. ⏰ Thu Dec 12 at 4:30-7:30 PM PST 🏛️ East Exhibit Hall A-C, #2201 🔗 neurips.cc/virtual/2024/p… Please drop by if you would like to chat about semi-parametric language modeling, beyond token-level decoding and generation attribution!

Minghan@alexlimh23

1/ Excited to share that our paper "NEST🪺: Nearest Neighbor Speculative Decoding for LLM Generation and Attribution" is accepted at #NeurIPS2024! 🚀 Catch us at the poster session on Thu, Dec 12, 4:30–7:30 PM PST, East Exhibit Hall A-C, #2201. [Details: neurips.cc/virtual/2024/p…]

English

8.2K

Jack Lin@jacklin_64·11 Ara

I will present our paper FLAME on factuality alignment for LLMs with @luyu_gao at #NeurIPS2024! 🎉 Join us at East Exhibit Hall A-C, Booth #3501 for a chat on Wed (Dec 11, 4:30--7:30 pm). Looking forward to connecting! More detail: neurips.cc/virtual/2024/p…

Xilun Chen@ccsasuke

Introducing FLAME🔥: Factuality-Aware Alignment for LLMs We found that the standard alignment process **encourages** hallucination. We hence propose factuality-aware alignment while maintaining the LLM's general instruction-following capability. arxiv.org/abs/2405.01525

English

2.7K

Jack Lin retweetledi

Jimmy Lin@lintool·2 Ara

Congratulations to Dr. @jacklin_64 for successfully defending his Ph.D. thesis "Building a Robust Retrieval System with Dense Retrieval Models"! 🎉

English

119

10.3K

Jack Lin retweetledi

Nan Wang@nanwang_t·8 Kas

Crucial work in the field of multimodal embeddings! It’s impressive that multimodal embeddings are reaching SOTA-level performance comparable to text-only embeddings in the retrieval tasks.

Jack Lin@jacklin_64

Introducing MM-Embed, the first multimodal retriever achieving SOTA results on the multimodal M-BEIR benchmark and compelling results (among top-5 retrievers) on the text-only MTEB retrieval benchmark. Paper: arxiv.org/abs/2411.02571 🤗 Model: huggingface.co/nvidia/MM-Embed

English

640

Jack Lin@jacklin_64·6 Kas

This project was done while interning at NVIDIA this summer. Big thanks to all the amazing co-authors, @chankyul77 @MohammadShoeybi @lintool @ctnzr and @_weiping

English

279

Jack Lin@jacklin_64·6 Kas

Finally, for challenging multimodal queries, a free performance boost is possible: prompt multimodal LLMs as zero-shot rerankers.

English

316

Jack Lin@jacklin_64·6 Kas

English

8.6K

Jack Lin@jacklin_64·11 Eki

The sky last night was insane! Thanks to Waterloo for this epic aurora show.

English

244

Jack Lin retweetledi

Raphael Tang@ralph_tang·21 Eyl

Our paper on understanding variability in text-to-image models was accepted at #EMNLP2024 main track! Lots of thanks to my collaborators @crystina_z @yaolu_nlp @Wenyan62 @Ulienida and mentors @lintool Pontus @ferhanture. Check out w1kp.com

English

2.8K

Keşfet

@_weiping @yupp_ai @UWaterloo @ShengyaoZhuang @alexlimh23 @luyu_gao @chankyul77 @MohammadShoeybi