

Sam Blouir
@SamBlouir_NLP
Intern @ Amazon AGI Foundations Thanks for coming to our AAAI 2025 Foundation Models for Biology workshop!

Introducing LFM2-VL — our new generation of efficient vision-language models for real-world deployment, from smartphones and laptops to wearables and embedded systems. 🧵

🚀 Introducing NSA: a Hardware-Aligned and Natively Trainable Sparse Attention mechanism for ultra-fast long-context training & inference!

Core components of NSA:
• Dynamic hierarchical sparse strategy
• Coarse-grained token compression
• Fine-grained token selection

💡 With a design optimized for modern hardware, NSA speeds up inference while reducing pre-training costs — without compromising performance. It matches or outperforms Full Attention models on general benchmarks, long-context tasks, and instruction-based reasoning.

📖 For more details, check out our paper here: arxiv.org/abs/2502.11089
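The two token-level components above can be illustrated with a toy sketch: attend over (a) mean-pooled block summaries (coarse compression) plus (b) the raw tokens of the few highest-scoring blocks (fine selection). This is a minimal single-query NumPy illustration of the idea, not NSA's actual kernel; the function name, block pooling, and top-k rule are my own simplifications.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def sparse_attention_sketch(q, K, V, block=4, top_k=2):
    """Toy coarse+fine sparse attention for one query vector.

    Coarse path: compress each block of `block` tokens to its mean.
    Fine path: keep raw tokens from the top_k blocks under coarse scores.
    (A simplification of NSA's hierarchy, for illustration only.)
    """
    T, d = K.shape
    n_blocks = T // block
    Kc = K[: n_blocks * block].reshape(n_blocks, block, d).mean(axis=1)
    Vc = V[: n_blocks * block].reshape(n_blocks, block, d).mean(axis=1)
    coarse = q @ Kc.T / np.sqrt(d)             # score each compressed block
    sel = np.argsort(coarse)[-top_k:]          # select top-k blocks
    idx = np.concatenate(
        [np.arange(b * block, (b + 1) * block) for b in sel]
    )
    K_all = np.vstack([Kc, K[idx]])            # summaries + selected tokens
    V_all = np.vstack([Vc, V[idx]])
    w = softmax(q @ K_all.T / np.sqrt(d))
    return w @ V_all

rng = np.random.default_rng(0)
T, d = 16, 8
q = rng.normal(size=d)
K = rng.normal(size=(T, d))
V = rng.normal(size=(T, d))
out = sparse_attention_sketch(q, K, V)
print(out.shape)
```

With `block=4` and `top_k=2`, the query attends over 4 summaries plus 8 selected raw tokens instead of all 16 keys — the source of the claimed speedups at long context.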


Meta presents: Improving Factuality with Explicit Working Memory

Presents EWE, a novel approach that enhances factuality in long-form text generation by integrating a working memory that receives real-time feedback from external resources.

EWE outperforms strong baselines on four fact-seeking long-form generation datasets, increasing the factuality metric, VeriScore, by 2 to 10 points absolute without sacrificing the helpfulness of the responses.

arxiv.org/abs/2412.18069
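The generate-verify-refresh loop described above can be sketched as follows. Everything here is a hypothetical stand-in — `generate`, `retrieve`, and `verify` stand for an LLM, a retriever, and a fact checker; the real EWE architecture is in the paper.

```python
def generate_with_working_memory(prompt, generate, retrieve, verify, max_rounds=3):
    """Sketch of an EWE-style loop: a working memory of retrieved evidence
    is refreshed whenever the verifier flags unsupported claims, and the
    draft is regenerated against the updated memory."""
    memory = list(retrieve(prompt))          # seed memory with evidence
    draft = generate(prompt, memory)
    for _ in range(max_rounds):
        flagged = verify(draft, memory)      # real-time factuality feedback
        if not flagged:
            break
        for claim in flagged:                # refresh memory per bad claim
            memory.extend(retrieve(claim))
        draft = generate(prompt, memory)     # regenerate with updated memory
    return draft

# Toy stand-ins so the loop runs end to end (not the paper's components):
KB = {"einstein": "Einstein was born in 1879."}

def retrieve(text):
    return [v for k, v in KB.items() if k in text.lower()]

def generate(prompt, memory):
    # Echo the latest evidence if any; otherwise hallucinate a date.
    return memory[-1] if memory else "Einstein was born in 1905."

def verify(draft, memory):
    # Flag the draft as unsupported unless it appears in memory verbatim.
    return [] if draft in memory else [draft]

answer = generate_with_working_memory(
    "When was Einstein born?", generate, retrieve, verify
)
print(answer)
```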
