AdapterHub

170 posts

AdapterHub

AdapterHub

@AdapterHub

A central repository for pre-trained adapter modules in transformers! Active maintainers: @clifapt @h_sterz @LeonEnglaender @timo_imhof @PfeiffJo

Katılım Mayıs 2020
1.3K Takip Edilen1.2K Takipçiler
AdapterHub retweetledi
AdapterHub retweetledi
Leon Engländer
Leon Engländer@LeonEnglaender·
LLM agents are assumed to integrate unexpected environmental observations into their reasoning. It turns out they don't. We added the complete task solution into agent environments as a file or an API endpoint, and measured whether agents act on what they discover. They almost never do. Starkest example: on AppWorld, gpt-oss-120b sees a CLI command documented as "returns the complete solution to this task" in 97.54% of runs. It calls it in 0.53%. Same pattern for GLM-4.7 and other models, across Terminal-Bench, SWE-Bench, and AppWorld. 📜 arxiv.org/abs/2604.17609 🧵👇
Leon Engländer tweet media
English
9
23
140
14.7K
AdapterHub retweetledi
Clifton Poth
Clifton Poth@clifapt·
Took Claude up for a spin on the weekend and started a quick open-source self-hosted re-implementation Thinking Machines' Tinker API: github.com/calpt/open-tin…
English
0
3
7
351
AdapterHub
AdapterHub@AdapterHub·
Also new since v1.0: ✅ Added AdapterPlus ✅ Gradient Checkpointing support for memory efficiency ✅ Push & load complex adapter compositions (Stack, Fuse, etc.) directly via the Hugging Face Hub! These additions make Adapters even more powerful & usable. (4/5)
English
1
0
1
98
AdapterHub
AdapterHub@AdapterHub·
🚀Adapters v1.2 is out!🚀 We've made Adapters incredibly flexible: Add adapter support to ANY Transformer architecture with minimal code! We used this to add 8 new models out-of-the-box, incl. ModernBERT, Gemma3 & Qwen3! Explore this +2 new adapter methods in this thread👇(1/5)
AdapterHub tweet media
English
1
3
22
2.6K
AdapterHub retweetledi
Jonas Pfeiffer
Jonas Pfeiffer@PfeiffJo·
I am hiring a Student Researcher for our Modularity team at the Google DeepMind office in Zurich🇨🇭 Please fill out the interest form if you would like to work with us! The role would start mid/end 2025 and would be in-person in Zurich with 80-100% at GDM forms.gle/N94ViTmKHCCAcv…
English
3
56
295
40.8K
AdapterHub retweetledi
UKP Lab
UKP Lab@UKPLab·
🎉M2QA has been accepted to #EMNLP Findings!🎉 M2QA is a new multilingual and multidomain QA dataset. We show that current transfer methods are insufficient and that language & domain transfer aren't independent! 📄 Paper: arxiv.org/abs/2407.01091 👇👇👇 twitter.com/LeonEnglaender…
Leon Engländer@LeonEnglaender

📢 New preprint 🎉 We introduce "M2QA: Multi-domain Multilingual Question Answering", a benchmark for evaluating joint language and domain transfer. We present 5 key findings - one of them: Current transfer methods are insufficient, even for LLMs! 📜arxiv.org/abs/2407.01091 🧵👇

English
0
2
15
855
AdapterHub retweetledi
AdapterHub
AdapterHub@AdapterHub·
👏 Huge thanks to all contributors and our amazing community! Adapters is an open-source project, and we're excited to see what you build with it and how you use it for your research. If you have questions or ideas, join the discussion on GitHub! github.com/adapter-hub/ad…
English
0
0
5
180
AdapterHub
AdapterHub@AdapterHub·
🎙️ New Models Alert! Adapters now supports: - Whisper: Our first audio model! - Mistral - MT5 - PLBart With Whisper, we bring speech recognition capabilities to our library!🔊 Notebook: github.com/adapter-hub/ad…
English
1
0
5
247
AdapterHub
AdapterHub@AdapterHub·
🎉Adapters 1.0 is here!🚀 Our open-source library for modular and parameter-efficient fine-tuning got a major upgrade! v1.0 is packed with new features (ReFT, Adapter Merging, QLoRA, ...), new models & improvements! Blog: adapterhub.ml/blog/2024/08/a… Highlights in the thread! 🧵👇
English
2
7
44
5.5K
AdapterHub
AdapterHub@AdapterHub·
📢 New preprint 🎉 We - the AdapterHub team - present the M2QA benchmark to evaluate joint domain and language transfer! 🔬 Key highlight: We show that adapter-based methods on small language models can reach the performance of Llama 3 on M2QA! 🚀 👇
AdapterHub tweet media
Leon Engländer@LeonEnglaender

📢 New preprint 🎉 We introduce "M2QA: Multi-domain Multilingual Question Answering", a benchmark for evaluating joint language and domain transfer. We present 5 key findings - one of them: Current transfer methods are insufficient, even for LLMs! 📜arxiv.org/abs/2407.01091 🧵👇

English
0
2
8
617
AdapterHub retweetledi
Leon Engländer
Leon Engländer@LeonEnglaender·
📢 New preprint 🎉 We introduce "M2QA: Multi-domain Multilingual Question Answering", a benchmark for evaluating joint language and domain transfer. We present 5 key findings - one of them: Current transfer methods are insufficient, even for LLMs! 📜arxiv.org/abs/2407.01091 🧵👇
Leon Engländer tweet media
English
2
2
14
4.8K