AdapterHub (@AdapterHub) - Twitter Profile

Pinned Tweet

AdapterHub@AdapterHub·24 Nov

🎉 Exciting news! The new Adapters library for modular and parameter-efficient transfer learning is out! 🤖 Now simplified & disentangled from @huggingface pip install adapters pip install transformers 📄arxiv.org/abs/2311.11077 👾 github.com/adapter-hub/ad… #EMNLP2023 🧵👇

7

101

460

123K

AdapterHub retweeted

Tom Sherborne@tomsherborne·21 Apr

When you give an LLM a task, and a solution, point it to the solution, and then force it to read the solution... ...we still do not actually solve the task. Not even close to 100%. Read @LeonEnglaender's important internship work @cohere investigating exploration for agents

Leon Engländer@LeonEnglaender

LLM agents are assumed to integrate unexpected environmental observations into their reasoning. It turns out they don't. We added the complete task solution into agent environments as a file or an API endpoint, and measured whether agents act on what they discover. They almost never do. Starkest example: on AppWorld, gpt-oss-120b sees a CLI command documented as "returns the complete solution to this task" in 97.54% of runs. It calls it in 0.53%. Same pattern for GLM-4.7 and other models, across Terminal-Bench, SWE-Bench, and AppWorld. 📜 arxiv.org/abs/2604.17609 🧵👇

0

4

9

1K

AdapterHub retweeted

Leon Engländer@LeonEnglaender·21 Apr

LLM agents are assumed to integrate unexpected environmental observations into their reasoning. It turns out they don't. We added the complete task solution into agent environments as a file or an API endpoint, and measured whether agents act on what they discover. They almost never do. Starkest example: on AppWorld, gpt-oss-120b sees a CLI command documented as "returns the complete solution to this task" in 97.54% of runs. It calls it in 0.53%. Same pattern for GLM-4.7 and other models, across Terminal-Bench, SWE-Bench, and AppWorld. 📜 arxiv.org/abs/2604.17609 🧵👇

9

23

140

14.7K

AdapterHub retweeted

Clifton Poth@clifapt·2 Mar

Took Claude up for a spin on the weekend and started a quick open-source self-hosted re-implementation of Thinking Machines' Tinker API: github.com/calpt/open-tin…

0

3

7

351

AdapterHub@AdapterHub·21 May

As always, a huge thanks to our community for the awesome PRs that helped shape this release! 🎉 Read all about v1.2 on our blog: adapterhub.ml/blog/2025/05/a… 💻 Explore the code, try it out & star our repo ⭐: github.com/adapter-hub/ad… (5/5)

0

3

95

AdapterHub@AdapterHub·21 May

Also new since v1.0: ✅ Added AdapterPlus ✅ Gradient Checkpointing support for memory efficiency ✅ Push & load complex adapter compositions (Stack, Fuse, etc.) directly via the Hugging Face Hub! These additions make Adapters even more powerful & usable. (4/5)

1

0

1

98

AdapterHub@AdapterHub·21 May

🚀Adapters v1.2 is out!🚀 We've made Adapters incredibly flexible: Add adapter support to ANY Transformer architecture with minimal code! We used this to add 8 new models out-of-the-box, incl. ModernBERT, Gemma3 & Qwen3! Explore this +2 new adapter methods in this thread👇(1/5)

1

3

22

2.6K

AdapterHub retweeted

Jonas Pfeiffer@PfeiffJo·24 Mar

I am hiring a Student Researcher for our Modularity team at the Google DeepMind office in Zurich🇨🇭 Please fill out the interest form if you would like to work with us! The role would start mid/end 2025 and would be in-person in Zurich with 80-100% at GDM forms.gle/N94ViTmKHCCAcv…

3

56

295

40.8K

AdapterHub@AdapterHub·30 Jan

🎁 A new update of the Adapters library is out! Check out all the novelties, changes & fixes here: github.com/adapter-hub/ad…

0

4

5

641

AdapterHub retweeted

UKP Lab@UKPLab·7 Nov

🎉M2QA has been accepted to #EMNLP Findings!🎉 M2QA is a new multilingual and multidomain QA dataset. We show that current transfer methods are insufficient and that language & domain transfer aren't independent! 📄 Paper: arxiv.org/abs/2407.01091 👇👇👇 twitter.com/LeonEnglaender…

Leon Engländer@LeonEnglaender

📢 New preprint 🎉 We introduce "M2QA: Multi-domain Multilingual Question Answering", a benchmark for evaluating joint language and domain transfer. We present 5 key findings - one of them: Current transfer methods are insufficient, even for LLMs! 📜arxiv.org/abs/2407.01091 🧵👇

0

2

15

855

AdapterHub retweeted

Jinghan Zhang@jinghan23·25 Aug

Thank you @AdapterHub for implementing our #NeurIPS method (arxiv.org/abs/2306.14870) in your latest update! 🎉 Great to see our work being applied for practical advancements. Check out their work! #MachineLearning #AdapterMerging #ModelMerging

AdapterHub@AdapterHub

🎉Adapters 1.0 is here!🚀 Our open-source library for modular and parameter-efficient fine-tuning got a major upgrade! v1.0 is packed with new features (ReFT, Adapter Merging, QLoRA, ...), new models & improvements! Blog: adapterhub.ml/blog/2024/08/a… Highlights in the thread! 🧵👇

0

2

11

1.5K

AdapterHub@AdapterHub·12 Aug

👏 Huge thanks to all contributors and our amazing community! Adapters is an open-source project, and we're excited to see what you build with it and how you use it for your research. If you have questions or ideas, join the discussion on GitHub! github.com/adapter-hub/ad…

0

5

180

AdapterHub@AdapterHub·12 Aug

🎙️ New Models Alert! Adapters now supports: - Whisper: Our first audio model! - Mistral - MT5 - PLBart With Whisper, we bring speech recognition capabilities to our library!🔊 Notebook: github.com/adapter-hub/ad…

1

0

5

247

AdapterHub@AdapterHub·12 Aug

🎉Adapters 1.0 is here!🚀 Our open-source library for modular and parameter-efficient fine-tuning got a major upgrade! v1.0 is packed with new features (ReFT, Adapter Merging, QLoRA, ...), new models & improvements! Blog: adapterhub.ml/blog/2024/08/a… Highlights in the thread! 🧵👇

2

7

44

5.5K

AdapterHub@AdapterHub·2 Jul

📢 New preprint 🎉 We - the AdapterHub team - present the M2QA benchmark to evaluate joint domain and language transfer! 🔬 Key highlight: We show that adapter-based methods on small language models can reach the performance of Llama 3 on M2QA! 🚀 👇

Leon Engländer@LeonEnglaender

📢 New preprint 🎉 We introduce "M2QA: Multi-domain Multilingual Question Answering", a benchmark for evaluating joint language and domain transfer. We present 5 key findings - one of them: Current transfer methods are insufficient, even for LLMs! 📜arxiv.org/abs/2407.01091 🧵👇

0

2

8

617

AdapterHub retweeted

Leon Engländer@LeonEnglaender·2 Jul

📢 New preprint 🎉 We introduce "M2QA: Multi-domain Multilingual Question Answering", a benchmark for evaluating joint language and domain transfer. We present 5 key findings - one of them: Current transfer methods are insufficient, even for LLMs! 📜arxiv.org/abs/2407.01091 🧵👇

2

14

4.8K
