Martin Genzel
126 posts

Martin Genzel
@MartinGenzel
Staff Machine Learning Researcher @MMerantix | Applied mathematician | Interested in Deep Learning, LLMs, Model Compression & Efficiency
Berlin, Germany Katılım Haziran 2021
614 Takip Edilen273 Takipçiler
Sabitlenmiş Tweet
Martin Genzel retweetledi

…catch the gang presenting this one in Seoul!
Mattes Mollenhauer@gaussianmeasure
New paper! We compress LLM parameters down to effective lengths of 2 bits/parameter while retaining 8 bit performance. The core idea is to first quantize with an entropy regularizer and then use lossless compression in the latent space.
English

👏 Shout-out to all co-authors for an amazing collaboration: @patrickputzky, @gaussianmeasure, Sebastian Schulze, Thomas Wollmann, and Stefan Dietzel
📄 Paper: arxiv.org/abs/2601.22787
🧑💻 Code: github.com/merantix-momen…
English
Martin Genzel retweetledi

We're releasing the DASLab GGUF Quantization Toolkit! 🚀
First open-source toolkit bringing GPTQ + EvoPress to @ggerganov's GGUF format, enabling heterogeneous quantization based on importance.
Result: Better models at the same file size.
[1/5]

English

Very much looking forward to it 😀 The whole @MMerantix research team will be at ICML! Let's connect 🤝
More details about our work on LLM compression 👉 x.com/MartinGenzel/s…
Mattes Mollenhauer@gaussianmeasure
I’m in Vancouver! Reach out if you want to grab a coffee and talk learning theory, model compression, Markov processes, inverse problems… We’ll also present a poster at Efficient Systems for Foundation Models Workshop!
English

Image sources:
Llama: pixabay.com/illustrations/…
Slider Tool: squoosh.app
English

👏 Big shout out to all co-authors for an amazing collab: @patrickputzky, Pengfei Zhao, Sebastian Schulze, @gaussianmeasure, Robert Seidel, Stefan Dietzel, and Thomas Wollmann
📄 Paper: arxiv.org/abs/2502.01717
🧑💻 Code: github.com/merantix-momen…
🤗 Models: tinyurl.com/bdacy658
Deutsch

📢 Excited to share our latest research @MMerantix on Any Compression of Foundation Models.
We all know how intuitive and seamless image compression is: use a slider to specify your target size and get an instant preview.
Our quest: Can compressing an LLM be just as easy?
🧵👇
English
