Elisabeth Rumetshofer

26 posts

Elisabeth Rumetshofer banner
Elisabeth Rumetshofer

Elisabeth Rumetshofer

@LizRumetshofer

PhD candidate @ JKU Linz, Institute for Machine Learning.

Katılım Nisan 2015
85 Takip Edilen161 Takipçiler
Elisabeth Rumetshofer retweetledi
Sepp Hochreiter
Sepp Hochreiter@HochreiterSepp·
I am so excited that xLSTM is out. LSTM is close to my heart - for more than 30 years now. With xLSTM we close the gap to existing state-of-the-art LLMs. With NXAI we have started to build our own European LLMs. I am very proud of my team. arxiv.org/abs/2405.04517
English
47
360
1.8K
276.5K
Elisabeth Rumetshofer
Elisabeth Rumetshofer@LizRumetshofer·
🎉 Exciting news! Our latest work has been published in Nature Communications. 🎉 CLOOME utilizes contrastive learning to connect microscopy images and chemical structures, paving the way for major advancements in drug discovery and beyond.🌟🔬💊 📜nature.com/articles/s4146…
Ana Sanchez-Fernandez@ana_sanchezf

Very excited to share that our work, CLOOME, has been now published in @NatureComms! CLOOME introduces chemical structure querying for bioimaging databases. 🧵

English
0
8
26
3.1K
Elisabeth Rumetshofer retweetledi
Fabian Paischer
Fabian Paischer@PaischerFabian·
Thanks @_akhaliq for sharing! SITTA unlocks zero-shot image captioning via a generative language model by aligning its embedding space with that of a pretrained vision encoder without any access to gradient information. 1/6
Fabian Paischer tweet media
AK@_akhaliq

SITTA: A Semantic Image-Text Alignment for Image Captioning paper page: huggingface.co/papers/2307.05… Textual and semantic comprehension of images is essential for generating proper captions. The comprehension requires detection of objects, modeling of relations between them, an assessment of the semantics of the scene and, finally, representing the extracted knowledge in a language space. To achieve rich language capabilities while ensuring good image-language mappings, pretrained language models (LMs) were conditioned on pretrained multi-modal (image-text) models that allow for image inputs. This requires an alignment of the image representation of the multi-modal model with the language representations of a generative LM. However, it is not clear how to best transfer semantics detected by the vision encoder of the multi-modal model to the LM. We introduce two novel ways of constructing a linear mapping that successfully transfers semantics between the embedding spaces of the two pretrained models. The first aligns the embedding space of the multi-modal language encoder with the embedding space of the pretrained LM via token correspondences. The latter leverages additional data that consists of image-text pairs to construct the mapping directly from vision to language space. Using our semantic mappings, we unlock image captioning for LMs without access to gradient information. By using different sources of data we achieve strong captioning performance on MS-COCO and Flickr30k datasets. Even in the face of limited data, our method partly exceeds the performance of other zero-shot and even finetuned competitors. Our ablation studies show that even LMs at a scale of merely 250M parameters can generate decent captions employing our semantic mappings. Our approach makes image captioning more accessible for institutions with restricted computational resources.

English
1
37
74
49.8K
Elisabeth Rumetshofer retweetledi
Kajetan Schweighofer
Kajetan Schweighofer@kschweig_·
🚀 Excited to share our latest research on quantifying the predictive uncertainty of machine learning models. QUAM searches for adversarial models (not adversarial examples!) to better estimate the epistemic uncertainty, the uncertainty about chosen model parameters. 1/5
Kajetan Schweighofer tweet media
English
4
65
247
57.9K
Elisabeth Rumetshofer retweetledi
Thomas Schmied
Thomas Schmied@thsschmied·
Excited to share our recent work on parameter-efficient fine-tuning in RL. We pre-train a Decision Transformer (DT) on 50 tasks from two domains, and subsequently fine-tune on various down-stream tasks. Joint work with @mrkhof, @PaischerFabian, Razvan, and @HochreiterSepp. 1/n
Thomas Schmied tweet media
English
1
19
43
6.6K
Elisabeth Rumetshofer retweetledi
Fabian Paischer
Fabian Paischer@PaischerFabian·
Excited to share our latest work on a semantic and interpretable memory module for RL! Complementary to recent developments in the realm of explainable AI, we focus on interpretability w.r.t. the memory of an agent. 1/n
Fabian Paischer tweet media
Jeff Clune@jeffclune

Introducing Thought Cloning: AI agents learn to *think* & act like humans by imitating the thoughts & actions of humans thinking out loud while acting, enhancing performance, efficiency, generalization, AI Safety & Interpretability. Led by @shengranhu arxiv.org/abs/2306.00323 1/5

English
1
44
101
33.4K
Elisabeth Rumetshofer
Elisabeth Rumetshofer@LizRumetshofer·
Really looking forward to presenting our work CLOOB, a multi-modal contrastive learning method, at #NeurIPS2022 🤩🤩🤩See you at our poster! 🫵
English
0
9
44
0
Elisabeth Rumetshofer retweetledi
Günter Klambauer
Günter Klambauer@gklambauer·
The ELLIS ML4Molecules workshop will also happen this year on November 28 in VIRTUAL format! Please find the announcement and the call for papers here: moleculediscovery.github.io/workshop2022/ Looking forward to your contributions!
Günter Klambauer tweet media
English
2
79
213
0
Elisabeth Rumetshofer retweetledi
Christian Steinparz
Christian Steinparz@CSteinparz·
Thrilled to announce that our work on "Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning" got accepted at #CoLLAs2022. We introduce various domain shifts in a grid world in a controlled manner. 1/5
GIF
English
1
15
24
0
Elisabeth Rumetshofer retweetledi
Sepp Hochreiter
Sepp Hochreiter@HochreiterSepp·
Message me or meet me at ICML. If you are interested in top notch research in machine learning, we have open PhD and Postdoc positions in Linz/Vienna. We do research in DL, RL, Self-Supervised, Drug Discovery, Earth Sciences and Climate Change. Apply: bit.ly/3cx1n8r
English
4
32
107
0
Elisabeth Rumetshofer retweetledi
Martin Gauch
Martin Gauch@martingauch·
Interested in few-shot learning beyond miniImageNet? Deep nets often struggle to predict systems with varying parameters when data are scarce. Our new few-shot learning method SubGD is here to help! arxiv.org/abs/2206.03483 🧵 1/5
GIF
English
4
42
155
0
Elisabeth Rumetshofer retweetledi
Fabian Paischer
Fabian Paischer@PaischerFabian·
Excited to share our work on history compression via language models in RL, presented at #ICML2022🤩🤩. Our novel framework HELM⎈ augments an agent with a history compression module which leverages a pretrained language Transformer without any training or finetuning 🤯🤯 1/5
GIF
English
3
22
74
0
Elisabeth Rumetshofer
Elisabeth Rumetshofer@LizRumetshofer·
If you are interested how CLOOB performs in a LiT setting, check out our notebook.
English
0
3
12
0