Markus Hofmarcher

24 posts

Markus Hofmarcher

Markus Hofmarcher

@mrkhof

PhD student @ JKU Linz, Institute for Machine Learning

Katılım Aralık 2018
17 Takip Edilen92 Takipçiler
Markus Hofmarcher retweetledi
Adrian
Adrian@artificiadrian·
Automated knowledge graph generation pipelines often jump from unstructured text to triplets, producing inconsistent graphs that work for one-hop fact lookup but fail at multi-hop reasoning and degrade as they grow. We introduce HyDRA, a consistency-first pipeline.🧵 (1/9)
GIF
English
1
1
4
333
Markus Hofmarcher retweetledi
Marius-Constantin Dinu
Marius-Constantin Dinu@DinuMariusC·
Excited to present our work “Large Language Models Can Self-Improve At Web Agent Tasks”. We show that synthetic data self-improvement boosts task completion by 31% on WebArena and introduce quality metrics for measuring autonomous agent workflows. #AI #MachineLearning #LLMs [1/n]
Marius-Constantin Dinu tweet media
English
5
19
69
13.6K
Markus Hofmarcher retweetledi
Sepp Hochreiter
Sepp Hochreiter@HochreiterSepp·
I am so excited that xLSTM is out. LSTM is close to my heart - for more than 30 years now. With xLSTM we close the gap to existing state-of-the-art LLMs. With NXAI we have started to build our own European LLMs. I am very proud of my team. arxiv.org/abs/2405.04517
English
47
360
1.8K
276.5K
Markus Hofmarcher retweetledi
Marius-Constantin Dinu
Marius-Constantin Dinu@DinuMariusC·
🚀 SymbolicAI – a framework for logic-based approaches combining generative models and solvers. Alongside, we introduce a benchmark and empirical measure to evaluate SOTA LLMs in AI-centric workflows. Read more in our paper arxiv.org/abs/2402.00854 #MachineLearning 🧠💡[1/n]
English
2
62
235
56.3K
Markus Hofmarcher retweetledi
Fabian Paischer
Fabian Paischer@PaischerFabian·
Interested in a semantic memory for reinforcement learning? I was recently invited to a podcast talking about our #NeurIPS2023 paper: Semantic HELM (arxiv.org/abs/2306.09312). In case you are interested, you can stream the episode here: open.spotify.com/episode/4n2lmC…
Fabian Paischer@PaischerFabian

Excited to share our latest work on a semantic and interpretable memory module for RL! Complementary to recent developments in the realm of explainable AI, we focus on interpretability w.r.t. the memory of an agent. 1/n

English
1
15
23
2K
Markus Hofmarcher retweetledi
Elisabeth Rumetshofer
Elisabeth Rumetshofer@LizRumetshofer·
🎉 Exciting news! Our latest work has been published in Nature Communications. 🎉 CLOOME utilizes contrastive learning to connect microscopy images and chemical structures, paving the way for major advancements in drug discovery and beyond.🌟🔬💊 📜nature.com/articles/s4146…
Ana Sanchez-Fernandez@ana_sanchezf

Very excited to share that our work, CLOOME, has been now published in @NatureComms! CLOOME introduces chemical structure querying for bioimaging databases. 🧵

English
0
8
26
3.1K
Markus Hofmarcher retweetledi
Johannes Brandstetter
Johannes Brandstetter@jo_brandstetter·
Personal update: last month, I re-joined the group of my mentor @HochreiterSepp and my amazing colleague @gklambauer in Linz, opening my own group "AI for data-driven simulations". We all share the vision to create a large-scale AI ecosystem in Linz. Big news to come soon 🚀
Johannes Brandstetter tweet media
English
18
24
244
41.4K
Markus Hofmarcher retweetledi
Fabian Paischer
Fabian Paischer@PaischerFabian·
Thanks @_akhaliq for sharing! SITTA unlocks zero-shot image captioning via a generative language model by aligning its embedding space with that of a pretrained vision encoder without any access to gradient information. 1/6
Fabian Paischer tweet media
AK@_akhaliq

SITTA: A Semantic Image-Text Alignment for Image Captioning paper page: huggingface.co/papers/2307.05… Textual and semantic comprehension of images is essential for generating proper captions. The comprehension requires detection of objects, modeling of relations between them, an assessment of the semantics of the scene and, finally, representing the extracted knowledge in a language space. To achieve rich language capabilities while ensuring good image-language mappings, pretrained language models (LMs) were conditioned on pretrained multi-modal (image-text) models that allow for image inputs. This requires an alignment of the image representation of the multi-modal model with the language representations of a generative LM. However, it is not clear how to best transfer semantics detected by the vision encoder of the multi-modal model to the LM. We introduce two novel ways of constructing a linear mapping that successfully transfers semantics between the embedding spaces of the two pretrained models. The first aligns the embedding space of the multi-modal language encoder with the embedding space of the pretrained LM via token correspondences. The latter leverages additional data that consists of image-text pairs to construct the mapping directly from vision to language space. Using our semantic mappings, we unlock image captioning for LMs without access to gradient information. By using different sources of data we achieve strong captioning performance on MS-COCO and Flickr30k datasets. Even in the face of limited data, our method partly exceeds the performance of other zero-shot and even finetuned competitors. Our ablation studies show that even LMs at a scale of merely 250M parameters can generate decent captions employing our semantic mappings. Our approach makes image captioning more accessible for institutions with restricted computational resources.

English
1
37
74
49.8K
Markus Hofmarcher retweetledi
AK
AK@_akhaliq·
SITTA: A Semantic Image-Text Alignment for Image Captioning paper page: huggingface.co/papers/2307.05… Textual and semantic comprehension of images is essential for generating proper captions. The comprehension requires detection of objects, modeling of relations between them, an assessment of the semantics of the scene and, finally, representing the extracted knowledge in a language space. To achieve rich language capabilities while ensuring good image-language mappings, pretrained language models (LMs) were conditioned on pretrained multi-modal (image-text) models that allow for image inputs. This requires an alignment of the image representation of the multi-modal model with the language representations of a generative LM. However, it is not clear how to best transfer semantics detected by the vision encoder of the multi-modal model to the LM. We introduce two novel ways of constructing a linear mapping that successfully transfers semantics between the embedding spaces of the two pretrained models. The first aligns the embedding space of the multi-modal language encoder with the embedding space of the pretrained LM via token correspondences. The latter leverages additional data that consists of image-text pairs to construct the mapping directly from vision to language space. Using our semantic mappings, we unlock image captioning for LMs without access to gradient information. By using different sources of data we achieve strong captioning performance on MS-COCO and Flickr30k datasets. Even in the face of limited data, our method partly exceeds the performance of other zero-shot and even finetuned competitors. Our ablation studies show that even LMs at a scale of merely 250M parameters can generate decent captions employing our semantic mappings. Our approach makes image captioning more accessible for institutions with restricted computational resources.
AK tweet media
English
0
15
66
54.2K
Markus Hofmarcher retweetledi
Kajetan Schweighofer
Kajetan Schweighofer@kschweig_·
🚀 Excited to share our latest research on quantifying the predictive uncertainty of machine learning models. QUAM searches for adversarial models (not adversarial examples!) to better estimate the epistemic uncertainty, the uncertainty about chosen model parameters. 1/5
Kajetan Schweighofer tweet media
English
4
65
247
57.9K
Markus Hofmarcher retweetledi
Thomas Schmied
Thomas Schmied@thsschmied·
Excited to share our recent work on parameter-efficient fine-tuning in RL. We pre-train a Decision Transformer (DT) on 50 tasks from two domains, and subsequently fine-tune on various down-stream tasks. Joint work with @mrkhof, @PaischerFabian, Razvan, and @HochreiterSepp. 1/n
Thomas Schmied tweet media
English
1
19
43
6.6K
Markus Hofmarcher retweetledi
Fabian Paischer
Fabian Paischer@PaischerFabian·
Excited to share our latest work on a semantic and interpretable memory module for RL! Complementary to recent developments in the realm of explainable AI, we focus on interpretability w.r.t. the memory of an agent. 1/n
Fabian Paischer tweet media
Jeff Clune@jeffclune

Introducing Thought Cloning: AI agents learn to *think* & act like humans by imitating the thoughts & actions of humans thinking out loud while acting, enhancing performance, efficiency, generalization, AI Safety & Interpretability. Led by @shengranhu arxiv.org/abs/2306.00323 1/5

English
1
44
101
33.4K
Markus Hofmarcher retweetledi
Marius-Constantin Dinu
Marius-Constantin Dinu@DinuMariusC·
We are excited to present our work, combining the power of a symbolic approach and Large Language Models (LLMs). Our Symbolic API bridges the gap between classical programming (Software 1.0) and differentiable programming (Software 2.0). GitHub: github.com/Xpitfire/symbo… [1/n]
Marius-Constantin Dinu tweet media
English
22
124
585
220.3K
Markus Hofmarcher retweetledi
Marius-Constantin Dinu
Marius-Constantin Dinu@DinuMariusC·
This includes fact-based generation of text, flow control of a generative process towards a desired outcome, and interpretability within generative processes. GitHub: github.com/Xpitfire/symbo… [5/n]
English
4
12
55
4.7K
Markus Hofmarcher retweetledi
Fabian Paischer
Fabian Paischer@PaischerFabian·
Excited to share our work on history compression via language models in RL, presented at #ICML2022🤩🤩. Our novel framework HELM⎈ augments an agent with a history compression module which leverages a pretrained language Transformer without any training or finetuning 🤯🤯 1/5
GIF
English
3
22
74
0
Markus Hofmarcher retweetledi
Forest
Forest@forestapp_cc·
【Final Sprint: #1MTreeChallenge】 Forest has in total planted 980 thousand trees and is about to hit 1 million now! Let’s cross this milestone together: we will donate 1 tree for every 10 Likes or 1 Retweet of this tweet.🌲 Save the Earth at your fingertips🌏!
Forest tweet media
English
62
5.8K
8.2K
0