Markus Hofmarcher (@mrkhof) - Twitter Profili | Zamantika Mersobahis Locabet

Markus Hofmarcher retweetledi

Adrian@artificiadrian·23 Tem

Automated knowledge graph generation pipelines often jump from unstructured text to triplets, producing inconsistent graphs that work for one-hop fact lookup but fail at multi-hop reasoning and degrade as they grow. We introduce HyDRA, a consistency-first pipeline.🧵 (1/9)

GIF

English

1

4

333

Markus Hofmarcher retweetledi

Marius-Constantin Dinu@DinuMariusC·31 May

Excited to present our work “Large Language Models Can Self-Improve At Web Agent Tasks”. We show that synthetic data self-improvement boosts task completion by 31% on WebArena and introduce quality metrics for measuring autonomous agent workflows. #AI #MachineLearning #LLMs [1/n]

English

5

19

69

13.6K

Markus Hofmarcher retweetledi

Sepp Hochreiter@HochreiterSepp·8 May

I am so excited that xLSTM is out. LSTM is close to my heart - for more than 30 years now. With xLSTM we close the gap to existing state-of-the-art LLMs. With NXAI we have started to build our own European LLMs. I am very proud of my team. arxiv.org/abs/2405.04517

English

47

360

1.8K

276.5K

Markus Hofmarcher retweetledi

Marius-Constantin Dinu@DinuMariusC·2 Şub

🚀 SymbolicAI – a framework for logic-based approaches combining generative models and solvers. Alongside, we introduce a benchmark and empirical measure to evaluate SOTA LLMs in AI-centric workflows. Read more in our paper arxiv.org/abs/2402.00854 #MachineLearning 🧠💡[1/n]

English

2

62

235

56.3K

Markus Hofmarcher retweetledi

Fabian Paischer@PaischerFabian·7 Ara

Interested in a semantic memory for reinforcement learning? I was recently invited to a podcast talking about our #NeurIPS2023 paper: Semantic HELM (arxiv.org/abs/2306.09312). In case you are interested, you can stream the episode here: open.spotify.com/episode/4n2lmC…

Fabian Paischer@PaischerFabian

Excited to share our latest work on a semantic and interpretable memory module for RL! Complementary to recent developments in the realm of explainable AI, we focus on interpretability w.r.t. the memory of an agent. 1/n

English

1

15

23

2K

Markus Hofmarcher retweetledi

Elisabeth Rumetshofer@LizRumetshofer·14 Kas

🎉 Exciting news! Our latest work has been published in Nature Communications. 🎉 CLOOME utilizes contrastive learning to connect microscopy images and chemical structures, paving the way for major advancements in drug discovery and beyond.🌟🔬💊 📜nature.com/articles/s4146…

Ana Sanchez-Fernandez@ana_sanchezf

Very excited to share that our work, CLOOME, has been now published in @NatureComms! CLOOME introduces chemical structure querying for bioimaging databases. 🧵

English

0

8

26

3.1K

Markus Hofmarcher retweetledi

Johannes Brandstetter@jo_brandstetter·6 Kas

Personal update: last month, I re-joined the group of my mentor @HochreiterSepp and my amazing colleague @gklambauer in Linz, opening my own group "AI for data-driven simulations". We all share the vision to create a large-scale AI ecosystem in Linz. Big news to come soon 🚀

English

18

24

244

41.4K

Markus Hofmarcher retweetledi

Fabian Paischer@PaischerFabian·13 Tem

Thanks @_akhaliq for sharing! SITTA unlocks zero-shot image captioning via a generative language model by aligning its embedding space with that of a pretrained vision encoder without any access to gradient information. 1/6

AK@_akhaliq

SITTA: A Semantic Image-Text Alignment for Image Captioning paper page: huggingface.co/papers/2307.05… Textual and semantic comprehension of images is essential for generating proper captions. The comprehension requires detection of objects, modeling of relations between them, an assessment of the semantics of the scene and, finally, representing the extracted knowledge in a language space. To achieve rich language capabilities while ensuring good image-language mappings, pretrained language models (LMs) were conditioned on pretrained multi-modal (image-text) models that allow for image inputs. This requires an alignment of the image representation of the multi-modal model with the language representations of a generative LM. However, it is not clear how to best transfer semantics detected by the vision encoder of the multi-modal model to the LM. We introduce two novel ways of constructing a linear mapping that successfully transfers semantics between the embedding spaces of the two pretrained models. The first aligns the embedding space of the multi-modal language encoder with the embedding space of the pretrained LM via token correspondences. The latter leverages additional data that consists of image-text pairs to construct the mapping directly from vision to language space. Using our semantic mappings, we unlock image captioning for LMs without access to gradient information. By using different sources of data we achieve strong captioning performance on MS-COCO and Flickr30k datasets. Even in the face of limited data, our method partly exceeds the performance of other zero-shot and even finetuned competitors. Our ablation studies show that even LMs at a scale of merely 250M parameters can generate decent captions employing our semantic mappings. Our approach makes image captioning more accessible for institutions with restricted computational resources.

English

1

37

74

49.8K

Markus Hofmarcher retweetledi

AK@_akhaliq·13 Tem

SITTA: A Semantic Image-Text Alignment for Image Captioning paper page: huggingface.co/papers/2307.05… Textual and semantic comprehension of images is essential for generating proper captions. The comprehension requires detection of objects, modeling of relations between them, an assessment of the semantics of the scene and, finally, representing the extracted knowledge in a language space. To achieve rich language capabilities while ensuring good image-language mappings, pretrained language models (LMs) were conditioned on pretrained multi-modal (image-text) models that allow for image inputs. This requires an alignment of the image representation of the multi-modal model with the language representations of a generative LM. However, it is not clear how to best transfer semantics detected by the vision encoder of the multi-modal model to the LM. We introduce two novel ways of constructing a linear mapping that successfully transfers semantics between the embedding spaces of the two pretrained models. The first aligns the embedding space of the multi-modal language encoder with the embedding space of the pretrained LM via token correspondences. The latter leverages additional data that consists of image-text pairs to construct the mapping directly from vision to language space. Using our semantic mappings, we unlock image captioning for LMs without access to gradient information. By using different sources of data we achieve strong captioning performance on MS-COCO and Flickr30k datasets. Even in the face of limited data, our method partly exceeds the performance of other zero-shot and even finetuned competitors. Our ablation studies show that even LMs at a scale of merely 250M parameters can generate decent captions employing our semantic mappings. Our approach makes image captioning more accessible for institutions with restricted computational resources.

English

0

15

66

54.2K

Markus Hofmarcher retweetledi

Kajetan Schweighofer@kschweig_·11 Tem

🚀 Excited to share our latest research on quantifying the predictive uncertainty of machine learning models. QUAM searches for adversarial models (not adversarial examples!) to better estimate the epistemic uncertainty, the uncertainty about chosen model parameters. 1/5

English

4

65

247

57.9K

Markus Hofmarcher retweetledi

Thomas Schmied@thsschmied·27 Haz

Excited to share our recent work on parameter-efficient fine-tuning in RL. We pre-train a Decision Transformer (DT) on 50 tasks from two domains, and subsequently fine-tune on various down-stream tasks. Joint work with @mrkhof, @PaischerFabian, Razvan, and @HochreiterSepp. 1/n

English

1

19

43

6.6K

Markus Hofmarcher retweetledi

Fabian Paischer@PaischerFabian·16 Haz

Excited to share our latest work on a semantic and interpretable memory module for RL! Complementary to recent developments in the realm of explainable AI, we focus on interpretability w.r.t. the memory of an agent. 1/n

Jeff Clune@jeffclune

Introducing Thought Cloning: AI agents learn to *think* & act like humans by imitating the thoughts & actions of humans thinking out loud while acting, enhancing performance, efficiency, generalization, AI Safety & Interpretability. Led by @shengranhu arxiv.org/abs/2306.00323 1/5

English

1

44

101

33.4K

Markus Hofmarcher retweetledi

Johannes Schimunek@JSchimunek·26 Nis

🚀 Excited to share our #ICLR2023 work on 🚨 context-enriched molecule representations🚦 improve few-shot drug discovery 💊 🚨 Paper: openreview.net/pdf?id=XrMWUuE… App: HuggingFace 🤗 under prep! #ICLR2023 🧑‍💼 poster 🗨: iclr.cc/virtual/2023/p… ⏰ Wed 3 May 4:30 pm - 6:30 pm CAT

English

2

21

41

6.5K

Markus Hofmarcher retweetledi

Marius-Constantin Dinu@DinuMariusC·20 Oca

We are excited to present our work, combining the power of a symbolic approach and Large Language Models (LLMs). Our Symbolic API bridges the gap between classical programming (Software 1.0) and differentiable programming (Software 2.0). GitHub: github.com/Xpitfire/symbo… [1/n]

English

22

124

585

220.3K

Markus Hofmarcher retweetledi

Marius-Constantin Dinu@DinuMariusC·20 Oca

This includes fact-based generation of text, flow control of a generative process towards a desired outcome, and interpretability within generative processes. GitHub: github.com/Xpitfire/symbo… [5/n]

English

4

12

55

4.7K

Markus Hofmarcher retweetledi

Fabian Paischer@PaischerFabian·25 May

Excited to share our work on history compression via language models in RL, presented at #ICML2022🤩🤩. Our novel framework HELM⎈ augments an agent with a history compression module which leverages a pretrained language Transformer without any training or finetuning 🤯🤯 1/5

GIF

English

3

22

74

0

Markus Hofmarcher retweetledi

Johannes Brandstetter@jo_brandstetter·22 Eki

Wow, wanna see how to beat CLIP with the new CLOOB? Fantastic work lead by my colleagues @fuerst_andreas and @LizRumetshofer (Sepp Hochreiter's group) applying modern Hopfield networks to image-text data. Paper: arxiv.org/abs/2110.11316 Blogpost: ml-jku.github.io/cloob

English

2

39

87

0

Markus Hofmarcher retweetledi

Johannes Brandstetter@jo_brandstetter·29 Oca

Our paper "Hopfield Networks is All You Need" is accepted at #ICLR2021. Time to give some talks :) I am very honored to present our research today at the great platform of @ml_collective @savvyRL (mlcollective.org/dlct/).

English

6

42

176

0

Markus Hofmarcher retweetledi

Forest@forestapp_cc·25 Oca

【Final Sprint: #1MTreeChallenge】 Forest has in total planted 980 thousand trees and is about to hit 1 million now! Let’s cross this milestone together: we will donate 1 tree for every 10 Likes or 1 Retweet of this tweet.🌲 Save the Earth at your fingertips🌏!

English

62

5.8K

8.2K

0

Markus Hofmarcher retweetledi

Johannes Brandstetter@jo_brandstetter·14 Eki

📢 Representation fusion to boost few-shot learning paper link: arxiv.org/abs/2010.06498 blog post: ml-jku.github.io/chef/

English

0

11

14

0

Markus Hofmarcher

Keşfet