Fabio Fehr

11 posts

Fabio Fehr banner
Fabio Fehr

Fabio Fehr

@FabioFehr

Idiap Researcher Assistant / EPFL PhD Candidate

Katılım Mart 2022
139 Takip Edilen59 Takipçiler
Fabio Fehr
Fabio Fehr@FabioFehr·
@DamienTeney Thanks @DamienTeney for the encouragement! The higher spearman's correlation shows the length is more adaptive to shorter and longer length summaries. In the Appendix (Figure 5 and Table 7) Curation Corpus and CNN/DM dataset we get improved OOD performance for longer summaries.
English
0
0
1
34
Damien Teney
Damien Teney@DamienTeney·
@FabioFehr Interesting results. To be totally convincing, I'd like to see the same exp with ID/OOD data swapped. Eg if shorter summaries are better OOD, you want to make sure there's not simply a systematic "making shorter summaries" happening (which wd be detrimental with sets swapped).
English
1
0
0
38
Fabio Fehr
Fabio Fehr@FabioFehr·
I am so excited by our latest work on ArXiv! TL;DR: We apply NVIB to pretrained Transformers which allows for an information theoretic post-training regularisation. With no weight updates we achieve improved performance in out-of-domain generalisation! arxiv.org/abs/2312.00662
Fabio Fehr tweet media
English
1
3
10
566
Fabio Fehr
Fabio Fehr@FabioFehr·
I am really excited our short paper "Learning to Abstract with Nonparametric Variational Information Bottleneck" got accepted to the findings of EMNLP 2023! A big thanks to my college Melika Behjati and supervisor James Henderson
Fabio Fehr tweet media
English
2
1
12
1.1K
Fabio Fehr retweetledi
Tom McCoy
Tom McCoy@RTomMcCoy·
🤖🧠NEW PAPER🧠🤖 Bayesian models can learn rapidly. Neural networks can handle messy, naturalistic data. How can we combine these strengths? Our answer: Use meta-learning to distill Bayesian priors into a neural network! Paper: arxiv.org/abs/2305.14701 1/n
Tom McCoy tweet media
English
4
107
514
80.5K
Fabio Fehr
Fabio Fehr@FabioFehr·
To volunteer at a conference? - What are the pros and cons?
English
0
0
1
166
Fabio Fehr retweetledi
James Henderson
James Henderson@JamieBHenderson·
I am excited to announce our @iclr_conf 2023 paper: "A VAE for Transformers with Nonparametric Variational Information Bottleneck" openreview.net/pdf?id=6QkjC_c… We propose to model Transformer embeddings as nonparametric mixture distributions using Dirichlet processes. @FabioFehr
English
0
6
29
2.2K