Jinlan Fu

23 posts

Jinlan Fu banner
Jinlan Fu

Jinlan Fu

@JinlanFu

Postdoc at @NUSComputing, previously FudanNLP. | Interests: Large Language Model, Text Evaluation, Generation, Dialog System

Singapore Katılım Mayıs 2017
188 Takip Edilen209 Takipçiler
Sabitlenmiş Tweet
Jinlan Fu
Jinlan Fu@JinlanFu·
Can the text evaluator be customized for different/new evaluation aspects without training? Our GPTScore achieves customized, multifaceted, and training-free using emergent abilities of PLM, i.g., instruction and in-context learning. Paper: arxiv.org/pdf/2302.04166…
GIF
English
2
10
47
11.8K
Jinlan Fu
Jinlan Fu@JinlanFu·
[H3] Different evaluation aspects exhibit certain correlations. Combining definitions with other highly correlated aspects can improve evaluation performance.
Jinlan Fu tweet mediaJinlan Fu tweet media
English
1
1
2
597
Jinlan Fu
Jinlan Fu@JinlanFu·
Can the text evaluator be customized for different/new evaluation aspects without training? Our GPTScore achieves customized, multifaceted, and training-free using emergent abilities of PLM, i.g., instruction and in-context learning. Paper: arxiv.org/pdf/2302.04166…
GIF
English
2
10
47
11.8K
Jinlan Fu retweetledi
Luyu Gao
Luyu Gao@luyu_gao·
[1/4] Introducing HyDE, a method to unsupervisedly build dense retrievers. HyDE zero-shot instructs GPT to generate a fictional document and re-encodes it with Contriever to search in its embedding space. Put it simply, casting retrieval-like behavior in GPT into real retrieval.
Luyu Gao tweet media
English
9
72
393
74.5K
Jinlan Fu
Jinlan Fu@JinlanFu·
Q3: What makes a good prompt for multilingual multitask prompt training? A3: The best performance is achieved when the model is trained on the datasets equipped with unified and cross-lingual prompts.
Jinlan Fu tweet mediaJinlan Fu tweet media
English
0
0
0
0
Jinlan Fu
Jinlan Fu@JinlanFu·
Q2: How do different characteristics of datasets and languages affect the performance of PolyPrompt? A2: PolyPrompt cannot benefit all languages. e.g., languages that appear only once in target datasets have benefits when PolyPrompt is enhanced by high-resource datasets;
Jinlan Fu tweet mediaJinlan Fu tweet media
English
1
0
0
0
Jinlan Fu retweetledi
Pengfei Liu
Pengfei Liu@stefan_fee·
I'm super honored that our paper was awarded as "Best Demo Paper" at ACL 2021!! Special thanks to all awesome collaborators, reviewers' insightful suggestions, and the committee's recognition. We will continue optimizing ExplainaBoard to make it a truly useful tool!
Pengfei Liu tweet media
Pengfei Liu@stefan_fee

What's your system good/bad at? Where can your model outperform others? What are the mistakes that the top-10 systems make? We are always struggling with these questions. A new academic tool can help us answer them in a one-click fashion and many more:explainaboard.nlpedia.ai

English
3
6
71
0
Jinlan Fu
Jinlan Fu@JinlanFu·
So excited to share our recent work at #NAACL2021. We empirically study four tagging tasks and thirteen datasets, interpreting the improvement brought by larger-context training using the ExplainaBoard toolkit. explainaboard.nlpedia.ai
Pengfei Liu@stefan_fee

The success of many NLP techniques (BERT, Retrieval ...) is essentially due to the use of the larger context, but the MORE, the BETTER? We (@JinlanFu ) try to answer this question in #NAACL2021: Larger-Context Tagging: When and Why Does It Work? arxiv.org/pdf/2104.04434…

English
0
0
4
0
Jinlan Fu retweetledi
Sebastian Ruder
Sebastian Ruder@seb_ruder·
XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation We examine the state of multilingual benchmarking and propose an improved benchmark covering more challenging tasks, including a diagnostic and evaluation suite to inform future work. arxiv.org/abs/2104.07412
Sebastian Ruder tweet media
English
1
31
155
0
Jinlan Fu
Jinlan Fu@JinlanFu·
Happy to be one of the contributors of ExplainaBoard, which covers more than 40 datasets, 9 tasks, and 7 analysis functionalities to interpret the characteristics of systems, datasets, and their interplay. Thanks, @stefan_fee, @gneubig, and et al.. Tool: explainaboard.nlpedia.ai
Pengfei Liu@stefan_fee

What's your system good/bad at? Where can your model outperform others? What are the mistakes that the top-10 systems make? We are always struggling with these questions. A new academic tool can help us answer them in a one-click fashion and many more:explainaboard.nlpedia.ai

English
1
2
16
0