Massimo Nicosia

101 posts

Massimo Nicosia banner
Massimo Nicosia

Massimo Nicosia

@maxnicosia

👨🏻‍💻 Staff SWE in Research @googledeepmind @google ⚙️ #nlproc #nlu #multimodality #LLMs 🎓 Ph.D. @UniTrento 💾 Past: @QatarComputing, #webdev, #skater

Zurich, Switzerland Katılım Eylül 2008
339 Takip Edilen205 Takipçiler
Sabitlenmiş Tweet
Massimo Nicosia
Massimo Nicosia@maxnicosia·
📄 Our new EMNLP paper is on arXiv! 📄 1⃣ Train an mT5 filler model to reconstruct full parses from English utterances + parse signatures 2⃣ Run it on translations and parse signatures to obtain high quality i18n synthetic data! More here:👉 arxiv.org/abs/2109.04319 👈 @Google
English
1
0
9
0
Massimo Nicosia retweetledi
Saoban Lateefat
Saoban Lateefat@SaobanL·
Attending @Google @GoogleDeepMind google network and social event was great too as I had the opportunity to network with various AI practitioners.
Saoban Lateefat tweet mediaSaoban Lateefat tweet mediaSaoban Lateefat tweet media
English
1
1
2
150
Massimo Nicosia retweetledi
Fatima
Fatima@TiiimaT·
#Indaba2023 is a wrap! Six days of packed workshops, presentations, and panels. With an attendance of over 800 AI practitioners 32 African countries. But the main highlight was the people and hearing about their journeys. Thanks to the organizing committee for a great experience
English
1
11
83
6.3K
Massimo Nicosia retweetledi
Kevin Patrick Murphy
Kevin Patrick Murphy@sirbayes·
I had a great time meeting folks from all over Africa (plus several Google DeepMind colleagues from UK/US...) at the #Indaba2023
Kevin Patrick Murphy tweet mediaKevin Patrick Murphy tweet media
English
2
9
116
13.9K
Massimo Nicosia
Massimo Nicosia@maxnicosia·
At Deep Learning Indaba! See you at the Google booths or at Tuesday's Breakfast Mentorship Session. Thursday aft I'll present a "Request For Plot", a short project that students can complete to start a collab with Google researchers. LMK if you are here and would like to chat!
Massimo Nicosia tweet mediaMassimo Nicosia tweet media
Accra, Ghana 🇬🇭 English
0
0
5
337
Massimo Nicosia retweetledi
Priyanka Agrawal
Priyanka Agrawal@priyanka_17·
We are excited to release QAmeleon dataset, a multilingual QA dataset with 47,000+ LLM-generated QA pairs in 8 languages! github.com/google-researc… Training multilingual QA models with this bridges ~60% of the gap between an En-only baseline and a fully-supervised upper bound.
English
4
48
235
31K
Massimo Nicosia
Massimo Nicosia@maxnicosia·
🚨🚨🚨 Check out this exciting work led by team mate @PfeiffJo, a new modular model that is great at cross-lingual transfer! Happy to have contributed to this project! 😊
Jonas Pfeiffer@PfeiffJo

We propose 𝗺𝗺𝗧𝟱 a modular multilingual seq2seq model. Our modular design and training regime solves source language hallucinations resulting in massive performance gains in cross-lingual transfer scenarios. 📄 arxiv.org/abs/2305.14224

English
0
0
4
537
Massimo Nicosia retweetledi
AK
AK@_akhaliq·
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages propose XTREME-UP, a benchmark defined by: its focus on the scarce-data scenario rather than zero-shot; its focus on user-centric tasks -- tasks with broad adoption by speakers of high-resource languages; and its focus on under-represented languages where this scarce-data scenario tends to be most realistic. XTREME-UP evaluates the capabilities of language models across 88 under-represented languages over 9 key user-centric technologies including ASR, OCR, MT, and information access tasks that are of general utility. We create new datasets for OCR, autocomplete, semantic parsing, and transliteration, and build on and refine existing datasets for other tasks. XTREME-UP provides methodology for evaluating many modeling scenarios including text-only, multi-modal (vision, audio, and text),supervised parameter tuning, and in-context learning paper page: huggingface.co/papers/2305.11…
AK tweet media
English
1
7
29
15K
Massimo Nicosia retweetledi
Massimo Nicosia retweetledi
Sebastian Ruder
Sebastian Ruder@seb_ruder·
🚨 I'm really excited to release XTREME-UP, a new benchmark focusing on under-represented languages. We emphasize a realistic evaluation setting including new and existing user-centric tasks and realistic data sizes beyond the few-shot setting.
Google AI@GoogleAI

With the rapid development of language technology, it’s important that as many languages as possible benefit from these technologies, so we’re sharing XTREME-UP, a benchmark for evaluating multilingual models. 📝goo.gle/xtreme-up-paper 💻github.com/google-researc… Read 🧵↓ (1/3)

English
2
30
132
28.7K
Massimo Nicosia
Massimo Nicosia@maxnicosia·
Our new work is finally out! 🥳 XTREME-UP, A User-Centric Scarce-Data Benchmark for Under-Represented Languages. Proud of scaling NLP to more under-represented languages while collaborating with so many talented folks! Find more info here: ▶️ github.com/google-researc…
Google AI@GoogleAI

With the rapid development of language technology, it’s important that as many languages as possible benefit from these technologies, so we’re sharing XTREME-UP, a benchmark for evaluating multilingual models. 📝goo.gle/xtreme-up-paper 💻github.com/google-researc… Read 🧵↓ (1/3)

English
0
0
10
663
Massimo Nicosia retweetledi
Fangyu Liu
Fangyu Liu@hardy_qr·
Thanks AK for sharing! TL; DR: We propose MatCha🍵, visual language pretraining by (1) mapping charts to their underlying data tables and rendering code (2) decoding answers of math questions rendered as images.
Fangyu Liu tweet media
Aran Komatsuzaki@arankomatsuzaki

MATCHA : Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering Outperforms SotA by up to 20% on PlotQA and ChartQA - Transfers well to domains like screenshot, diagrams, and document figures arxiv.org/abs/2212.09662

English
2
4
37
10.6K
Massimo Nicosia retweetledi
Fangyu Liu
Fangyu Liu@hardy_qr·
📍🧵🚨 QA on plots & charts is a complex task requiring sophisticated reasoning - our visual language models struggle with this. LLMs are super strong reasoners - but they only work for text. What do we do? We translate plots & charts to text so LLM can understand!
Fangyu Liu tweet media
English
3
16
80
11.1K
Jonas Pfeiffer
Jonas Pfeiffer@PfeiffJo·
I’m happy to announce that I have started at @Google Research in Zürich as a Research Scientist 😍🥳
English
58
10
1.1K
0