Massimo Nicosia

101 posts

@maxnicosia

👨🏻‍💻 Staff SWE in Research @googledeepmind @google ⚙️ #nlproc #nlu #multimodality #LLMs 🎓 Ph.D. @UniTrento 💾 Past: @QatarComputing, #webdev, #skater

Zurich, Switzerland · Joined September 2008

339 Following · 205 Followers

Pinned Tweet

Massimo Nicosia@maxnicosia·10 Sep

📄 Our new EMNLP paper is on arXiv! 📄 1⃣ Train an mT5 filler model to reconstruct full parses from English utterances + parse signatures 2⃣ Run it on translations and parse signatures to obtain high-quality i18n synthetic data! More here:👉 arxiv.org/abs/2109.04319 👈 @Google

Massimo Nicosia@maxnicosia·6 Dec

Great to be in #singapore for #emnlp2023 with two contributions: check out the XTREME-UP and mmT5 papers! XTREME-UP: arxiv.org/abs/2305.11938 mmT5: arxiv.org/abs/2305.14224 #EMNLP2023 #NLProc #google @GoogleDeepMind @GoogleAI @Google

Massimo Nicosia retweeted

Saoban Lateefat@SaobanL·16 Sep

Attending the @Google @GoogleDeepMind networking and social event was great too, as I had the opportunity to network with various AI practitioners.

Massimo Nicosia retweeted

Fatima@TiiimaT·10 Sep

#Indaba2023 is a wrap! Six days of packed workshops, presentations, and panels, with an attendance of over 800 AI practitioners from 32 African countries. But the main highlight was the people and hearing about their journeys. Thanks to the organizing committee for a great experience.

Massimo Nicosia@maxnicosia·11 Sep

Thanks for organizing the sessions @2ayasalama! It was great meeting you and chatting with all the participants 😃 #Indaba2023 #DLI_23 #Yɛbetumi

AYA@2ayasalama

Day 3 at the @DeepIndaba: 1st day of "Breakfast & Mentorship sessions" ♥️ and 1st of 2 days dedicated to research showcases from across the continent. ⚡🔦 Spotlight talks taking place now. #Indaba2023 #Yɛbetumi

Massimo Nicosia retweeted

Kevin Patrick Murphy@sirbayes·8 Sep

I had a great time meeting folks from all over Africa (plus several Google DeepMind colleagues from UK/US...) at the #Indaba2023

Massimo Nicosia@maxnicosia·4 Sep

At Deep Learning Indaba! See you at the Google booths or at Tuesday's Breakfast Mentorship Session. Thursday aft I'll present a "Request For Plot", a short project that students can complete to start a collab with Google researchers. LMK if you are here and would like to chat!

Accra, Ghana 🇬🇭

Massimo Nicosia retweeted

Priyanka Agrawal@priyanka_17·11 Aug

We are excited to release the QAmeleon dataset, a multilingual QA dataset with 47,000+ LLM-generated QA pairs in 8 languages! github.com/google-researc… Training multilingual QA models with this data bridges ~60% of the gap between an En-only baseline and a fully-supervised upper bound.

Massimo Nicosia@maxnicosia·24 May

🚨🚨🚨 Check out this exciting work led by teammate @PfeiffJo, a new modular model that is great at cross-lingual transfer! Happy to have contributed to this project! 😊

Jonas Pfeiffer@PfeiffJo

We propose 𝗺𝗺𝗧𝟱, a modular multilingual seq2seq model. Our modular design and training regime solve source language hallucinations, resulting in massive performance gains in cross-lingual transfer scenarios. 📄 arxiv.org/abs/2305.14224

Massimo Nicosia retweeted

AK@_akhaliq·23 May

XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages

The paper proposes XTREME-UP, a benchmark defined by: its focus on the scarce-data scenario rather than zero-shot; its focus on user-centric tasks -- tasks with broad adoption by speakers of high-resource languages; and its focus on under-represented languages where this scarce-data scenario tends to be most realistic. XTREME-UP evaluates the capabilities of language models across 88 under-represented languages over 9 key user-centric technologies including ASR, OCR, MT, and information access tasks that are of general utility. We create new datasets for OCR, autocomplete, semantic parsing, and transliteration, and build on and refine existing datasets for other tasks. XTREME-UP provides methodology for evaluating many modeling scenarios including text-only, multi-modal (vision, audio, and text), supervised parameter tuning, and in-context learning.

Paper page: huggingface.co/papers/2305.11…

Massimo Nicosia retweeted

David Ifeoluwa Adelani 🇳🇬@davlanade·23 May

I'm very happy that MasakhaNER github.com/masakhane-io/m… by @MasakhaneNLP is part of XTREME-UP, you can now evaluate on several low-resource languages in a more realistic setting. Thank you @GoogleAI team for this new benchmark.

Google AI@GoogleAI

With the rapid development of language technology, it’s important that as many languages as possible benefit from these technologies, so we’re sharing XTREME-UP, a benchmark for evaluating multilingual models. 📝goo.gle/xtreme-up-paper 💻github.com/google-researc… Read 🧵↓ (1/3)

Massimo Nicosia retweeted

Sebastian Ruder@seb_ruder·22 May

🚨 I'm really excited to release XTREME-UP, a new benchmark focusing on under-represented languages. We emphasize a realistic evaluation setting including new and existing user-centric tasks and realistic data sizes beyond the few-shot setting.

Google AI@GoogleAI

Massimo Nicosia@maxnicosia·22 May

Our new work is finally out! 🥳 XTREME-UP, A User-Centric Scarce-Data Benchmark for Under-Represented Languages. Proud of scaling NLP to more under-represented languages while collaborating with so many talented folks! Find more info here: ▶️ github.com/google-researc…

Google AI@GoogleAI

Massimo Nicosia retweeted

Fangyu Liu@hardy_qr·20 Dec

Thanks AK for sharing! TL;DR: We propose MatCha🍵, visual language pretraining by (1) mapping charts to their underlying data tables and rendering code and (2) decoding answers to math questions rendered as images.

Aran Komatsuzaki@arankomatsuzaki

MATCHA: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering. Outperforms SotA by up to 20% on PlotQA and ChartQA and transfers well to domains like screenshots, diagrams, and document figures. arxiv.org/abs/2212.09662

Massimo Nicosia retweeted

Fangyu Liu@hardy_qr·21 Dec

📍🧵🚨 QA on plots & charts is a complex task requiring sophisticated reasoning - our visual language models struggle with this. LLMs are super strong reasoners - but they only work for text. What do we do? We translate plots & charts to text so LLMs can understand them!

Massimo Nicosia retweeted

Jay Alammar@JayAlammar·7 Dec

In the Massively Multilingual NLU 2022 workshop (mmnlu-22.github.io), @maxnicosia showcases the Translate and Fill approach (aclanthology.org/2021.findings-…). For multilingual text generation, they get better results with ByT5 than with mT5 at smaller model sizes. #EMNLP2022

Massimo Nicosia@maxnicosia·22 Aug

@PfeiffJo @Google Welcome to the team Jonas! 😉🥳

Jonas Pfeiffer@PfeiffJo·22 Aug

I’m happy to announce that I have started at @Google Research in Zürich as a Research Scientist 😍🥳

Massimo Nicosia@maxnicosia·15 Aug

@AmazonScience @nopper @emnlpmeeting Thank you!

Amazon Science@AmazonScience·15 Aug

@maxnicosia @nopper @emnlpmeeting Congrats both!

Massimo Nicosia@maxnicosia·13 Aug

🥇 Happy to announce that @nopper and I won the Zero-Shot task of the MMNLU-22 Challenge organized by Amazon! Our recipe? Translate-and-Fill synthetic data and ByT5. 📄TaF paper: aclanthology.org/2021.findings-… 🏆 MMNLU-22: mmnlu-22.github.io @emnlpmeeting @GoogleAI @AmazonScience

Massimo Nicosia@maxnicosia·14 Aug

@OG_rohan @PMinervini @nopper @emnlpmeeting @GoogleAI @AmazonScience @heymaxichrome Thank you Rohan! We'll look into that. Please note that our system was trained without gold data from non-en languages. If your goal is to serve the most accurate parser, you may be better off training ByT5 huggingface.co/google/byt5-ba… on all the available gold data from Massive.

Rohan Sachan@OG_rohan·13 Aug

@maxnicosia @PMinervini @nopper @emnlpmeeting @GoogleAI @AmazonScience @heymaxichrome great achievement 🔥. Do you and @nopper plan on releasing the model publicly anytime soon? Would love to host it on the ML API Marketplace that I am building.

Massimo Nicosia@maxnicosia·14 Aug

@Kuzeko @nopper @emnlpmeeting @GoogleAI @AmazonScience Thank you Matteo!

M. Lissandrini@Kuzeko·13 Aug

@maxnicosia @nopper @emnlpmeeting @GoogleAI @AmazonScience Congratulations!!
