Genta Winata (@gentaiscool) - Twitter Profili | Zamantika Mersobahis Locabet

Sabitlenmiş Tweet

Genta Winata@gentaiscool·26 Nis

⭐️We're thrilled to share that our paper WorldCuisines has been selected for the Best Theme Paper Award at NAACL 2025 @naaclmeeting! 🎉 A huge thank you to the reviewers and area chair for this incredible recognition — we’re truly honored. Massive gratitude to all our amazing co-authors for the countless hours, late nights, and deep discussions that went into creating this high-quality dataset. 2025.naacl.org/blog/best-pape… We can't wait to present next week at NAACL! Catch us at our poster session (Wednesday, April 30) and Best Paper Award Session (Friday, May 2) for our oral presentation. Check out the paper and project here: 🌐 worldcuisines.github.io Contributors: @fredyhudi, @patrickamadeus_, @davidanugraha, @rifkiaputri, @zzeet, @ubaidalih, @auliaadilaa, @adamnohejl, @JunhoMyung00211, @aliceoh, @AnarSnowball, @faridlazuarda, @jcblaisecruz, @nedjmaou, @jodieyzhou, @AboladeDaud, @prajdabre1, @holylovenia, @SCahyawijaya, @bryanwilie92, @mrpeerat, @farizikhwantri, @gkuwanto, @llamagrp, @mv_zhukova, @EmmanueleChers1, @AlhamFikri, @davlanade, @tarowatanabe, @OptionsGod_lgd,@AyuP_AI, and many others who are not on X. Acknowledgments: @nayeon7lee, @Wenliang_Dai, @pascalefung who helped and provided us insightful suggestions. #nlproc #naacl2025 #worldcuisines

English

9

15

121

17.6K

Genta Winata retweetledi

Elias Stengel-Eskin@EliasEskin·9 Nis

🚨 Excited to announce that RGD has been accepted to #ACL2026 Main! Routing with Generated Data (RGD) is a new LLM routing paradigm where routers estimate skills of models using generated data, without ground-truth labels. We further introduce CASCAL, a new router for RGD that discovers niche skills via consensus voting + hierarchical clustering, with no ground truth needed. 🧵👇

Elias Stengel-Eskin@EliasEskin

📢 Introducing Routing with Generated Data (RGD), a new setting for annotation-free LLM routing. We study how routers can be trained without any ground-truth labels. We also introduce CASCAL, a novel label-free LLM router that identifies niche skills using consensus-voting and hierarchical clustering. ➡️ Most LLM routers assume access to labeled, in-domain data to estimate model skills (query-answer routers). However, user distributions are unknown and labels are expensive or unavailable, highlighting the need for routers that work without labels. ➡️ We introduce Routing with Generated Data (RGD): routers are trained only on Q&A data generated from task descriptions, without human annotation. We experiment with various LLM generators of different strengths (Gemini-2.5-Flash, Qwen-3-32B, Exaone-3.5-7.8B). ➡️ CASCAL outperforms other query-answer and query-only routers across diverse datasets (MMLU-Pro, SuperGPQA, MedMCQA, BigBench Extra Hard), and is more robust to weaker generators.

English

0

16

36

4.5K

Genta Winata retweetledi

Alham Fikri Aji@AlhamFikri·15 Mar

VLMs can easily get distracted by unrelated cultural cues. Happy to present our work on this soon at #CVPR2026🥳 Working on multilingual VLMs? Consider using our benchmark: 📜arxiv.org/pdf/2511.17004 🤗huggingface.co/datasets/patri… Amazing work by @patrickamadeus_ and colleagues!

pat ✈️ CVPR@patrickamadeus_

Excited to share that we have committed our paper “Vision-Language Models are Confused Tourists” to #CVPR2026 (Findings)! 🇺🇸🏔 Arxiv: arxiv.org/abs/2511.17004 We question whether current SOTA VLMs remain robust in simple cultural grounding QA when distracting contextual objects are present For example, if you eat chicken schnitzel with Mt. Fuji in the background, will the model fail to recognize it as Japanese katsu? ConfusedTourists introduces: 👉 5k+ evaluation samples across 3 cultural item categories, comprising 243 unique cultural items from 57 countries and 11 sub-regions 🌍 👉 Evaluation of 14 VLMs across 12 data features 🤖 👉 Findings showing that simple concept mixing can cause up to a -40% drop in perform 📉 Special thanks to my co-authors @IkhlasulHanif0 , @emthehunt, @gentaiscool, @FajriKoto, and my advisor @AlhamFikri for the valuable contributions along the way! #multimodal #vlm #multicultural #robustness #evaluation #NLProc #ComputerVision

English

2

18

71

8K

Genta Winata@gentaiscool·2 Şub

Happy to have my first Nature paper. Thank you @CAIS for the collaboration nature.com/articles/s4158…

Center for AI Safety@CAIS

Last week, Humanity’s Last Exam was published in @Nature. In just over a year, model scores on HLE have risen from under 5% to nearly 40%. Thank you to @scale_AI and the 1000+ HLE co-authors for helping policymakers and the public track these rapid advances in AI capabilities.

English

0

22

1.4K

Genta Winata retweetledi

Center for AI Safety@CAIS·2 Şub

Last week, Humanity’s Last Exam was published in @Nature. In just over a year, model scores on HLE have risen from under 5% to nearly 40%. Thank you to @scale_AI and the 1000+ HLE co-authors for helping policymakers and the public track these rapid advances in AI capabilities.

English

9

41

157

27.4K

Genta Winata retweetledi

Elias Stengel-Eskin@EliasEskin·15 Oca

📢 Introducing Routing with Generated Data (RGD), a new setting for annotation-free LLM routing. We study how routers can be trained without any ground-truth labels. We also introduce CASCAL, a novel label-free LLM router that identifies niche skills using consensus-voting and hierarchical clustering. ➡️ Most LLM routers assume access to labeled, in-domain data to estimate model skills (query-answer routers). However, user distributions are unknown and labels are expensive or unavailable, highlighting the need for routers that work without labels. ➡️ We introduce Routing with Generated Data (RGD): routers are trained only on Q&A data generated from task descriptions, without human annotation. We experiment with various LLM generators of different strengths (Gemini-2.5-Flash, Qwen-3-32B, Exaone-3.5-7.8B). ➡️ CASCAL outperforms other query-answer and query-only routers across diverse datasets (MMLU-Pro, SuperGPQA, MedMCQA, BigBench Extra Hard), and is more robust to weaker generators.

English

1

27

44

10.6K

Genta Winata@gentaiscool·28 Ara

@quarbby Me too

English

0

131

lynnette ng@quarbby·28 Ara

On this day, I finally got myself Premium as a Christmas present. Finally in the cool kids club 😀

English

1

0

6

410

Genta Winata@gentaiscool·28 Ara

@haryoaw Happened to me as well in neurips. They got poster, we got nothing

English

0

2

140

Haryo@haryoaw·28 Ara

It's interesting that in the conference, I met someone who had presented a paper that had the same idea (different execution) as ours. Ours led to a finding, and theirs led to an oral presentation. 😭

English

4

0

5

369

Genta Winata@gentaiscool·28 Ara

@haryoaw Try kfc and mcd in India. I heard it is good

English

1

0

128

Haryo@haryoaw·27 Ara

My friend ordered and ate fish and chips in India out of lots of Indian food choices.

English

2

0

6

500

Genta Winata@gentaiscool·24 Ara

@prajdabre @osanseviero We need IndicGPT @prajdabre

Nederlands

0

1

211

Raj Dabre@prajdabre·24 Ara

Very excited to release our ongoing work on IndicBERT-V3! Some key points: 1. Long context 2. Various model sizes 3. SOTA performance on bitext mining and RAG on our internal evaluations 4. Multilingual Indic support Go wild! cc @osanseviero for visibility, since we used Gemma. @anoopk

neural nets.@cneuralnetwork

We are releasing IndicBERT-v3, a suite of multilingual encoder language models (270M, 1B, 4B) built on top of Gemma-3. We adapted these models to use bidirectional attention, making them effective for encoder-heavy tasks. (1/3) @psidharth567 @_iunravel

English

12

8

169

13.4K

Genta Winata@gentaiscool·24 Ara

@IkhlasulHanif0 @WenhuChen 10k is all my citations till 2025 lol

English

0

33

Hanif | AI NOT FOR PRODUCTIVITY@IkhlasulHanif0·24 Ara

@WenhuChen Insane

Türkçe

1

0

915

Wenhu Chen@WenhuChen·24 Ara

Surpassed 10K citations in a single year! 🥳

English

22

4

456

41.1K

Genta Winata@gentaiscool·24 Ara

@WenhuChen GOAT!!!

English

0

554

Genta Winata@gentaiscool·24 Ara

💡Have you ever wondered whether vision–language models can be easily tricked by adding landmarks or flags to an image? In the spirit of the holidays🎄, we show that VLMs can indeed be easily confused like "Confused Tourists" ✈️: their performance drops significantly when such image perturbations are applied. 🔎 Check out "VLMs are Confused Tourists" ✈️ here arxiv.org/pdf/2511.17004 #vision #nlproc #robustness

pat ✈️ CVPR@patrickamadeus_

Craving holiday-themed paper? Say less🎄 Turns out, Vision Language Models are Confused Tourists ✈️😵‍💫 We show that adversarially induced cultural scenes significantly impair VLM cultural comprehension and trigger potential bias #NLProc #multimodal #robustness /thread 🧵(1/8)

English

0

3

16

2.5K

Genta Winata@gentaiscool·24 Ara

@IkhlasulHanif0 x = San Diego (ACL) y = South Korea (ICML) 😀

Indonesia

0

1

129

Hanif | AI NOT FOR PRODUCTIVITY@IkhlasulHanif0·23 Ara

Prof: "what are your plans on winter break?" Stud: "Oh I plan to go to x, y, z, ..." Prof: "Oh I mean in terms of research" aint noway

English

4

0

4

493

Genta Winata@gentaiscool·24 Ara

@IkhlasulHanif0 @patrickamadeus_ spend some real money

English

0

13

Hanif | AI NOT FOR PRODUCTIVITY@IkhlasulHanif0·23 Ara

@patrickamadeus_ How to buy patrick

English

1

0

2

67

pat ✈️ CVPR@patrickamadeus_·23 Ara

I WANT THIS PLEASE 😭😭

fel@suiczide

best purchase of 2025

English

1

0

3

331

Genta Winata retweetledi

pat ✈️ CVPR@patrickamadeus_·23 Ara

Craving holiday-themed paper? Say less🎄 Turns out, Vision Language Models are Confused Tourists ✈️😵‍💫 We show that adversarially induced cultural scenes significantly impair VLM cultural comprehension and trigger potential bias #NLProc #multimodal #robustness /thread 🧵(1/8)