Awa Dieng

456 posts

Awa Dieng

@adoubleva

machine learning researcher

Beigetreten Eylül 2018

361 Folgt803 Follower

Awa Dieng retweetet

Jeff Dean@JeffDean·7 Mar

We've been working on the Waxal dataset project since 2021, aiming to enhance the amount of data available for African languages. This public speech dataset initially covers 27 Sub-Saharan African languages spoken by over 100 million speakers across more than 26 countries. 🌍

English

276

1.7K

192.5K

Awa Dieng retweetet

Schmidt Sciences@schmidtsciences·14 Şub

Schmidt Sciences grantees @adjiboussodieng, an AI2050 Fellow, and @Yoshua_Bengio, a Science of Trustworthy AI grantee, were recently appointed by the @UN General Assembly to serve on the inaugural Independent International Scientific Panel on AI: un.org/independent-in…

English

1.7K

Awa Dieng retweetet

Adji Bousso Dieng@adjiboussodieng·13 Şub

On Feb 12, the @UN General Assembly formally appointed the 40 members of the first Independent International Scientific Panel on AI. I am honored to join this historic effort to place science at the heart of global #AI governance, ensuring this technology benefits all of humanity. Learn more about the panel members here: lnkd.in/eyVQg7p3 #AI #Science #Governance #Service

UN Office for Digital and Emerging Technologies@ODET_UN

The UN General Assembly has appointed 40 experts of the Independent International Scientific Panel on AI. 🏛️ This body begins its work as a scientifically-grounded foundation, ensuring global understanding is driven by evidence-based scientific assessments. #DigitalCooperation

English

7.2K

Awa Dieng retweetet

Princeton Computer Science@PrincetonCS·13 Şub

Faculty members @adjiboussodieng and @korolova have been named to the @UN Independent International Scientific Panel on AI. Composed of 40 experts, the panel is the first global scientific body dedicated entirely to artificial intelligence. bit.ly/3MuPGRY

English

Awa Dieng retweetet

Statistics Papers@StatsPapers·3 Şub

Singular Bayesian Neural Networks Mame Diarra Toure, David A. Stephens arxiv.org/abs/2602.00387 [𝚜𝚝𝚊𝚝.𝙼𝙻 𝚌𝚜.𝙻𝙶 𝚜𝚝𝚊𝚝.𝙰𝙿]

English

918

Awa Dieng retweetet

AFAA 2026 @ ICLR@afciworkshop·2 Şub

🚨 Deadline Extended to Feb 5 (AoE)! CFP still OPEN for the #AFAA2026 Workshop at @iclr_conf — on fairness across alignment & agentic AI systems. Full & tiny papers welcome • Interdisciplinary work encouraged! 🔗 afciworkshop.org #ICLR2026 #AFAA2026

AFAA 2026 @ ICLR@afciworkshop

🚨 CFP OPEN! We’re launching the #AFAA2026 Workshop at @iclr_conf on fairness across alignment and agentic AI systems. Submit full or tiny papers! Interdisciplinary work especially welcome :D 🗓 Deadline: Jan 31 (AoE) | 🔗 afciworkshop.org #AFAA2026 #ICLR2026

English

6.7K

Awa Dieng@adoubleva·2 Şub

🚨The deadline has been extended to February 05!! 🚨 Consider submitting your drafts, either to the full paper track (6-9 pages) or the tiny paper track (1-3 pages). We accept dual submissions to other conferences. See details at afciworkshop.org

AFAA 2026 @ ICLR@afciworkshop

English

307

Awa Dieng retweetet

Sanmi Koyejo@sanmikoyejo·12 Oca

Major release from our group today: “End-to-End Test-Time Training” (TTT-E2E). We show that continual learning enables strong long context performance without changing the underlying architecture!

Karan Dalal@karansdalal

LLM memory is considered one of the hardest problems in AI. All we have today are endless hacks and workarounds. But the root solution has always been right in front of us. Next-token prediction is already an effective compressor. We don’t need a radical new architecture. The missing piece is to continue training the model at test-time, using context as training data. Our full release of End-to-End Test-Time Training (TTT-E2E) with @NVIDIAAI, @AsteraInstitute, and @StanfordAILab is now available. Blog: nvda.ws/4syfyMN Arxiv: arxiv.org/abs/2512.23675 This has been over a year in the making with @arnuvtandon and an incredible team.

English

2.6K

Awa Dieng retweetet

Princeton University@Princeton·7 Oca

Princeton researchers have developed a new tool to speed the discovery of advanced materials known as metal organic frameworks potentially leading to better battery chemistry, more efficient carbon capture and improved access to clean water. bit.ly/4ptp3u6

English

2.1K

Awa Dieng@adoubleva·7 Oca

Excited for the next edition of the Algorithmic Fairness workshop, this time at #ICLR2026! Consider submitting your work (full or short papers) by Jan 31. See the full call for papers below 👇🏾

AFAA 2026 @ ICLR@afciworkshop

English

180

Awa Dieng retweetet

Stanford HAI@StanfordHAI·6 Haz

With support from @StanfordHAI, a team of Stanford researchers investigates when and why current domain generalization benchmarks fall short and how they might be improved: stanford.io/4kTMQl3

English

2.6K

Awa Dieng retweetet

Vertaix® (AI & Science)@Vertaix_·4 Haz

Our paper introducing Alternators has been accepted at TMLR. Alternators are a new class of generative models that outperform state-space models, such as Mambas, and diffusion models such as ScoreGrad. Check it out! GitHub: github.com/vertaix/Altern… Paper: openreview.net/forum?id=Q70C1…

GIF

English

239

14.9K

Awa Dieng retweetet

Vertaix® (AI & Science)@Vertaix_·15 May

#NewPaper What does the concept of information have to do with diversity? Introducing the Vendi Information Gain, a new way of quantifying information in science and machine learning using diversity ☀️ 🗝️Shannon introduced mutual information (MI) in 1948 in his seminal paper titled "A Mathematical Theory of Communication," which birthed information theory. MI has since pervaded science and engineering. However, MI has several limitations that become more pronounced in high dimensions and/or under limited data. 💡Our new paper proposes a flexible alternative to MI that scales nicely to high dimensions and works well under limited data settings. We call it the Vendi Information Gain (VIG). VIG is defined as the difference between the marginal Vendi entropy of a random variable and its conditional Vendi entropy given another variable. Here we use "Vendi entropy" to mean the logarithm of the Vendi Score. 🟩 VIG has many nice properties, all of which we describe in the paper. In particular, it generalizes MI. MI appears as a special case of VIG when you assume there are no similarities between samples. Importantly, VIG doesn't require specifying a probability distribution over samples like MI does: it works directly with samples! Finally, VIG is asymmetric, reflecting the natural asymmetry of information. We demonstrate other properties in the paper. 🆕VIG allows us to develop a novel information-theoretic framework for active data acquisition, a paradigm widely used in data-driven science. This framework outperforms STRADDLE, considered SOTA in level-set estimation. 🧾We demonstrate VIG empirically in many applications, including in cognitive science to model human response times to external stimuli, in epidemiology to model epidemic processes and detect disease hotspots in many countries via level-set estimation. #InformationTheory #VendiScoring #Vertaix #Princeton @PrincetonCS Link to paper: arxiv.org/abs/2505.09007 Authors: Quan Nguyen (@the_subtrahend) and Adji Bousso Dieng (@adjiboussodieng)

GIF

English

2.3K

Awa Dieng retweetet

Karan Singhal@thekaransinghal·12 May

📣 Proud to share HealthBench, an open-source benchmark from our Health AI team at OpenAI, measuring LLM performance and safety across 5000 realistic health conversations. 🧵 Unlike previous narrow benchmarks, HealthBench enables meaningful open-ended evaluation through 48,562 unique physician-written rubric criteria spanning several health contexts (e.g., emergencies, global health) and behavioral dimensions (e.g., accuracy, instruction following, communication). Blog, paper, code: openai.com/index/healthbe…

English

427

116.9K

Awa Dieng@adoubleva·8 May

@kchonyc I know a causal inference paper review when I see one 😭

English

Kyunghyun Cho@kchonyc·7 May

aaahhhhh these identifiability inquisitors ... can you just go after LLM papers, ask them for the identifiability of LLM prompts and give me some slack, please? 😭

English

3.6K

Awa Dieng retweetet

Vertaix® (AI & Science)@Vertaix_·8 May

Our LLM4Mat-Bench paper is now published @MLSTjournal ✨ Test your favorite LLM on the benchmark to predict material properties. 📖Paper: iopscience.iop.org/article/10.108… 💻Code: github.com/vertaix/LLM4Ma…

Vertaix® (AI & Science)@Vertaix_

#NewPaper Have you been wondering how your favorite LLM, e.g. Llama, Mistral, or Gemma performs on materials property prediction? We have just released LLM4Mat-Bench, an extensive benchmark for materials property prediction with LLMs! LLM4Mat-Bench has unique features: ☀️It spans 10 data collections, containing more than 2.6 Million data points. ☀️It covers 45 distinct material properties. ☀️It covers three different material representations: CIF, text description, and composition. ☀️It provides baseline results from different types and sizes of LLMs, e.g. Llama, Mistral, Gemma, MatBERT, and LLM-Prop. With materials data scattered everywhere, we believe LLM4Mat-Bench represents a unified data source for driving research on leveraging LLMs for materials science. The benchmark will be maintained and we look forward to your task and data contributions. Our @andre_niyongabo will present the paper at the AI4Mat #NeurIPS2024 workshop this December. Paper: arxiv.org/abs/2411.00177 Code: github.com/vertaix/LLM4Ma… Authors: Andre Niyongabo Rubungo (@andre_niyongabo), Kangming Li (@KangmingLi_), Jason Hattrick-Simpers, and Adji Bousso Dieng (@adjiboussodieng) #AI4Materials #MatSci #NLP4Science #Benchmarks #LLMs #Vertaix

English

Awa Dieng retweetet

AFAA 2026 @ ICLR@afciworkshop·8 May

🎊 We are happy to share the proceedings of the 5th AFME workshop published at PMLR #AFME2024 #NeurIPS2024 proceedings.mlr.press/v279/ Take a look at the preface for a summary of the workshop and topics discussed!

English

711

Awa Dieng retweetet

Google AI@GoogleAI·30 Nis

Today we announce TRINDs, a dataset and benchmarking pipeline that uses synthetic personas to train and optimize performance of LLMs for tropical and infectious diseases, which are out-of-distribution for most models. Learn more →goo.gle/4cTAZR4

English

355

41.5K

Awa Dieng retweetet

Mercy Nyamewaa Asiedu, Ph.D@dr_nyamewaa·30 Nis

How do large language models(LLMs) perform on out-of-distribution health datasets? Do tropical and infectious diseases serve as an out-of-distribution case to test LLMs? Does performance demonstrate potential use for global health surveillance? Check out our blog to learn more🦟

Google AI@GoogleAI

English

2.2K

Awa Dieng retweetet

Pablo Samuel Castro@pcastr·27 Nis

The end of an era! Hugo's impact on the AI ecosystem, especially in Montreal, is truly inspiring. After my postdoc in 2012 I basically quit research, and it was thanks to Hugo believing in me that I was able to be a researcher again in 2017. You will be dearly missed, Hugo!

Hugo Larochelle@hugo_larochelle

Today is my last day at Google. I started over 8 years ago, with a mandate to build a team doing bleeding edge AI research from Montreal, in what would be the first big tech AI research lab in the city. These years led to countless amazing scientific contributions from my team, to several initiatives nurturing the Montreal AI ecosystem, and to many new invaluable friendships across the globe at Google. It is with a heavy heart that I say goodbye, but I know I’m leaving behind an exceptionally strong Google DeepMind group in Montreal for which its best accomplishments are still ahead. There are too many people to thank, but I can’t pass on thanking Samy Bengio and @JeffDean who first believed in me and the opportunity of building a research lab in Montreal. I’m still working on determining the details of my next chapter, but certainly it will be grounded in my continuing motivation to leverage and make the most out of our enormous and talented local AI ecosystem.

English

10K

Entdecken

@adjiboussodieng @Yoshua_Bengio @UN @korolova @iclr_conf @StanfordHAI @PrincetonCS @the_subtrahend