Awa Dieng

456 posts

Awa Dieng banner
Awa Dieng

Awa Dieng

@adoubleva

machine learning researcher

Beigetreten Eylül 2018
361 Folgt803 Follower
Awa Dieng retweetet
Jeff Dean
Jeff Dean@JeffDean·
We've been working on the Waxal dataset project since 2021, aiming to enhance the amount of data available for African languages. This public speech dataset initially covers 27 Sub-Saharan African languages spoken by over 100 million speakers across more than 26 countries. 🌍
English
70
276
1.7K
192.5K
Awa Dieng retweetet
Adji Bousso Dieng
Adji Bousso Dieng@adjiboussodieng·
On Feb 12, the @UN General Assembly formally appointed the 40 members of the first Independent International Scientific Panel on AI. I am honored to join this historic effort to place science at the heart of global #AI governance, ensuring this technology benefits all of humanity. Learn more about the panel members here: lnkd.in/eyVQg7p3 #AI #Science #Governance #Service
Adji Bousso Dieng tweet media
UN Office for Digital and Emerging Technologies@ODET_UN

The UN General Assembly has appointed 40 experts of the Independent International Scientific Panel on AI. 🏛️ This body begins its work as a scientifically-grounded foundation, ensuring global understanding is driven by evidence-based scientific assessments. #DigitalCooperation

English
3
17
42
7.2K
Awa Dieng retweetet
Princeton Computer Science
Princeton Computer Science@PrincetonCS·
Faculty members @adjiboussodieng and @korolova have been named to the @UN Independent International Scientific Panel on AI. Composed of 40 experts, the panel is the first global scientific body dedicated entirely to artificial intelligence. bit.ly/3MuPGRY
Princeton Computer Science tweet media
English
2
6
33
7K
Awa Dieng retweetet
Statistics Papers
Statistics Papers@StatsPapers·
Singular Bayesian Neural Networks Mame Diarra Toure, David A. Stephens arxiv.org/abs/2602.00387 [𝚜𝚝𝚊𝚝.𝙼𝙻 𝚌𝚜.𝙻𝙶 𝚜𝚝𝚊𝚝.𝙰𝙿]
Statistics Papers tweet media
English
0
6
18
918
Awa Dieng retweetet
AFAA 2026 @ ICLR
AFAA 2026 @ ICLR@afciworkshop·
🚨 Deadline Extended to Feb 5 (AoE)! CFP still OPEN for the #AFAA2026 Workshop at @iclr_conf — on fairness across alignment & agentic AI systems. Full & tiny papers welcome • Interdisciplinary work encouraged! 🔗 afciworkshop.org #ICLR2026 #AFAA2026
AFAA 2026 @ ICLR@afciworkshop

🚨 CFP OPEN! We’re launching the #AFAA2026 Workshop at @iclr_conf on fairness across alignment and agentic AI systems. Submit full or tiny papers! Interdisciplinary work especially welcome :D 🗓 Deadline: Jan 31 (AoE) | 🔗 afciworkshop.org #AFAA2026 #ICLR2026

English
0
4
5
6.7K
Awa Dieng
Awa Dieng@adoubleva·
🚨The deadline has been extended to February 05!! 🚨 Consider submitting your drafts, either to the full paper track (6-9 pages) or the tiny paper track (1-3 pages). We accept dual submissions to other conferences. See details at afciworkshop.org
AFAA 2026 @ ICLR@afciworkshop

🚨 CFP OPEN! We’re launching the #AFAA2026 Workshop at @iclr_conf on fairness across alignment and agentic AI systems. Submit full or tiny papers! Interdisciplinary work especially welcome :D 🗓 Deadline: Jan 31 (AoE) | 🔗 afciworkshop.org #AFAA2026 #ICLR2026

English
0
1
5
307
Awa Dieng retweetet
Awa Dieng retweetet
Princeton University
Princeton University@Princeton·
Princeton researchers have developed a new tool to speed the discovery of advanced materials known as metal organic frameworks potentially leading to better battery chemistry, more efficient carbon capture and improved access to clean water. bit.ly/4ptp3u6
English
1
8
21
2.1K
Awa Dieng
Awa Dieng@adoubleva·
Excited for the next edition of the Algorithmic Fairness workshop, this time at #ICLR2026! Consider submitting your work (full or short papers) by Jan 31. See the full call for papers below 👇🏾
AFAA 2026 @ ICLR@afciworkshop

🚨 CFP OPEN! We’re launching the #AFAA2026 Workshop at @iclr_conf on fairness across alignment and agentic AI systems. Submit full or tiny papers! Interdisciplinary work especially welcome :D 🗓 Deadline: Jan 31 (AoE) | 🔗 afciworkshop.org #AFAA2026 #ICLR2026

English
0
0
3
180
Awa Dieng retweetet
Stanford HAI
Stanford HAI@StanfordHAI·
With support from @StanfordHAI, a team of Stanford researchers investigates when and why current domain generalization benchmarks fall short and how they might be improved: stanford.io/4kTMQl3
Stanford HAI tweet media
English
5
6
30
2.6K
Awa Dieng retweetet
Vertaix® (AI & Science)
Vertaix® (AI & Science)@Vertaix_·
#NewPaper What does the concept of information have to do with diversity? Introducing the Vendi Information Gain, a new way of quantifying information in science and machine learning using diversity ☀️ 🗝️Shannon introduced mutual information (MI) in 1948 in his seminal paper titled "A Mathematical Theory of Communication," which birthed information theory. MI has since pervaded science and engineering. However, MI has several limitations that become more pronounced in high dimensions and/or under limited data. 💡Our new paper proposes a flexible alternative to MI that scales nicely to high dimensions and works well under limited data settings. We call it the Vendi Information Gain (VIG). VIG is defined as the difference between the marginal Vendi entropy of a random variable and its conditional Vendi entropy given another variable. Here we use "Vendi entropy" to mean the logarithm of the Vendi Score. 🟩 VIG has many nice properties, all of which we describe in the paper. In particular, it generalizes MI. MI appears as a special case of VIG when you assume there are no similarities between samples. Importantly, VIG doesn't require specifying a probability distribution over samples like MI does: it works directly with samples! Finally, VIG is asymmetric, reflecting the natural asymmetry of information. We demonstrate other properties in the paper. 🆕VIG allows us to develop a novel information-theoretic framework for active data acquisition, a paradigm widely used in data-driven science. This framework outperforms STRADDLE, considered SOTA in level-set estimation. 🧾We demonstrate VIG empirically in many applications, including in cognitive science to model human response times to external stimuli, in epidemiology to model epidemic processes and detect disease hotspots in many countries via level-set estimation. #InformationTheory #VendiScoring #Vertaix #Princeton @PrincetonCS Link to paper: arxiv.org/abs/2505.09007 Authors: Quan Nguyen (@the_subtrahend) and Adji Bousso Dieng (@adjiboussodieng)
GIF
English
0
4
12
2.3K
Awa Dieng retweetet
Karan Singhal
Karan Singhal@thekaransinghal·
📣 Proud to share HealthBench, an open-source benchmark from our Health AI team at OpenAI, measuring LLM performance and safety across 5000 realistic health conversations. 🧵 Unlike previous narrow benchmarks, HealthBench enables meaningful open-ended evaluation through 48,562 unique physician-written rubric criteria spanning several health contexts (e.g., emergencies, global health) and behavioral dimensions (e.g., accuracy, instruction following, communication). Blog, paper, code: openai.com/index/healthbe…
Karan Singhal tweet media
English
29
74
427
116.9K
Awa Dieng
Awa Dieng@adoubleva·
@kchonyc I know a causal inference paper review when I see one 😭
English
0
0
1
94
Kyunghyun Cho
Kyunghyun Cho@kchonyc·
aaahhhhh these identifiability inquisitors ... can you just go after LLM papers, ask them for the identifiability of LLM prompts and give me some slack, please? 😭
English
2
0
6
3.6K
Awa Dieng retweetet
Vertaix® (AI & Science)
Vertaix® (AI & Science)@Vertaix_·
Our LLM4Mat-Bench paper is now published @MLSTjournal ✨ Test your favorite LLM on the benchmark to predict material properties. 📖Paper: iopscience.iop.org/article/10.108… 💻Code: github.com/vertaix/LLM4Ma…
Vertaix® (AI & Science)@Vertaix_

#NewPaper Have you been wondering how your favorite LLM, e.g. Llama, Mistral, or Gemma performs on materials property prediction? We have just released LLM4Mat-Bench, an extensive benchmark for materials property prediction with LLMs! LLM4Mat-Bench has unique features: ☀️It spans 10 data collections, containing more than 2.6 Million data points. ☀️It covers 45 distinct material properties. ☀️It covers three different material representations: CIF, text description, and composition. ☀️It provides baseline results from different types and sizes of LLMs, e.g. Llama, Mistral, Gemma, MatBERT, and LLM-Prop. With materials data scattered everywhere, we believe LLM4Mat-Bench represents a unified data source for driving research on leveraging LLMs for materials science. The benchmark will be maintained and we look forward to your task and data contributions. Our @andre_niyongabo will present the paper at the AI4Mat #NeurIPS2024 workshop this December. Paper: arxiv.org/abs/2411.00177 Code: github.com/vertaix/LLM4Ma… Authors: Andre Niyongabo Rubungo (@andre_niyongabo), Kangming Li (@KangmingLi_), Jason Hattrick-Simpers, and Adji Bousso Dieng (@adjiboussodieng) #AI4Materials #MatSci #NLP4Science #Benchmarks #LLMs #Vertaix

English
0
4
23
2K
Awa Dieng retweetet
Google AI
Google AI@GoogleAI·
Today we announce TRINDs, a dataset and benchmarking pipeline that uses synthetic personas to train and optimize performance of LLMs for tropical and infectious diseases, which are out-of-distribution for most models. Learn more →goo.gle/4cTAZR4
English
20
89
355
41.5K
Awa Dieng retweetet
Mercy Nyamewaa Asiedu, Ph.D
Mercy Nyamewaa Asiedu, Ph.D@dr_nyamewaa·
How do large language models(LLMs) perform on out-of-distribution health datasets? Do tropical and infectious diseases serve as an out-of-distribution case to test LLMs? Does performance demonstrate potential use for global health surveillance? Check out our blog to learn more🦟
Google AI@GoogleAI

Today we announce TRINDs, a dataset and benchmarking pipeline that uses synthetic personas to train and optimize performance of LLMs for tropical and infectious diseases, which are out-of-distribution for most models. Learn more →goo.gle/4cTAZR4

English
1
8
13
2.2K
Awa Dieng retweetet
Pablo Samuel Castro
Pablo Samuel Castro@pcastr·
The end of an era! Hugo's impact on the AI ecosystem, especially in Montreal, is truly inspiring. After my postdoc in 2012 I basically quit research, and it was thanks to Hugo believing in me that I was able to be a researcher again in 2017. You will be dearly missed, Hugo!
Hugo Larochelle@hugo_larochelle

Today is my last day at Google. I started over 8 years ago, with a mandate to build a team doing bleeding edge AI research from Montreal, in what would be the first big tech AI research lab in the city. These years led to countless amazing scientific contributions from my team, to several initiatives nurturing the Montreal AI ecosystem, and to many new invaluable friendships across the globe at Google. It is with a heavy heart that I say goodbye, but I know I’m leaving behind an exceptionally strong Google DeepMind group in Montreal for which its best accomplishments are still ahead. There are too many people to thank, but I can’t pass on thanking Samy Bengio and @JeffDean who first believed in me and the opportunity of building a research lab in Montreal. I’m still working on determining the details of my next chapter, but certainly it will be grounded in my continuing motivation to leverage and make the most out of our enormous and talented local AI ecosystem.

English
2
4
98
10K