Magda Dubois

95 posts

@DubMagda

Research Scientist @AISecurityInst working on LLM evaluations 🤖 | PhD in computational cognitive neuroscience @MPC_CompPsych 🧠

London, England · Joined December 2016

452 Following · 380 Followers

Magda Dubois retweeted

Cozmin Ududec @CUdudec · Feb 26

New from the Science of Evaluation Team at @AISafetyInst: a pipeline for rigorous transcript analysis. I think transcript analysis is still underrated, especially as model horizons are getting longer and task environments more complex.

Magda Dubois retweeted

Arvindh Arun @arvindh__a · Sep 12

Why does horizon length grow exponentially as shown in the METR plot? Our new paper investigates this by isolating the execution capabilities of LLMs. Here's why you shouldn't be fooled by slowing progress on typical short-task benchmarks... 🧵

Magda Dubois retweeted

Konrad Rieck 🌈 @mlsec · Jul 1

We're excited to announce the Call for Papers for SaTML 2026, the premier conference on secure and trustworthy machine learning (@satml_conf). We seek papers on secure, private, and fair learning algorithms and systems. 👉 satml.org/call-for-paper… ⏰ Deadline: Sept 24

Magda Dubois retweeted

Sahar Abdelnabi 🕊 @sahar_abdelnabi · Jun 2

The Hawthorne effect describes how study participants modify their behavior if they know they are being observed. In our paper 📢, we study whether LLMs exhibit analogous patterns 🧠 Spoiler: they do ⚠️ 🧵1/n

Magda Dubois retweeted

summerfieldlab @summerfieldlab.bsky.social @summerfieldlab · Jul 9

In a new paper, we examine recent claims that AI systems have been observed ‘scheming’, or making strategic attempts to mislead humans. We argue that to test these claims properly, more rigorous methods are needed.

Magda Dubois retweeted

AI Security Institute @AISecurityInst · Jul 9

Evaluating AI models is essential for improving their performance and understanding their risks. Increasingly, researchers are using “autograders” – having Large Language Models (LLMs) grade model outputs. But how do we know if these autograders are reliable? 🧵

Magda Dubois @DubMagda · May 13

New paper introducing a framework to better quantify uncertainty in LLM evaluations (led by @LLuettgau🙌). A beta Python package (developed by @HarryCoppock🚀) is available if you want to try it out. ➡️Get in touch if you have any Qs/feedback! Paper: arxiv.org/abs/2505.05602

AI Security Institute @AISecurityInst

Advanced AI systems require complex evaluations to measure abilities, but conventional analysis techniques often fall short. Introducing HiBayES: a flexible, robust statistical modelling framework that accounts for the nuances & hierarchical structure of advanced evaluations.

Magda Dubois retweeted

AI Security Institute @AISecurityInst · May 6

🧵 Today we’re publishing our first Research Agenda – a detailed outline of the most urgent questions we’re working to answer as AI capabilities grow. It’s our roadmap for tackling the hardest technical challenges in AI security.

Magda Dubois retweeted

Lennart Luettgau @LLuettgau · Sep 23

Excited to share our brand-new work shedding some light on the neural mechanisms behind one of humans' coolest cognitive feats: compositional generalization of structural knowledge! A Tweeprint-Thread 🧵 1/n

Magda Dubois retweeted

Alexandr Wang @alexandr_wang · Jul 25

1/ New paper in Nature shows model collapse as successive model generations are recursively trained on synthetic data. This is an important result. While many researchers today view synthetic data as an AI philosopher’s stone, there is no free lunch. Read more 👇

Magda Dubois retweeted

Felix Busch @Fel_Busch · Aug 13

I am excited to share that our article *Navigating the European Union Artificial Intelligence Act for Healthcare* has just been published in @npjDigitalMed🚀 #AIRegulation #DigitalHealth #EUAIAct #MedicalDevices #Innovation #npjDigitalMedicine #AIinHealthcare

Magda Dubois @DubMagda · May 2

@AshBowler @OOssmy @Gelironald @PascoFearon Well done Aislinn!!

Dr Aislinn Bowler @AshBowler · May 1

Happy to announce I've passed my viva with minor corrections! Thanks to my examiners @OOssmy and Kate Langley for a great discussion! And to my supervisors @Gelironald and @PascoFearon!

Magda Dubois retweeted

Matthew Nour @Matt_Nour · Oct 11

Paper out in @PNASNews! A 'cognitive mapping' lens on language in psychosis, using word embedding models, computational modelling, and MEG. A hint of what's to come at @OxPsychiatry and @UCLBrainScience... With @mcneural_, @YunzheNeuro, Ray Dolan. pnas.org/doi/abs/10.107…

Magda Dubois retweeted

Lennart Luettgau @LLuettgau · Sep 1

Preprint alert🚨! In this new paper we study how humans decompose dynamical subprocesses and leverage the abstracted subprocesses for compositional reuse of experience in new situations. psyarxiv.com/sxn4a/ Tweeprint to follow soon!

Magda Dubois retweeted

Marcelo Mattar @marcelomattar · May 4

In our lab's latest paper, we introduce a novel modeling approach using RNNs to reveal the cognitive algorithms behind animal decision-making. Check out our preprint, led by UCSD PhD student @Ji_An_Li and co-authored by Marcus Benna: biorxiv.org/content/10.110…

Magda Dubois @DubMagda · May 5

@julia_griem @BaskinSommers @forensicrg Congrats Julia !! 🥳

Julia Griem @julia_griem · May 4

Successfully defended my PhD today - what a great feeling! Thank you to my wonderful examiners Essi Viding & @BaskinSommers, and thank you to @forensicrg for an amazing 4.5 years!

Magda Dubois @DubMagda · Apr 6

Congratulations to my academic sibling @AlisaLoosen for those (very) well-deserved three shiny balloons

Magda Dubois @DubMagda · Mar 14

Wanna try out a (cool🦙) alternative to GPT?

Yann Dubois @yanndubs

🦙 Excited to share this demo of Alpaca. 🔥 Highlights: ~GPT3.5 performance for < 600$ 🔥 The goal was to have a simple model/training procedure that academics could study and improve with limited resources. We achieved that by finetuning a 7B LLaMA on 52K generated instructions.

Magda Dubois @DubMagda · Dec 21

Postdoc position in Boston ⭐️ Great place and amazing person to work with !

Magda Dubois retweeted

Tobias Hauser @TobiasUHauser · Dec 2

A while ago we published this #RegisteredReport in @NatureComms - but was this format of pre-registration really useful? Find some answers in this Q&A with us and one of the reviewers: nature.com/articles/s4146…

Magda Dubois @DubMagda

Our #RegisteredReport with @TobiasUHauser is now out in @NatureComms 🤓 We asked how people differ in their exploration - and found that impulsive and anxious subjects explore using different exploration strategies ! 1/ nature.com/articles/s4146…
