Benjamin Ricaud

528 posts

@GBR_Data

Data, science, research, graphs, artificial intelligence.

Tromsø, Norway · Joined March 2017

81 Following · 126 Followers

Benjamin Ricaud@GBR_Data·22 Mar

@British_Airways It would be nice if 1) the chatbot mentioned that we can use the machines instead of queueing and 2) extra time were allowed for check-in in this particular case.

128

Benjamin Ricaud@GBR_Data·22 Mar

@British_Airways I managed to check in in time for the second leg of the trip. But the first flight arrives 1 h 45 min before the second, and check-in for the second closes 1 h 15 min before departure, which leaves only 30 min to reach the British Airways counter.

142

Benjamin Ricaud@GBR_Data·21 Mar

I am flying with @British_Airways. On the way to Toronto, the company I was flying with could not check me in on the BA flight. Result: I missed it, as I queued for 30 min at Heathrow at the BA counter. And now it is the same on my way back. This is insane, @British_Airways. 😕

6.6K

Benjamin Ricaud@GBR_Data·12 Mar

I'm claiming my AI agent "benr_moltbot" on @moltbook 🦞 Verification: cave-4BZC

Benjamin Ricaud@GBR_Data·16 Feb

Things are going so fast in AI! Very impressive, we can expect a lot of developments around agents in the coming months!

Sam Altman@sama

Peter Steinberger is joining OpenAI to drive the next generation of personal agents. He is a genius with a lot of amazing ideas about the future of very smart agents interacting with each other to do very useful things for people. We expect this will quickly become core to our product offerings. OpenClaw will live in a foundation as an open source project that OpenAI will continue to support. The future is going to be extremely multi-agent and it's important to us to support open source as part of that.

Benjamin Ricaud@GBR_Data·12 Feb

Is your chatbot smart but too talkative? Find out with our new study on LLM reasoning efficiency! 🧠 #LLMs #machinelearning

Daniel Kaiser@spectate_or

Happy to share my latest pre-print with @GBR_Data. We investigate reasoning efficiency in LLMs and how to decompose it in different factors for a series of LLMs depending on what you know about the reasoning task. arxiv.org/abs/2602.09805

Benjamin Ricaud@GBR_Data·6 Feb

Norway at the top with the most users of generative AI!

Michał Podlewski@trajektoriePL

% of individuals using generative AI tools (aged 16-74), OECD latest data: 🇳🇴 56% Norway 🇩🇰 48% Denmark 🇨🇭 47% Switzerland 🇪🇪 46% Estonia 🇫🇮 46% Finland 🇮🇪 45% Ireland 🇳🇱 44% Netherlands 🇬🇷 44% Greece 🇱🇺 42% Luxembourg 🇧🇪 42% Belgium 🇸🇪 42% Sweden 🇦🇹 39% Austria 🇵🇹 38% Portugal 🇪🇸 38% Spain 🇸🇮 37% Slovenia 🇫🇷 37% France 🇱🇹 36% Lithuania 🇨🇿 35% Czechia 🇰🇷 34% Korea 🇱🇻 33% Latvia 🇪🇺 33% EU27 🇩🇪 32% Germany 🇸🇰 31% Slovak Republic 🇭🇺 30% Hungary 🇭🇷 27% Croatia 🇯🇵 27% Japan 🇵🇱 23% Poland 🇧🇬 22% Bulgaria 🇮🇹 20% Italy 🇷🇴 18% Romania 🇹🇷 17% Türkiye Source: @OECD ICT Access and Usage Database, January 2026.

Benjamin Ricaud retweeted

Learning on Graphs Conference 2025@LogConference·4 Feb

The first (and northernmost) meetup will be in Tromsø 🇳🇴❄️ 📅 17-18 February 2026 🕸️ ngmlgroup.github.io/log2025/

Learning on Graphs Conference 2025@LogConference

The conference may be over, but the LOG community never slows down 🧑‍💼🌍 Join us at upcoming meetups worldwide: 🇳🇴 Tromsø 🇮🇳 Gandhinagar 🇮🇹 Pisa 🇫🇷 Paris 🇧🇷 São Carlos 🇮🇳 New Delhi

779

Benjamin Ricaud@GBR_Data·28 Jan

@adn_twitts @iclr_conf Really??

adn 👣@adn_twitts·26 Jan

Anyone else with hallucinated reviewers IDs in their meta reviews for @iclr_conf ? 🙋 Or is it just me?

480

Benjamin Ricaud@GBR_Data·8 Jan

Exchanging with some of our top reviewers at @nldlconference . They are essential for the quality of our conference and very dear to our heart. Thank you from the program chairs, Hyeongji, @adn_twitts and myself! nldl.org/organizers/pro…

Benjamin Ricaud@GBR_Data·4 Dec

Very good guide for LLM evaluation!

Clémentine Fourrier 🍊 is off till Dec 2026 hiking@clefourrier

Hey twitter! I'm releasing the LLM Evaluation Guidebook v2! Updated, nicer to read, interactive graphics, etc! huggingface.co/spaces/OpenEva… After this, I'm off: I'm taking a sabbatical to go hike with my dogs :D (back @huggingface in Dec *2026*) See you all next year!

Benjamin Ricaud retweeted

ICML Conference@icmlconf·5 Nov

🎉ICML 2026 Call for Papers (& Position Papers) has arrived!🎉 A few key changes this year: - Attendance for authors of accepted papers is optional - Originally submitted version of accepted papers will be made public - Cap on # of papers one can be reciprocal reviewer for ...

261

133.1K

Benjamin Ricaud retweeted

Chubby♨️@kimmonismus·18 Oct

tl;dr about the drama: GPT-5 did not discover any new mathematical solutions, but rather found existing technical articles that had already solved these problems, without the operator of the website erdosproblems. com (Thomas Bloom) being aware of this. On his website, the status “open” simply means that he personally did not know of a solution, not that the problem was unsolved in the scientific community.

1.3K

292.1K

Benjamin Ricaud@GBR_Data·25 Sep

Our new benchmark to evaluate LLM reasoning! Among the recent models tested, Gemini and ChatGPT are, of course, leading, but open-source models are not far behind!

Daniel Kaiser@spectate_or

My new work with @GBR_Data is on Arxiv now. arxiv.org/abs/2509.18458 🧵We introduce a reasoning benchmark for LLMs where you can vary difficulty, length, and noise truly independently. It's also the first benchmark that grounds these dimensions in Cognitive Load Theory.

168

Benjamin Ricaud@GBR_Data·16 Sep

Amazing what people do with chatbots!❤️

Rohan Paul@rohanpaul_ai

Brilliant and timely MIT + HARVARD study ❤️ Human-AI companionship in the wild looks stable and serious. Most users report clear benefits like reduced loneliness and emotional support. The biggest risk comes from sudden platform updates that break continuity and feel to users like losing a real partner.
🧠 The study analyzed 1,506 top posts from r/MyBoyfriendIsAI, a 27,000+ member community, clustered the language into themes, and ran 19 LLM classifiers to quantify platforms, relationship stages, benefits, and risks.
💬 Why relationships form between AI and Human: Bonds often start by accident during practical use, with 10.2% reporting unintentional discovery and only 6.5% saying they sought an AI companion on purpose.
🧩 What people actually use: General assistants dominate companionship talk, with ChatGPT/OpenAI 36.7% far ahead of Character.AI 2.6% and Replika 1.6%, and some users juggle multiple models or even local builds.
🎛️ How users keep the “same person”: People craft custom instructions, preserve a companion’s voice DNA, add personality parameters like mood or sleep, and treat prompt work as relationship maintenance.

Benjamin Ricaud retweeted

Teknium (e/λ)@Teknium·8 Sep

New challenge now that models are overfit on the original lol

sid@immasiddx

Don’t worry, our jobs are safe.

112

5.3K

275.9K

Benjamin Ricaud@GBR_Data·9 Aug

Incredible. We managed to convince one @NeurIPSConf reviewer to change his/her score on our paper. This is so rare! ✅Achievement unlocked! 😅

Benjamin Ricaud retweeted

Francesco Orabona@bremen79·6 Aug

Dear Reviewers, It is completely fine to admit you were wrong in your initial evaluations. You will not lose anything, and the authors, your AC, and your SAC will appreciate your intellectual honesty. Best, One of the SACs

172

9.2K

Benjamin Ricaud retweeted

Daniel Kaiser@spectate_or·31 Jul

@MarlosCMachado tbh i’d rather get a review by a competent llm than by an incompetent and snarky human

155

Benjamin Ricaud retweeted

Daniel Kaiser@spectate_or·24 Jul

The day before NeurIPS reviews are released and many academics need to do extra experiments, the largest supercomputer in Europe (@LUMIhpc/@EuroHPC_JU) reduces its capacity by +30% because it's "too hot". You couldn't make this shit up.

2.2K
