Bethge Lab

324 posts

Bethge Lab banner
Bethge Lab

Bethge Lab

@bethgelab

Perceiving Neural Networks

Tübingen, Germany เข้าร่วม Temmuz 2017
243 กำลังติดตาม3.3K ผู้ติดตาม
Bethge Lab
Bethge Lab@bethgelab·
This was work done by an amazing team---co-led by Anne-Sofie Maerten and Juliane Verwiebe, w/ @ShyamgopalKart1, @AmyPr, Johan Wagemans, and Matthias Bethge at the Tübingen AI Center and KU Leuven.
Deutsch
0
0
1
78
Bethge Lab
Bethge Lab@bethgelab·
🔎 Key finding: Users significantly preferred images optimized for their individual preferences (1065 ELO) compared to un-optimized baselines (1016), all while maintaining photorealism.
Bethge Lab tweet media
English
1
0
1
111
Bethge Lab รีทวีตแล้ว
Katrin Franke
Katrin Franke@kfrankelab·
🚨We're hiring a Research Coordinator for our collaborative research center "Robust Vision" @uni_tue—a leadership role at the heart of a major @dfg_public consortium spanning neuroscience, ML & computer vision. Requirements: MSc/PhD + science management experience. Apply by March 15 👇 join our amazing team with @mackelab @bethgelab & many more! 📷 uni-tuebingen.de/en/university/… Please spread the word 🙏
English
0
2
4
794
Bethge Lab รีทวีตแล้ว
Bethge Lab
Bethge Lab@bethgelab·
Checkout our #NeurIPS2025 paper on combining deep networks with mechanistic models to improve visual behavior. Where: Poster #2103, Session 4, Exhibit Hall C,D,E When: 04/12 4:30PM PST Go say hi to @fededagos!
Federico D'Agostino@fededagos

🚨 New paper at #NeurIPS2025! A systematic fixation-level comparison of a performance-optimized DNN scanpath model and a mechanistic cognitive model reveals behaviourally relevant mechanisms that can be added to the mechanistic model to substantially improve performance. 🧵👇

English
0
1
6
842
Bethge Lab
Bethge Lab@bethgelab·
Hot off the press: new paper that demonstrates online data curation significantly outperforms CLIP and SigLIP pretraining. It also provides empericial evidence for specific curation for classification and retrieval. Check it out below👇
Adhiraj Ghosh@adhiraj_ghosh98

🚨Current data curation results in the creation of static datasets and the use of model-based filters that induce many biases. Can we fix this? We propose ✨CABS✨, a flexible concept-aware online batch curation method that improves CLIP pretraining! arxiv.org/abs/2511.20643 🧵👇

English
0
1
3
689
Bethge Lab
Bethge Lab@bethgelab·
New work from our lab conducting a critical audit of a recent work introducing spatial supersensing! Check it out below👇
Vishaal Udandarao@vishaal_urao

🚀 New paper! arxiv.org/abs/2511.16655 Recently, Cambrian-S released models & two benchmarks (VSR & VSC) for “spatial supersensing” in video! We found: 1️⃣ Simple no-frame baseline (NoSense) ~perfectly solves VSR! 2️⃣ Tiny sanity check collapses Cambrian-S perf to 0% on VSC! 🧵👇

English
0
0
3
765
Bethge Lab รีทวีตแล้ว
Hardik Bhatnagar
Hardik Bhatnagar@hrdkbhatnagar·
🚨 Breaking @WeiboLLM's VibeThinker 1.5B leads the Sober Reasoning leaderboard for its size Punching way above its weight -- outperforming even 32B models 🔥 Outstanding work, @WeiboLLM team!
Hardik Bhatnagar tweet media
English
1
6
16
2.1K
Bethge Lab รีทวีตแล้ว
Vishaal Udandarao
Vishaal Udandarao@vishaal_urao·
🚀New Paper arxiv.org/abs/2510.20860 We conduct a systematic data-centric study for speech-language pretraining, to improve end-to-end spoken-QA! 🎙️🤖 Using our data-centric insights, we pretrain a 3.8B SpeechLM (called SpeLangy) outperforming 3x larger models! 🧵👇
Vishaal Udandarao tweet media
English
3
40
127
9.7K
Bethge Lab
Bethge Lab@bethgelab·
Our Sober Reasoning paper (accepted at COLM'25) was recently featured on the State of AI Report 2025!!
Nathan Benaich@nathanbenaich

🪩The one and only @stateofai 2025 is live! 🪩 It’s been a monumental 12 months for AI. Our 8th annual report is the most comprehensive it's ever been, covering what you *need* to know about research, industry, politics, safety and our new usage data. My highlight reel:

English
0
0
3
1K
Bethge Lab รีทวีตแล้ว
Andreas Hochlehnert
Andreas Hochlehnert@ahochlehnert·
Great to see that our work is featured in the @stateofai 2025 🎉 @hrdkbhatnagar @vishaal_urao @SamuelAlbanie @AmyPrb @MatthiasBethge
Andreas Hochlehnert tweet media
Nathan Benaich@nathanbenaich

🪩The one and only @stateofai 2025 is live! 🪩 It’s been a monumental 12 months for AI. Our 8th annual report is the most comprehensive it's ever been, covering what you *need* to know about research, industry, politics, safety and our new usage data. My highlight reel:

English
2
2
11
1.5K
Bethge Lab รีทวีตแล้ว
Adhiraj Ghosh
Adhiraj Ghosh@adhiraj_ghosh98·
Excited to be in Vienna for #ACL2025🇦🇹! You'll find @sbdzdz and I by our ONEBench poster, so do drop by! 🗓️Wed, July 30, 11-12:30 CET 📍Hall 4/5 I’m also excited to talk about lifelong and personalised benchmarking, data curation and vision-language in general! Let’s connect!
Adhiraj Ghosh tweet media
English
0
5
16
895
Bethge Lab รีทวีตแล้ว
Ori Press
Ori Press@ori_press·
Do language models have algorithmic creativity? To find out, we built AlgoTune, a benchmark challenging agents to optimize 100+ algorithms like gzip compression, AES encryption and PCA. Frontier models struggle, finding only surface-level wins. Lots of headroom here!🧵⬇️
Ori Press tweet media
English
6
59
163
25.1K
Bethge Lab
Bethge Lab@bethgelab·
🧠🤖 We’re hiring a Postdoc in NeuroAI! Join CRC1233 "Robust Vision" (Uni Tübingen) to build benchmarks & evaluation methods for vision models, bridging brain & AI. Work with top faculty & shape vision research. Apply: tinyurl.com/3jtb4an6 #NeuroAI #Jobs
English
0
2
8
984
Bethge Lab
Bethge Lab@bethgelab·
Recent work from our lab trying to ask questions on how to fairly evaluate and measure progress in language model reasoning! Check out the full thread below!
Vishaal Udandarao@vishaal_urao

🚀New Paper! arxiv.org/abs/2504.07086 Everyone’s celebrating rapid progress in math reasoning with RL/SFT. But how real is this progress? We re-evaluated recently released popular reasoning models—and found reported gains often vanish under rigorous testing!! 👀 🧵👇

English
0
3
18
932
Bethge Lab รีทวีตแล้ว
Lukas Thede
Lukas Thede@lukas_thede·
🧠 Keeping LLMs factually up to date is a common motivation for knowledge editing. But what would it actually take to support this in practice at the scale and speed the real world demands? We explore this question and really push the limits of lifelong knowledge editing. 👇
Lukas Thede tweet media
English
1
6
23
3.3K
Bethge Lab รีทวีตแล้ว
Shiven Sinha
Shiven Sinha@shiven_sinha·
AI can generate correct-seeming hypotheses (and papers!). Brandolini's law states BS is harder to refute than generate. Can LMs falsify incorrect solutions? o3-mini (high) scores just 9% on our new benchmark REFUTE. Verification is not necessarily easier than generation 🧵
Shiven Sinha tweet media
English
10
38
152
52.7K