The hardest cases? When some answers conflict while others align. These “mixed” scenarios tripped up even top models, revealing gaps in reasoning and limitations in current generation strategies.
🚨 New paper out! 📄
What happens when LLMs & RLMs face conflicting answers to a question? 🤔
They often ignore disagreement and confidently pick one “correct” answer. 🤯
📄 arxiv.org/pdf/2508.12355#AI#LLM#NLP#MachineLearning