Changyue Wang

5 posts

@imBeBr2

PhD @Tsinghua_Uni and @thuir_lab

Joined January 2024
10 Following, 5 Followers
Pinned Tweet
Changyue Wang (@imBeBr2)
📢 Our work on detecting hallucinations in Large Reasoning Models (LRMs) has been accepted at #AAAI2026 as an Oral Presentation! We introduce RACE, the first hallucination detection framework designed specifically for opaque-box LRMs. (1/3)
Changyue Wang (@imBeBr2)
Existing detectors check only the final answer, missing fatal flaws hidden in the reasoning trace. We built the solution. 👋 RACE (Reasoning and Answer Consistency Evaluation) is the first hallucination detector to jointly evaluate an LRM's full reasoning and answer behavior. (2/3)
Changyue Wang (@imBeBr2)
👉 Code: github.com/bebr2/EditCoT
EditCoT works by generating an initial chain of thought (CoT) and then iteratively refining it, based on new knowledge, using a trained editor. This flexible approach allows LLMs to adapt their reasoning across various tasks and languages without task-specific adjustments.
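
The generate-then-edit loop described in the EditCoT post can be sketched roughly as follows. This is only a toy illustration under stated assumptions, not the code from the linked repository: `refine_chain`, `toy_generate`, `toy_edit`, the `max_rounds` cap, and the stop-when-unchanged rule are all hypothetical, and the string-replacing "editor" merely stands in for the trained editor model.

```python
from typing import Callable, Dict, List

Chain = List[str]  # a chain of thought, one reasoning step per string


def refine_chain(
    question: str,
    new_facts: Dict[str, str],
    generate: Callable[[str], Chain],
    edit: Callable[[Chain, Dict[str, str]], Chain],
    max_rounds: int = 3,
) -> Chain:
    """Generate an initial chain, then edit it until it stops changing."""
    chain = generate(question)
    for _ in range(max_rounds):
        revised = edit(chain, new_facts)
        if revised == chain:  # assumed stopping rule: the editor made no change
            break
        chain = revised
    return chain


def toy_generate(question: str) -> Chain:
    # Hypothetical stand-in for an LLM producing an initial (outdated) chain.
    return ["The CEO of Acme is Alice.", "So the answer is Alice."]


def toy_edit(chain: Chain, new_facts: Dict[str, str]) -> Chain:
    # Hypothetical stand-in for the trained editor: rewrite only the first
    # step that still mentions an outdated value, leaving the rest untouched.
    for i, step in enumerate(chain):
        for old, new in new_facts.items():
            if old in step:
                return chain[:i] + [step.replace(old, new)] + chain[i + 1:]
    return chain


result = refine_chain(
    "Who is the CEO of Acme?", {"Alice": "Bob"}, toy_generate, toy_edit
)
print(result)
```

Because the toy editor fixes one step per call, the loop needs two edit rounds to update both steps and a third to confirm nothing else changes, which is why the sketch iterates rather than editing once.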