Qingni Wang
26 posts







Excited to share that I’ll be presenting SAFER at #ICLR2026 🇧🇷 🎊 ✨ SAFER is a two-stage risk control framework for open-ended QA with LLMs. Our goal is to move beyond heuristic uncertainty estimates toward statistically rigorous trustworthiness guarantees. Specifically, we introduce: • Abstention-aware Sampling, which calibrates the minimum sampling budget needed to satisfy a user-specified risk level. • Conformalized Filtering, which removes unreliable candidates while preserving coverage guarantees. Together, SAFER provides controllable miscoverage risk in open-ended generation and takes a step toward more trustworthy LLM deployment. 📌 Poster Session: April 24, 2026 | 3:15–5:45 PM BRT 📍 Pavilion 3, Poster P3-#1806 Authors: @Ceeqnn @YFan_UCSC @xwang_lk I’ll be presenting the poster — feel free to stop by and chat if you’re around. Looking forward to discussions at ICLR!







🚨 New paper alert 🚨 📌 How can we make GUI grounding models reliable in real-world interactions? We introduce 🚀 SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration In GUI agents, a single wrong click isn’t just an error — it can trigger costly or irreversible actions (e.g., unintended payments 💸 or deleting important files 🗑️). The real danger is silent failure: most GUI grounding models always output a coordinate, even when they’re unsure. Instead of trusting a single predicted point, SafeGround: • estimates spatial uncertainty from prediction variability • calibrates a decision threshold with statistical guarantees • enables risk-controlled GUI actions, even with black-box models 💻 Code: github.com/Cece1031/SAFEG… 📄 Paper: arxiv.org/pdf/2602.02419 🧵1/6 #Agents #GUI






















