
Who Tests the Testers? Systematic Enumeration and Coverage Audit of LLM Agent Tool Call Safety
Xuan Chen, Lu Yan, Ruqi Zhang, Xiangyu Zhang
arxiv.org/abs/2603.18245 [๐๐.๐๐ด ๐๐.๐ฒ๐]

English
Software Engineering
51.8K posts

@ComputerPapers
Software Engineering submissions to https://t.co/d7SY8OJa0Z (unofficial): design tools, software metrics, testing and debugging, programming environments.







































