
Are your skills actually improving outcomes — or just adding more tokens & complexity? @OpenAI @AnthropicAI @cursor_ai
I Built agent-skills-eval to benchmark skills, measure uplift, track consistency & catch regressions.
github.com/darkrishabh/ag…
#ai #agentskills #evals #agents
English






