
John Zedlewski
933 posts

John Zedlewski
@zstats
Accelerating data science as eng director @NVIDIA RAPIDS. Previously DL for self driving cars, medical imaging, health data, econometrics, and kernel hacking.










Very proud that a cross NVIDIA effort led to top positions in Deep Research Bench I and II. DRB-I evaluates overall report quality: comprehensiveness, insight, instruction-following, and readability whereas DRB-II is more difficult and tests for information recall, analysis, and presentation. This was led by David Austin and @raja_biswas from my team. I contributed a bit. We will share more ASAP. huggingface.co/spaces/muset-a… #leaderboard" target="_blank" rel="nofollow noopener">agentresearchlab.com/benchmarks/dee…



We also verified that GPT-5.2 Pro (High) is SOTA for ARC-AGI-2, scoring 54.2% for $15.72/task (Due to API timeouts, we were unable to reliably verify GPT 5.2 Pro X-High on ARC-AGI-2) All verified GPT-5.2 family scores: arcprize.org/leaderboard






















