notebook enthusiast
5.7K posts

notebook enthusiast
@enthusednotebk
prove the existence of discomfort

I am really enjoying the near-daily stream of interesting papers on the economics of frontier AI. The field building is working and now we have interesting work coming from senior economists as well as junior econ researchers and PhDs. From here, I would like to see computer scientists involved. Collaborations across CS and econ are still very rare even as this area grows. Relative to what we have now, I think the econ work can be sharpened to have more acuity in its study of frontier AI technology. More CS folks at NBER convenings; more economists at NeurIPS and ICML.






Striking image from the new Anthropic labor market impact report.









Across all mini-SWE-agent + <model> runs, SWE-bench Verified's current "ceiling"? - 87.4% (0.874 - 0.8) * 500 = another *37* instances that aren't solved consistently. If you recalculate this number across all official SWE-bench Verified submissions? - 95% from SWE-bench site





















