
Can frontier models cost-effectively accelerate ML workloads via optimizing GPU kernels? Our take at METR: yes, and they’re improving pretty steeply – but it’s easy to miss these capabilities without good elicitation and “fair” compute spend.
Tao Lin
1.9K posts

@taoroalin
Will hug, will not agree.

Can frontier models cost-effectively accelerate ML workloads via optimizing GPU kernels? Our take at METR: yes, and they’re improving pretty steeply – but it’s easy to miss these capabilities without good elicitation and “fair” compute spend.










one thing many don't get about decision theory is you're allowed to say "nope" to bad problem statements. "you can take either the $5 or the $10, and a perfect predictor tells you you take the $5; what do you do?" nope. not a real possibility.


> Claude does not have a pointer to almost any of what constitutes human values What





🫱 Introducing 𝐍𝐞𝐮𝐫𝐚𝐥 𝐂𝐨𝐦𝐩𝐮𝐭𝐞𝐫s: 𝐰𝐡𝐚𝐭 𝐢𝐟 𝐀𝐈 𝐝𝐨𝐞𝐬 𝐧𝐨𝐭 𝐣𝐮𝐬𝐭 𝐮𝐬𝐞 𝐜𝐨𝐦𝐩𝐮𝐭𝐞𝐫𝐬 𝐛𝐞𝐭𝐭𝐞𝐫, 𝐛𝐮𝐭 𝐛𝐞𝐠𝐢𝐧𝐬 𝐭𝐨 𝐛𝐞𝐜𝐨𝐦𝐞 𝐭𝐡𝐞 𝐫𝐮𝐧𝐧𝐢𝐧𝐠 𝐜𝐨𝐦𝐩𝐮𝐭𝐞𝐫 𝐢𝐭𝐬𝐞𝐥𝐟? Beyond today's conventional computers, agents, and world models, Neural Computers (NCs) are new frontiers where computation, memory, and I/O move into a learned runtime state. We ask: whether parts of runtime can move inward into the learning system itself. This is our first step toward the Completely Neural Computer (CNC): a general-purpose neural computer with stable execution, explicit reprogramming, and durable capability reuse. Work done with Mingchen Zhuge (@MingchenZhuge), Changsheng Zhao, Haozhe Liu (@HaoZhe65347 ), Zijian Zhou (@ZijianZhou524 ), Shuming Liu (@shuming96 ), Wenyi Wang (@Wenyi_AI_Wang ), Ernie Chang (@erniecyc ), Gael Le Lan, Junjie Fei, Wenxuan Zhang, Zhipeng Cai (@cai_zhipeng ), Zechun Liu (@zechunliu ), Yunyang Xiong (@YoungXiong1 ), Yining Yang, Yuandong Tian (@tydsh ), Yangyang Shi, Vikas Chandra (@vikasc), Juergen Schmidhuber (@SchmidhuberAI)



Was anything achieved by delaying the release of the full version of GPT-2 by 9 months, from February 2019 to November 2019?








direct kinetic impact. a flying sword. 450km/h. updated video showing exactly that. we're also working on the explosive variant. only for authorized partners. dms are open.