
@JCzarlinski @__tinygrad__ Let me know what you’d like me to try, I may write a couple blog posts
English
罗杰斯
490 posts

@dhbrojas
AGI @ https://t.co/vrJX6VOASs, 清华大学














it is endlessly fascinating to me that we still don't have a true 1M-context model it's an unusual case where the infra is far ahead of the science. Claude discontinued 1M+ context bc it didn't really work past ~200k we don't have the right data? training techniques? not sure
