
ClaraS
1.1K posts

ClaraS
@ClaraShik
Head of Research @ChaincodeLabs. Math, crypto, literature, and other joys.






Very excited about the "First Proof" challenge. I believe novel frontier research is perhaps the most important way to evaluate capabilities of the next generation of AI models. We have run our internal model with limited human supervision on the ten proposed problems. The problems require expertise in their respective domains and are not easy to verify; based on feedback from experts, we believe at least six solutions (2, 4, 5, 6, 9, 10) have a high chance of being correct, and some further ones look promising. We will only publish the solution attempts after midnight (PT), per the authors' guidance - the sha256 hash of the PDF is d74f090af16fc8a19debf4c1fec11c0975be7d612bd5ae43c24ca939cd272b1a . This was a side-sprint executed in a week mostly by querying one of the models we're currently training; as such, the methodology we employed leaves a lot to be desired. We didn't provide proof ideas or mathematical suggestions to the model during this evaluation; for some solutions, we asked the model to expand upon some proofs, per expert feedback. We also manually facilitated a back-and-forth between this model and ChatGPT for verification, formatting and style. For some problems, we present the best of a few attempts according to human judgement. We are looking forward to more controlled evaluations in the next round! 1stproof.org #1stProof









יצא לי לאתגר בשבועות האחרונים כמה מודלי שפה עם טענות והוכחות מתמטיות, כולם בגרסה הכי מתקדמת שזמינה כרגע. עד כה, ב-100% מהמקרים הם יצרו תוצאות משכנעות, מפורטות, מקוריות ופשוט לא נכונות.

I hear there is this cool experiment where they let agents interact with each other in a closed system. I think it’s called bluesky or something?







Mathematics. Technical-journal publisher allowed a coauthor to be DEAD -- because the coauthor contributed content to the first author from the afterlife, or from within a dream experienced by the first author.







1/ Quantum computing predictions lately range from "public key cryptography will be broken in 2 years" to "it's a century away." Both are wrong. My latest post explains what publicly known progress actually supports — and what blockchains should do about it. Thread below 🧵




