
Richard Sutton

@RichardSSutton
Student of mind and nature, libertarian, chess player, cancer survivor. @ Keen, UAlberta, Amii, https://t.co/u8za2Kod54, The Royal Society, Turing Award

We developed an RL method for fine-tuning our models for precise tasks in just a few hours or even minutes. Instead of training the whole model, we add an “RL token” output to π-0.6, our latest model, which is used by a tiny actor and critic to learn quickly with RL.

UTAR proudly marks the successful conclusion of the 1st Openmind Winter School. 📄 Read the full feature in The Star: thestar.com.my/metro/metro-ne… #UTAR #OpenmindWinterSchool #OpenmindResearchInstitute #AIMalaysia


@RichardSSutton Thrilled to hear this! Did a video on your work: youtu.be/Dov68JsIC4g
