
Jacob Beck
99 posts

Jacob Beck
@jakeABeck
Let’s get agents to learn fast! 🤖🔥 Research Scientist @Oracle | PhD @UniOfOxford, MS & BS @BrownUniversity, Predoc @Microsoft












@jsuarez Where the experience came from feels like an odd concepts boundary to me, and pragmatically the tools of offline RL look a lot more like those of RL than SL, but it’s hard to argue for the elegance of ultimately arbitrary definitions.







Offline RL is not RL. RL is about interaction. No interaction, no RL.

Welcome to the newest MLC Office Hours host, @jakeABeck, researcher at Oracle! Schedule a chat with Jacob at the link below to talk about RL, LLMs, hypernetworks, meta-learning, multi-agent RL, AI feedback, philosophy, and more! mlcollective.org/services/





Fantastic talk from @RichardSSutton at @RL_Conference with shoutouts to meta-RL. Honored to be called “more extreme” than Rich (by Rich) for taking the Bitter Lesson to heart and suggesting we meta-learn all the components he discussed. My Q: Aren’t LLMs already doing all this?



1️⃣ Exponential AI self-improvement is shaky. The real bottleneck isn’t code; it’s compute & data. In these areas, AIs training AIs are just as limited by the world as humans training AIs. For both, we’ve nearly exhausted the internet’s data.





