Reece Keller

20 posts

Reece Keller

@rdkeller

CS+Neuro @CarnegieMellon. PhD Student with @xaqlab and @aran_nayebi. Connecting unsupervised reinforcement learning and neuroscience.

Pittsburgh, PA 参加日 Şubat 2021

494 フォロー中244 フォロワー

Reece Keller がリツイート

Aran Nayebi@aran_nayebi·5 Mar

If you're attending @CosyneMeeting, come check out our NeuroAgents workshop on Tuesday March 17. Speakers include: Omri Barak, Cristina Savin @lilwebian @rdkeller Caroline Haimerl @JonathanCKao @Ch0iHannah @xaqlab @srinituraga Yanan Sui @TrackingPlumes More details below 👇

Satpreet (Sat) Singh@tweetsatpreet

📣 Excited to announce the 2nd edition of our workshop “Agent-Based Models in Neuroscience: Theory, Autonomy, Embodiment & Environment” at @CosyneMeeting #CoSyNe2026! 🧠🤖🌍🪰🐟🐭💪🧘🏃 🗓️ March 17, 2026 📍 Cascais, Portugal 🔗 Speakers and schedule: neuro-agent-models.github.io

English

2.5K

Reece Keller がリツイート

Ben Eysenbach@ben_eysenbach·11 Eki

Kids spend years playing with blocks, building spatial+arithmetic skills. Today, AI models just read. While AI research often conflates reasoning with language models, block-building lets us study how embodied reasoning might emerge from exploration and trial-and-error learning.

Raj Ghugare@GhugareRaj

Scalable learning mechanisms for agents that solve novel tasks via experience remain an open problem. We argue that a key reason is suitable benchmarks. Simply put, most current generation of interactive benchmarks lack diversity in the skills that could be learned from them. Presenting BuilderBench, a benchmark to accelerate research in pre-training that centers learning from experience. Website: rajghugare19.github.io/builderbench/i…

English

2.3K

Reece Keller@rdkeller·29 Eyl

@xlr8harder @fleetingbits The prior encoded by a genome is in no way comparable to human-generated data. Sutton’s point was not that we should start from zero, but to challenge RLHF as a good prior for understanding the world since HF is not an objective nor well-defined reward function.

English

xlr8harder@xlr8harder·28 Eyl

@fleetingbits I don't get the issue with pretraining on human-generated data. It's an effective way to bootstrap a prior on the world. Humans have a prior on the world too, via our genome. Why should we start from zero? Am I missing something?

English

6.4K

FleetingBits@fleetingbits·28 Eyl

Some thoughts on the Dwarkesh Richard Sutton interview: 1) Richard Sutton has internalized the bitter lesson to a very impressive degree. 2) He doesn't like pretraining because human set the data used in pretraining. He doesn't like post-training because humans set the curriculum. 3) He wants the agent to be able to be given a goal and then be able to loop to learn how to accomplish the goal on its own, just interacting with the world. 4) This involves the agent getting a progressively richer world model, related to its goals, which it is able to manipulate to accomplish its tasks. 5) I don't think that anyone at OpenAI, DeepMind or Anthropic would really disagree with this as the ultimate goal. 6) Whether, it is specialized models that interact in order to form an agent, with an interior training loop, or whether it's in context learning, or whatever. 8) I think the bigger issue with the interview was just that Dwarkesh wasn't familiar with Richard Sutton's way of thinking or talking. 9) Richard Sutton feels very connectionism, early AI, etc... and he understands the material, but he has a more focused worldview.

English

784

83K

Reece Keller がリツイート

Daniel Yamins@dyamins·16 Eyl

Here is our best thinking about how to make world models. I would apologize for it being a massive 40-page behemoth, but it's worth reading. arxiv.org/pdf/2509.09737

Klemen Kotar@KlemenKotar

1/ A good world model should be promptable like an LLM, offering flexible control and zero-shot answers to many questions. Language models have benefited greatly from this fact, but it's been slow to come to vision. We introduce PSI: a path to truly interactive visual world models 🧵

English

224

32.4K

Reece Keller@rdkeller·25 Tem

@RahulChandwaney Amazing work! Where is the top left photo from?

English

105

Rahul C@RahulChandwaney·22 Tem

Inspiration VS Artwork

English

217

Reece Keller がリツイート

Daniel Yamins@dyamins·16 Tem

Over the past 18 months my lab has been developing a new approach to visual world modeling. There will be a magnum opus that ties it all together out in the next couple of weeks. But for now there are some individual application papers that have poked out.

Klemen Kotar@KlemenKotar

📷 New Preprint: SOTA optical flow extraction from pre-trained generative video models! While it seems intuitive that video models grasp optical flow, extracting that understanding has proven surprisingly elusive.

English

8.2K

Reece Keller がリツイート

Aviral Kumar@aviral_kumar2·24 Haz

Given the confusion around what RL does for reasoning in LLMs, @setlur_amrith & I wrote a new blog post on when RL simply sharpens the base model & when it discovers new reasoning strategies. Learn how to measure discovery + methods to enable it ⬇️ tinyurl.com/rlshadis

English

273

17.2K

Reece Keller がリツイート

Kevin Ellis@ellisk_kellis·12 Haz

New paper: World models + Program synthesis by @topwasu 1. World modeling on-the-fly by synthesizing programs w/ 4000+ lines of code 2. Learns new environments from minutes of experience 3. Positive score on Montezuma's Revenge 4. Compositional generalization to new environments topwasu.github.io/poe-world [1/n]

English

102

567

58K

Reece Keller@rdkeller·12 Haz

@VictorTaelin Morphoceuticals

English

Taelin@VictorTaelin·12 Haz

what are the most exciting developments taking place in biotech right now? companies or teams to watch for? what is the "this will solve everything" of today? CRISPR?

English

203

19.1K

Reece Keller@rdkeller·5 Haz

10/ Animal-like autonomy—flexibly adapting to new environments without supervision—is a key ingredient of general intelligence. Our work shows this hinges on 1) a predictive world model and 2) memory primitives that ground these predictions in ethologically relevant contexts.

English

1.4K

Reece Keller@rdkeller·5 Haz

9/ Finally, we show that the neural-glial circuit proposed in Mu et al. (2019) emerges from the latent dynamics of 3M-Progress agents. Thanks to my collaborators Alyn T. and @fel_p8, and to @xaqlab for his continued support! Paper link: arxiv.org/abs/2506.00138

English

1.7K

Reece Keller@rdkeller·5 Haz

1/ I'm excited to share recent results from my first collaboration with the amazing @aran_nayebi and @Leokoz8! We show how autonomous behavior and whole-brain dynamics emerge in embodied agents with intrinsic motivation driven by world models.

English

385

52.2K

Reece Keller@rdkeller·23 Şub

@mattswider Mine has been counting down minute my minute from 24 and I joined the queue as soon as you tweeted. 18 minutes now...😭this is so stressful

English

Matt Swider (The Shortcut)@mattswider·23 Şub

This is what you want to see – eventually – to get a PS5 restock @ Sony Direct. It'll say "more than an hour wait", then go to 22 minutes, then 11 minutes and count down from there. As long as this doesn't happen at the very end (when it's sold out), you probably have a chance🍀

Jonatan@jonatanb05

@mattswider I have 11 minutes

English

109

115

ディスカバー

@CosyneMeeting @lilwebian @JonathanCKao @Ch0iHannah @xaqlab @srinituraga @TrackingPlumes @xlr8harder