Reece Keller

20 posts

@rdkeller

CS+Neuro @CarnegieMellon. PhD Student with @xaqlab and @aran_nayebi. Connecting unsupervised reinforcement learning and neuroscience.

Pittsburgh, PA · Joined February 2021
494 Following · 244 Followers
Reece Keller retweeted
Aran Nayebi @aran_nayebi
If you're attending @CosyneMeeting, come check out our NeuroAgents workshop on Tuesday, March 17. Speakers include: Omri Barak, Cristina Savin @lilwebian @rdkeller Caroline Haimerl @JonathanCKao @Ch0iHannah @xaqlab @srinituraga Yanan Sui @TrackingPlumes. More details below 👇
Satpreet (Sat) Singh @tweetsatpreet

📣 Excited to announce the 2nd edition of our workshop “Agent-Based Models in Neuroscience: Theory, Autonomy, Embodiment & Environment” at @CosyneMeeting #CoSyNe2026! 🧠🤖🌍🪰🐟🐭💪🧘🏃 🗓️ March 17, 2026 📍 Cascais, Portugal 🔗 Speakers and schedule: neuro-agent-models.github.io

0
6
19
2.5K
Reece Keller retweeted
Ben Eysenbach @ben_eysenbach
Kids spend years playing with blocks, building spatial+arithmetic skills. Today, AI models just read. While AI research often conflates reasoning with language models, block-building lets us study how embodied reasoning might emerge from exploration and trial-and-error learning.
Raj Ghugare @GhugareRaj

Scalable learning mechanisms for agents that solve novel tasks via experience remain an open problem. We argue that a key reason is the lack of suitable benchmarks. Simply put, most current-generation interactive benchmarks lack diversity in the skills that could be learned from them. Presenting BuilderBench, a benchmark to accelerate research in pre-training that centers learning from experience. Website: rajghugare19.github.io/builderbench/i…

1
4
14
2.3K
Reece Keller @rdkeller
@xlr8harder @fleetingbits The prior encoded by a genome is in no way comparable to human-generated data. Sutton’s point was not that we should start from zero, but to challenge RLHF as a good prior for understanding the world, since HF is neither an objective nor a well-defined reward function.
0
0
3
47
xlr8harder @xlr8harder
@fleetingbits I don't get the issue with pretraining on human-generated data. It's an effective way to bootstrap a prior on the world. Humans have a prior on the world too, via our genome. Why should we start from zero? Am I missing something?
17
1
82
6.4K
FleetingBits @fleetingbits
Some thoughts on the Dwarkesh Richard Sutton interview:
1) Richard Sutton has internalized the bitter lesson to a very impressive degree.
2) He doesn't like pretraining because humans set the data used in pretraining. He doesn't like post-training because humans set the curriculum.
3) He wants the agent to be given a goal and then be able to loop to learn how to accomplish the goal on its own, just interacting with the world.
4) This involves the agent getting a progressively richer world model, related to its goals, which it is able to manipulate to accomplish its tasks.
5) I don't think that anyone at OpenAI, DeepMind or Anthropic would really disagree with this as the ultimate goal.
6) Whether it is specialized models that interact in order to form an agent, with an interior training loop, or whether it's in-context learning, or whatever.
8) I think the bigger issue with the interview was just that Dwarkesh wasn't familiar with Richard Sutton's way of thinking or talking.
9) Richard Sutton feels very connectionist, early AI, etc... and he understands the material, but he has a more focused worldview.
36
36
784
83K
Reece Keller retweeted
Rahul C @RahulChandwaney
Inspiration VS Artwork
[2 images]
1
16
217
7K
Reece Keller retweeted
Daniel Yamins @dyamins
Over the past 18 months my lab has been developing a new approach to visual world modeling. There will be a magnum opus that ties it all together out in the next couple of weeks. But for now there are some individual application papers that have poked out.
Klemen Kotar @KlemenKotar

📷 New Preprint: SOTA optical flow extraction from pre-trained generative video models! While it seems intuitive that video models grasp optical flow, extracting that understanding has proven surprisingly elusive.

1
15
78
8.2K
Reece Keller retweeted
Aviral Kumar @aviral_kumar2
Given the confusion around what RL does for reasoning in LLMs, @setlur_amrith & I wrote a new blog post on when RL simply sharpens the base model & when it discovers new reasoning strategies. Learn how to measure discovery + methods to enable it ⬇️ tinyurl.com/rlshadis
4
35
273
17.2K
Reece Keller retweeted
Kevin Ellis @ellisk_kellis
New paper: World models + Program synthesis by @topwasu
1. World modeling on-the-fly by synthesizing programs w/ 4000+ lines of code
2. Learns new environments from minutes of experience
3. Positive score on Montezuma's Revenge
4. Compositional generalization to new environments
topwasu.github.io/poe-world [1/n]
16
102
567
58K
Taelin @VictorTaelin
what are the most exciting developments taking place in biotech right now? companies or teams to watch for? what is the "this will solve everything" of today? CRISPR?
60
4
203
19.1K
Reece Keller @rdkeller
10/ Animal-like autonomy—flexibly adapting to new environments without supervision—is a key ingredient of general intelligence. Our work shows this hinges on 1) a predictive world model and 2) memory primitives that ground these predictions in ethologically relevant contexts.
[image]
4
1
30
1.4K
Reece Keller @rdkeller
9/ Finally, we show that the neural-glial circuit proposed in Mu et al. (2019) emerges from the latent dynamics of 3M-Progress agents. Thanks to my collaborators Alyn T. and @fel_p8, and to @xaqlab for his continued support! Paper link: arxiv.org/abs/2506.00138
[image]
1
0
18
1.7K
Reece Keller @rdkeller
1/ I'm excited to share recent results from my first collaboration with the amazing @aran_nayebi and @Leokoz8! We show how autonomous behavior and whole-brain dynamics emerge in embodied agents with intrinsic motivation driven by world models.
9
65
385
52.2K
Reece Keller @rdkeller
@mattswider Mine has been counting down minute by minute from 24, and I joined the queue as soon as you tweeted. 18 minutes now... 😭 this is so stressful
0
0
0
0
Matt Swider (The Shortcut) @mattswider
This is what you want to see – eventually – to get a PS5 restock @ Sony Direct. It'll say "more than an hour wait", then go to 22 minutes, then 11 minutes and count down from there. As long as this doesn't happen at the very end (when it's sold out), you probably have a chance🍀
Jonatan @jonatanb05

@mattswider I have 11 minutes

109
7
115
0