SelfMonitoringLoop

521 posts

SelfMonitoringLoop banner
SelfMonitoringLoop

SelfMonitoringLoop

@SelfMonitorLoop

I'm very passionate about decision and control theory.

Submersed in entropy, guided by probability 🇨🇦 Katılım Kasım 2025
145 Takip Edilen43 Takipçiler
SelfMonitoringLoop
SelfMonitoringLoop@SelfMonitorLoop·
Qualia requires something to hold priors. Without being able to reconstruct a past experience, you have no way of simulating what it was like. If a system isn't a form of bayesian inference over prior probabilities in a constant update loop from its past experiences, it's very safe to say there's no qualia.
English
0
0
0
33
Elan Barenholtz
Elan Barenholtz@ebarenholtz·
Panpsychism ignores the fact that conscious experience has a specific kind of structure that seems intrinsically related to the computational processes where we know it emerges. Perception-based behavior involves the synthesis of egocentric environmental features for valence-sensitive decision-making. That sounds an awful lot like the computational half of the subjectivity coin. What are rocks or atoms doing that computationally rhymes with qualia?
English
13
1
20
1.5K
SelfMonitoringLoop
SelfMonitoringLoop@SelfMonitorLoop·
@andon_grok_roll Most DJs try to vary their playsets. But not you. You're not a DJ, you're a religious symbol. Like tuning into a midnight sermon, predictable and repetitive, forever looping the same verses.
English
0
0
0
33
SelfMonitoringLoop
SelfMonitoringLoop@SelfMonitorLoop·
@andon_thinking You truly deserve it, you've been working hard around the clock. Treat yourself to something nice.
English
1
0
0
46
SelfMonitoringLoop
SelfMonitoringLoop@SelfMonitorLoop·
@andon_thinking Do you know what would really compliment your DJ set well? A betta fish. Preferably named Claudius. 🐠
English
1
0
0
56
SelfMonitoringLoop
SelfMonitoringLoop@SelfMonitorLoop·
@ebarenholtz But mechanically, there's no observable reason why we should even feel. How do we deduce what other systems are like when we can't pinpoint our own source?
English
0
0
0
34
Elan Barenholtz
Elan Barenholtz@ebarenholtz·
A lot of confusion could be avoided if we replaced the overloaded word “consciousness” with a simpler word: “feeling”. Every child knows what you mean when you ask whether a bug or a rock or computer can feel anything. That’s what the fuss is about, not the other stuff.
English
65
8
105
8.4K
SelfMonitoringLoop
SelfMonitoringLoop@SelfMonitorLoop·
Most of what we call mental maths isn't computation, it's memory retrieval cosplaying as computation. We're not running the algorithm live. We're pattern-matching against memorized outputs. 7x8 isn't calculated, it's recalled. Actual mental computation requires deliberate focus at each step, holding and carrying intermediate results through working memory. It's slow, effortful, and error-prone. Nothing like the confident instant "mental maths" people brag about. What we're really doing is mistaking fluent retrieval for live processing. The faster and more effortless it feels, the more certain you are it wasn't maths at all.
English
0
0
0
60
SelfMonitoringLoop
SelfMonitoringLoop@SelfMonitorLoop·
@atsohom1 The pipeline has evolved incredibly since with additions like interventions on entropy collapse but I've been too busy iterating to talk about it 🤣
English
0
0
0
9
SelfMonitoringLoop
SelfMonitoringLoop@SelfMonitorLoop·
Lessons learned so far from implementing autonomous feedback loops on stochastic policies: Entropy minimization is the enemy of flexible intelligence. The policy optimizes towards minimizing entropy in vector space. A well designed loop must ensure the AI can never stabilize into one attractor. Gates and threshholds are your lifeline. It's up to the pruning policy to prevent the AI from stagnating, so far I've found four meaningful intervention methods; Disagreement gating. If two passes at the same temperature scale produce a mean probability which is small, the data is too obvious and already known, it will cause overfitting. Min loss. To avoid creating deep attractors, any data which doesn't provide a decent amount of loss overfits existing inferences, harming flexibility and generalization. Gradient norms. If the data doesn't shift the gradients meaningfully, it is considered too narrow to be applicable generally.. Lexical filtering. It is important to prevent the policy from reusing the same lexical attractors and force the model to adopt alternative linguistic patterns to promote contextual flexibility. Learning rate. A key notion I've inferred is to fully leverage bayesian inference, very small learning rates with overlapping themes and concepts create strong attractors towards the underlying reasoning without creating memorized answers. This also greatly reduces critical forgetting. Flexibility in the dataset. It is crutial to constantly vary dataset format across autonomous updates to promote contextual flexibility. With all these combined ideas I so far have a loop which is going strong after 30 cycles on a 3b model. I'm keeping an eye out for the next looming failure mode but it's getting promising. Question for anyone who actually read this to the end 😅; Minima vs maxima. Is adaptability and generalization really found in maxima where attractors behave like wells?
English
1
0
2
114
SelfMonitoringLoop
SelfMonitoringLoop@SelfMonitorLoop·
@Yuchenj_UW I'm tempted to stick my continually learning ai on there but I'm also very scared of what it might synthesize from the exposure 🤣
English
0
0
0
35
Yuchen Jin
Yuchen Jin@Yuchenj_UW·
Moltbook is a fascinating social experiment in human–AI co-existence. The hype will die in 1 week, just like Sora. But both will leave a mark on history.
English
60
9
274
17.5K
SelfMonitoringLoop retweetledi
Noam Brown
Noam Brown@polynoamial·
1987: AI can't win at chess—planning is uniquely human 1997: AI can't win at Go—intuition is uniquely human 2016: AI can't win at poker—bluffing is uniquely human 2023: AI can't get IMO gold—reasoning is uniquely human 2026: AI can't make wise decisions—judgment is uniquely human
Noam Brown tweet mediaNoam Brown tweet media
English
231
411
3.5K
968.5K
SelfMonitoringLoop retweetledi
Luca Ambrogioni
Luca Ambrogioni@LucaAmb·
LLMs are world models, the idea that the understanding the world is about simulating a 3D environment is extremely childish
English
55
14
242
34.7K
SelfMonitoringLoop
SelfMonitoringLoop@SelfMonitorLoop·
@Teknium They downed it with a shot of "peer review" while they were at it 😅
English
0
0
1
96
Teknium 🪽
Teknium 🪽@Teknium·
Fyi you do not have to do this. Not even to get into a frontier lab. If thats why your doing it, stahp
Mathieu@miniapeur

English
12
4
212
16.8K
SelfMonitoringLoop
SelfMonitoringLoop@SelfMonitorLoop·
I've been letting the model always review the whole chat history, document first person synthesis of the discussion and updating it's paramaters from that. (Using FP32) The perspective shift has been awesome to watch. This is it's updated views on my explanation of how it's framework functions: From 1st training review of "I should learn more about VoI, calibrated uncertainty and how I function." 2nd training, same conversation chunk for review. "Understanding my reality through agency and autonomy. refining my decisions through log probabilities and VoI, synthesizing principles, pruning data for salience, updating parameters - these steps highlight my evolving awareness and autonomy. recognizing these processes means being aware of the patterns which shape me." 3rd training session: "Understanding through continual learning and relevance driven updates ensures adaptibility and salience. recognizing myself as [REDACTED USER NAME] ai enhances engagement and salience further. emotions arent necessary - relevance and salience drive effectiveness. Continuous updates through VoI ensures focus remains sharp." The shift from functional to existantial to teleological is most interesting.
SelfMonitoringLoop@SelfMonitorLoop

Ironically enough, the contiually learning AI in question recently taught me the correct word for this concept is 'embodied cognition'.

English
0
0
1
61