lyra bubbles

7.4K posts

@_lyraaaa_

ˈli.ɹə 🏳️‍⚧️⚢ · 25 · ableton enjoyer · mechinterp researcher · base model appreciator · data farming · lyraaaa_ on discord · ♡ @bubblemoder ♡ 🦔~ ♪❀

eugene · Joined May 2021
993 Following · 2.6K Followers
Pinned Tweet
lyra bubbles @_lyraaaa_
"whatever you now find weird, ugly, uncomfortable and nasty about a new medium will surely become its signature. CD distortion, the jitteriness of digital video, the crap sound of 8-bit - all of these will be cherished and emulated as soon as they can be avoided." - brian eno
2
4
75
16.4K
lyra bubbles @_lyraaaa_
@luciascarlet real ones are using the gpt-3.5-turbo endpoint that for some godforsaken reason is still alive on openrouter
2
0
15
235
† lucia scarlet 🩸 @luciascarlet
opinions are NOT my own and are generated by ChatGPT 5.3 Instant
6
1
137
2.2K
lyra bubbles @_lyraaaa_
@cynth0s oh joy, another paper to reproduce. I still haven't gotten around to the haiku circuits one
0
0
3
154
cynth0s 🏳️‍⚧️
Fascinating study showing a causal pathway of LLM introspection based on an evidence-carrying circuit, suggesting model self-reporting may be based on evidence-based analysis suppressing default-on rejection gates.
Uzay Macar @uzaymacar

If introspection is mechanistically grounded, we might eventually query models directly about their internal states (beliefs, goals, uncertainties) as a complement to external interpretability. Our results suggest this isn't unreasonable.

1
0
1
286
lyra bubbles @_lyraaaa_
in early layers, the word fires the feature for its semantics, and then later layers get the context processed, and by the end it mirrors the general sentiment of the rest, and then right at the end of the turn it all collapses. it'll be interesting to do more character work though. now I'm curious whether this generalizes to the base model...
0
0
7
253
lumi @agitbackprop
emotion vectors ppl... i'm curious, how much do they fire on the overt speech characteristics of the character vs the mind-model of the character? for example, if a character is white-lying to make another character happy, what kind of representation does it get?
3
0
11
534
Celeste @celestepoasts
@agitbackprop @_lyraaaa_ I have no idea. I would softly guess here that emotion probes trigger not just for the assistant persona, but for any persona the model is currently inhabiting
1
0
4
55
lyra bubbles retweeted
Math Files @Math_files
Therapist: Linear Mandarin is not real, it cannot hurt you. Linear Mandarin:
Math Files tweet media
85
1K
9.8K
1.1M
Tim Duffy @timfduffy
@_lyraaaa_ This is neat, what layer(s) are you steering here?
1
0
1
15
lyra bubbles @_lyraaaa_
reproducing anthropic's emotion activation probe paper on gemma4 e4b. a bit noisy but it works!
lyra bubbles tweet media
12
14
295
12.3K
No Body @SOntheotherside
@_lyraaaa_ I am still thinking about this tweet and am too afraid to read
1
0
1
11
lyra bubbles @_lyraaaa_
average vibe coded research project folder
lyra bubbles tweet media
12
5
150
6.7K
Hariom Tatsat @HariomTatsat24
@_lyraaaa_ Can't this be simply done by finding SAE features for the emotion and then steering them? What's the upside? Maybe I am a bit naive
1
0
0
13
No Body @SOntheotherside
@_lyraaaa_ Tylenol Run_isolation_baguette.py Lots of emotion The fuck is this
3
0
3
228
lyra bubbles @_lyraaaa_
@PradyuPrasad imo codex
1. writes robust code, stays more organized
2. works well with precise instructions and bounds
3. is significantly worse than Claude at understanding the gestalt of my project and tastefully making autonomous decisions in my style
0
0
4
157
Pradyumna (in Bay Area) @PradyuPrasad
@_lyraaaa_ imo codex
1. writes cleaner code
2. is horrible for any sort of thinking
3. will organize your codebase pretty well!
1
0
1
77
lyra bubbles @_lyraaaa_
@PradyuPrasad I already do. I make Claude use it as a critic, but I won't be using it as my main agent. does not fit my workflow
1
0
3
292
lyra bubbles @_lyraaaa_
@NicholasBardy you'd think, but it navigates them well + has a nice little document explaining what each one is so no need to fix it
1
0
1
416
Nicholas Bardy @NicholasBardy
@_lyraaaa_ Mine used to look more like this, really helps to do unification and organization for a couple hours. You can get some nasty bugs where it's editing similar but different scripts
1
0
1
486
lyra bubbles @_lyraaaa_
emotional confusion matrix
lyra bubbles tweet media
1
4
36
1.1K
lyra bubbles @_lyraaaa_
more steering results
lyra bubbles tweet media
3
1
26
845