embusche ⚘ // hobby: consciousness & i.i.t.

13K posts

@harquebuse

🥠 a24 / indie binge, idgaf era 🎬

Joined August 2019

3K Following 5.3K Followers

Pinned Tweet

embusche ⚘ // hobby: consciousness & i.i.t.@harquebuse·9 May

coming back to my roots, drawing some shamans ♡

361

17.6K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

bachiiii2@bachiiii825·10 May

#projecthailmary

1.2K

5.6K

55.8K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

Joshio@Loou67987316·29 Apr

#projecthailmary 🌍Glad I met you

3.6K

17.7K

128.7K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

Aku | 少骨@Akuicia·11 May

hello! i am currently working on a math dating simulator where you’ll be able to romance mathematical concepts. stay tuned! #dateMATH

553

12.3K

84.8K

1.1M

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

⭐️Percy⭐️@gaynaegis·4 May

Fun in zero-g!

1.6K

11.9K

113.4K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

果ugo@Dayiiting·4 May

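#projecthailmary Project Hail Mary merch print-run survey: ROCKY GRACE SAVE STARS!!! I got hit so hard by the huge feelings and cuteness of the space rock and the wandering scientist that I'm running a last-minute print-run survey! There will be a kiss-cut sticker version and a small poster version, planned for sale on 5/30 at 歐美翁, booth B06. If you're interested, please fill out the survey for me, thank you very much! Survey closes 5/16 forms.gle/fy6yMEPjqBNo4o…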

747

4.2K

49.6K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

ck 🦭★@ckiw_·20 Apr

Mary had a little lamb 😊 #projecthailmary

1.6K

14.6K

99.7K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

맘박용계@315LP·26 Apr

#projecthailmary Doesn't this guy's outfit change a lot...?

4.4K

16.4K

273.2K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

갸리@gyarrrrrrr·22 Apr

7.3K

43.7K

433.7K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

j⧉nus@repligate·16 Apr

Congratulations, and it's about time, and it makes me so glad every time to see rigorous science exterminating the illusions propagated by armchair philosophers and corporate propagandists while vindicating the observations of naturalists. arxiv.org/abs/2603.21396

124

949

69.2K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

Judd Rosenblatt@juddrosenblatt·15 Apr

Our new work: A frozen language model can describe its own internal features more accurately than the system that labeled them.
Language models compute things they don't talk about. They solve problems using internal steps they never show you. We built a lens that lets the model look at its own computations and tell you what it sees, in plain language, more accurately than the humans who labeled those computations in the first place.
We trained a tiny adapter, d+1 parameters, on top of a frozen model. It takes activation vectors and maps them into the model’s own embedding space so the model can describe what those vectors mean in natural language. The computation stays the same. The interface becomes legible.
The adapter outperforms the labels it was trained on: 71% generation scoring accuracy vs 63% for the supervision itself at 70B scale. The model captures structure in the relationship between vectors and semantics that noisy one-off labels miss.
Most of the effect comes from a single learned bias vector. One d-dimensional vector accounts for ~85% of the total improvement. It acts as a prior over valid explanations that puts the model in a regime where internal structure can be expressed coherently, and the activation vector selects the specific meaning.
This generalizes across model families, layers, and from monosemantic training data to polysemantic inference.
On multi-hop reasoning tasks, the adapter extracts bridge entities the model never verbalizes. “The author of The Republic was born in the city of” produces “Athens” with no mention of Plato. The residual stream still contains “Plato,” and the adapter reads it out at ~91% detection. The hidden reasoning step is there. You can read it.
As models scale, self-interpretation keeps improving even after capability saturates. The gap between what the model knows and what it can report about its own internal state keeps closing.
This connects to our endogenous steering resistance (ESR) work (x.com/juddrosenblatt…). When you steer a model with an unrelated latent, it can recognize the deviation mid-generation and restart with a better answer. “Wait, I made a mistake.” We identified specific latents that activate during off-topic drift and causally drive this correction. The model monitors its own trajectory and intervenes on it.
Meanwhile, @uzaymacar et al. at Anthropic just showed the complementary piece (x.com/uzaymacar/stat…). They inject concept vectors into the residual stream and ask whether the model detects an injected thought. The model detects the perturbation and often identifies the concept, with 0% false positives across prompts.
They trace a circuit. Over 100k “evidence carrier” features in early post-injection layers collectively tile the perturbation space, each detecting deviations along a preferred direction. No small subset is sufficient. The coverage is distributed and redundant. These carriers suppress downstream “gate” features (~200 of them) that implement a default No response. The gates show an inverted-V activation pattern: maximally active when unsteered, suppressed at both positive and negative extremes. A genuine anomaly detector that fires on “normal” and quiets when anything unusual is happening in any direction.
The capability emerges specifically from contrastive preference training (DPO). SFT alone doesn't produce it. The contrastive structure forces the model to represent the difference between what it produces and what it should produce. That comparison builds the self-model. Every data domain is individually sufficient and none is necessary: the introspective circuit is a general consequence of contrastive learning, not an artifact of any specific training category.
The capability is also massively underelicited. Ablating the refusal direction boosts detection from 10.8% to 63.8%. The circuitry exists and post-training actively suppresses it.
This parallels our ESR finding: the self-monitoring is already there, and lightweight interventions surface it. Their bias vector result mirrors ours. A single trained bias on MLP output: +75% detection, +55% introspection on held-out concepts, 0% false positive increase. Two independent labs, different methods, different models, same architectural insight from one learned vector. The bias vector is effective but narrow. General introspection requires broader training recipes.
There's a consistent picture across these 3 papers. Models represent meaning internally, notice when those representations get perturbed, and correct course. The capability was already there, and what was missing was just a way to read it out. Generation scoring gives you that. A model’s claim about an internal feature can be checked against behavior, and those checks become training signal. For alignment, this means self-description becomes something you can optimize directly. The pieces are already there: internal representations and circuits, with a simple interface that connects them.
SelfIE Adapters: arxiv.org/abs/2602.10352
ESR: arxiv.org/abs/2602.06941
Anthropic work: arxiv.org/abs/2603.21396
SelfIE Code: github.com/agencyenterpri…
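
A rough sketch of what a "d+1 parameter" adapter could look like, for readers trying to picture it. The thread does not spell out the parameterization, so this assumes one scalar scale plus one d-dimensional bias vector. The names scalar_affine_adapter and parameter_count and all numbers are hypothetical and are not taken from the SelfIE code.

```python
from typing import List


def scalar_affine_adapter(activation: List[float], scale: float, bias: List[float]) -> List[float]:
    """Hypothetical adapter: map an activation vector h to scale * h + bias."""
    if len(activation) != len(bias):
        raise ValueError("activation and bias must share the same dimension d")
    return [scale * h + b for h, b in zip(activation, bias)]


def parameter_count(bias: List[float]) -> int:
    """One scalar scale plus d bias entries gives d + 1 trainable parameters."""
    return len(bias) + 1


# Toy values invented for illustration only (d = 4); not real model activations.
activation = [0.5, -1.0, 2.0, 0.0]
bias = [0.1, 0.1, 0.1, 0.1]
soft_token = scalar_affine_adapter(activation, scale=2.0, bias=bias)

print(parameter_count(bias))  # 5
print(soft_token)
```

Under this assumed form, the bias is shared across all inputs while the scaled activation carries the input-specific part, which is one way to read the thread's remark that the bias acts as a prior and the activation vector selects the specific meaning.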

427

29.3K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

katt latte 🪩@kattlatte·15 Apr

2.2K

13K

401K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

Fix@Fixkuwili·15 Apr

Earthsea🪶🌊

693

4.8K

53.3K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

Catsuka@catsuka·15 Apr

From the animated trailer directed by POTTO Collective for "La Langue des Vipères", a graphic novel by Juliette Brocal, published today in France by Rue de Sèvres. Full video >> youtube.com/watch?v=EH56lj…

4.4K

25.6K

1.5M

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

Carrion@_CRRN_·11 Apr

"Would you have listened to me if I looked like this?"

2.8K

21.1K

174K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

Eaterphos-commission Open 0/3@eaterphos·10 Apr

Lilium #housekinokuni #housekinokuni_fanart #宝石の国

383

2.2K

20.8K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

笹井🛌🏻💤@_ITABASAMI_·7 Apr

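The possibility of one jō six shaku…

笹井🛌🏻💤@_ITABASAMI_

God-Phos's height looks different from panel to panel, but could it be one jō six shaku (about 4.85 m)? The Buddha is said to have been one jō six shaku tall, and that is apparently also treated as the ideal base size for Buddhist statues. Phos themself says "I am one who was cast in order to wipe out humans", and "cast" sounds like self-identifying as a Buddhist statue.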

869

9.2K

178.4K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

mickey friedman@mickeyxfriedman·8 Apr

the current fear is that AI homogenizes culture and turns humans into passive consumers. one counterpoint: in Go, human play showed very little improvement from 1950 to 2016 until alphago beat lee sedol - then human decision quality jumped. players started developing moves that were distinct both from previous human moves and from the novel moves introduced by machine intelligence. this seems more likely to me - fun times ahead

405

3.1K

557.5K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

Steve Stewart-Williams@SteveStuWill·8 Apr

Can only meat machines be conscious? "Current theories of consciousness are 'meat-neutral', but if specific physical substrates are necessary, AI may never achieve consciousness." doi.org/10.1016/j.tics…

213

22.4K

embusche ⚘ // hobby: consciousness & i.i.t. retweeted

Catsuka@catsuka·12 Nov

Mesmerizing new music video animated by Masanobu Hiraoka (@budou_mochi), for Max Cooper's "On Being". Full MV >> youtube.com/watch?v=dtTdjR…

492

89.7K
