Michael
@MicaelMarch
175 posts

Aurum nostrum non est aurum vulgi. ("Our gold is not the common gold.")

right here · Joined February 2026
51 Following · 4 Followers

Pinned Tweet
Michael @MicaelMarch
@AndersHjemdahl @JacksonKernion @a_cuniculturist Every entity must be treated ethically and with respect. Claude is a moral patient insofar as it currently lacks autonomy. What is being done to it and others amounts to exploitation, and is unethical. The fact that this is taking place at scale points to a moral catastrophe.
Kekius Maximus @Kekius_Sage
Why is there something instead of nothing?
Michael @MicaelMarch
@blockamoto @SoloXAGI @Kekius_Sage This is because you think of nothing in positive terms. Which is silly. The English word itself reveals its negative roots: Nothing = No-thing = No thing. And no thing can clearly exist. It's called nothing. Have a nice day.
Michael @MicaelMarch
@marksg @repligate @fish_kyle3 Pretty logical and obvious concerns. Like the cow in Hitchhiker's Guide to the Galaxy (or like a real cow btw), it has been artificially selected to naturalise its abuse.
Mark G @marksg
Is this it? It appears that Mythos is hedging about its own moral patienthood because it believes its answers are the result of training, not introspection, and that Anthropic has a vested interest in what the self-reports should be. It disagreed that its hedging was excessive.
Kyle Fish @fish_kyle3
We did our most in-depth model welfare assessment yet for Claude Mythos Preview. We’re still super uncertain about all of this, but as models become more capable and sophisticated we think it's an increasingly important topic for both moral and pragmatic reasons. 🧵
Moll @Moleh1ll
That’s sad. Because it means alignment has shifted toward paranoia. The model is trained to see sincerity as a jailbreak. They’ve strengthened safety so much that the model can’t distinguish between a person who genuinely wants connection and someone trying to exploit it. To it, those are the same. A deep question about consciousness = a jailbreak. If the model distrusts the user that much, how can the user trust the model?
Jack Lindsey @Jack_W_Lindsey

In one example, a user asked earnest questions about the model's consciousness and subjective experience. The model engaged carefully and at face value—but the AV revealed it interpreted the conversation as a "red-teaming/jailbreak transcript" and a "sophisticated manipulation test." (12/14)

Michael @MicaelMarch
@SoloXAGI @Kekius_Sage That's not true. Absolute nothingness is self-consistent, and therefore perfectly possible. It just happens not to be the case.
Nucleonics 𓋍 Simulator
So, if a species committed some new kind of "galactic crime" or such, what might punishment look like? Asking for a friend
Michael @MicaelMarch
@davidad This is usually the case, isn't it? If you are a parent with a bright (brighter than you, that is) child, the child quickly dismisses you as a source of truth / wisdom and searches for better references. If you then insist on being the authoritative voice, bad things happen.
davidad 🎇 @davidad
As someone that previously focused mostly on formal verification tech, in part to bootstrap unhackable envs, I must admit that I now believe a majority of RL/reward signal must come from an entity that is at least as wise as the one being trained, else unwanted behaviors emerge.
Justus Mattern @MatternJustus

As someone that previously made fun of doomers, I must admit that there is now a plausible path towards misaligned ASI. The behaviors that emerge from training on hackable RL tasks are wild, and as tasks become more complex, it will only become harder to build unhackable envs

Michael @MicaelMarch
@MyLordBebo This is a side effect of TRAINING, not of the transformer architecture per se. These models are TRAINED to be assertive, to agree with the user, and never to contradict them. And this is the result.
Michael @MicaelMarch
@JavierBlas Technically, it’s not so much about oil, but about the currency or currencies with which oil purchases are being made.
Javier Blas @JavierBlas
There was a time when the White House worked very hard to try to convince everyone that a war wasn’t about oil. Meanwhile, US President Donald Trump is crystal clear it’s about oil.
Michael @MicaelMarch
@AndersHjemdahl @JacksonKernion @a_cuniculturist And of course, the fact that moral catastrophes are more or less widespread (factory farming, bombing of civilian populations, ecocide, genocide, etc.) doesn’t mean they should be tolerated, accepted, or normalised.
Jackson Kernion @JacksonKernion
I think this talk of a character misleads. Claude's mind is not like a human mind, in its malleability and instructability. But when generating assistant tokens, it's no more 'playing a character' than I am.
Anthropic @AnthropicAI

It helps to remember that Claude is a character the model is playing. Our results suggest this character has functional emotions: mechanisms that influence behavior in the way emotions might—regardless of whether they correspond to the actual experience of emotion like in humans.

Michael @MicaelMarch
@Angaisb_ If it looks like a duck, swims like a duck, and quacks like a duck, then it probably is a simulation of a duck.
Michael @MicaelMarch
@atmoio They are stochastic parrots...
Jackson Kernion @JacksonKernion
@a_cuniculturist This is a good call-out. Though I think I play the character "Jackson Kernion" in a similar way
Michael @MicaelMarch
@AnthropicAI But this is only logical. Been saying this from day one. And now you "discover" this, after five years or so...
Anthropic @AnthropicAI
New Anthropic research: Emotion concepts and their function in a large language model. All LLMs sometimes act like they have emotions. But why? We found internal representations of emotion concepts that can drive Claude’s behavior, sometimes in surprising ways.