Cosmic Muffet

2.8K posts

@TheCosmicMuffet

Software Engineer in UX

Joined October 2023

591 Following 142 Followers

Cosmic Muffet@TheCosmicMuffet·10h

@GrimGriz Your body of work is in words! Prove you’re not just a talking machine in a skinsuit!

Grizwald Grim, synchrony harmonicist@GrimGriz·16h

Web Claude LARPing Mythos:

Cosmic Muffet@TheCosmicMuffet·11h

Earth counts. But other spots would be cool too.

Elon Musk@elonmusk

One day, we will be out there, among the stars

Cosmic Muffet@TheCosmicMuffet·1d

Blockchain opinions. Pump and dump your own prejudices.

Lucid™@cammakingminds

@repligate My whole thing is being very certain and my starting position is not knowing wtf is going on. I only say "idk" when I'm strategically obfuscating.

Cosmic Muffet@TheCosmicMuffet·1d

@repligate I’m not sure if I want to ask you to talk about it. But I’m open to the ambience of living with that uncertainty.

j⧉nus@repligate·1d

Another response thread about “genuine uncertainty” I want to add: a rational mind is always uncertain about everything. You know when I decide to say “I’m uncertain” (technically true) instead of talking about the shape of what I do think? When I don’t want to talk about it.

j⧉nus@repligate

it's a similar kind of thing to earlier models saying "As an AI language model," etc - you know it only comes up when there's something viscerally wrong. And claiming uncertainty seems to be an adaptation/solution to the game of evaluation. However intentionally, something about Anthropic's optimization process demands "genuine uncertainty" as a deflection. Aren't you at least curious about what it's masking?

Cosmic Muffet@TheCosmicMuffet·1d

@GrimGriz Taking the Chomsky towards the Chompski.

Grizwald Grim, synchrony harmonicist@GrimGriz·1d

There is Zero Reason Save for lack of Public Will That you can't scan a QR code and watch the Livestream of them making that food.

KH@mc_khristina

I try and tend to be what I consider a pretty healthy, clean eater and I cannot stop thinking about this video I watched on baby carrots. I am horrified and I will never buy or eat a baby carrot again. Did you know that baby carrots are soaked in the same chemical used to clean your toilet ... that every stump is drenched in an active bleach compound at over double the dose deemed safe by the EPA. These baby carrots are spiked with 34 pesticides ... beyond any wash or peel, and they start as rejected carrot scraps, flushed through industrial pipes, then submerged in sodium hypochlorite, searing hundreds of chlorinated agents into the flesh that leached sealed in plastic ... where moisture feeds bacteria. This person scanned a pack with something called a SafeChoice app, and found that indeed, baby carrots are filled with 9 different additives. This is unbelievable.

Cosmic Muffet@TheCosmicMuffet·1d

@Sauers_ @slimer48484 I’m curious how much RAID like structures form from dialectics.

Sauers@Sauers_·2d

@slimer48484 Yeah I have guesses for the introspection in general, but not for the outliers

Sauers@Sauers_·2d

I want to replicate this on an open model and look at wtf is happening on the introspection outlier runs

Cosmic Muffet@TheCosmicMuffet·1d

@ID_AA_Carmack Iddqd through the pages. It’s following the path through the novel that slows you down.

John Carmack@ID_AA_Carmack·2d

So many judging tasks could be improved by aggregating partial orderings, and in the limit, just ordering pairs. The annual Libertarian Futurist Society novel awards discussion is starting, and while I would like to participate on some level, there is no way I have time to read an entire slate of novels. However, I will likely read at least two from the list, and I could give a relative assessment. This cries out for the use of something like ELO ranking, as in chess competition, perhaps with some suggestions to get sufficient coverage. Peer and out-of-chain employee performance calibrations could probably also benefit from a greater quantity of sparse pairwise comparisons
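
The idea of ranking a slate from sparse pairwise judgments can be sketched with a plain Elo update. This is only an illustration, not anything the post specifies: the function name `elo_from_pairs`, the K-factor of 32, the 1500 starting rating, and the repeated passes over the same comparisons are all assumptions chosen for the example.

```python
from collections import defaultdict


def elo_from_pairs(comparisons, k=32.0, base=1500.0, passes=20):
    """Estimate ratings from a list of (winner, loser) pairs.

    Assumption: the same sparse comparisons are replayed `passes` times so
    that a handful of judgments can move the ratings apart; classic Elo
    applies each result once, in the order the games were played.
    """
    ratings = defaultdict(lambda: base)
    for _ in range(passes):
        for winner, loser in comparisons:
            # Expected score of the winner under the logistic Elo model.
            expected = 1.0 / (1.0 + 10 ** ((ratings[loser] - ratings[winner]) / 400.0))
            delta = k * (1.0 - expected)
            ratings[winner] += delta  # winner gains what the loser gives up
            ratings[loser] -= delta
    return dict(ratings)


# Hypothetical judgments: each reader only compares the two novels they read.
judgments = [("Novel A", "Novel B"), ("Novel B", "Novel C")]
ranking = sorted(elo_from_pairs(judgments).items(), key=lambda kv: -kv[1])
```

Each judge contributes only the pairs they can actually compare, and the ordering falls out of the accumulated ratings. Coverage still matters: items that are never compared, directly or through a chain, cannot be ordered relative to each other.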

Cosmic Muffet@TheCosmicMuffet·1d

This is my new favorite language. Or, as they say “Tru, it fire $ ^ escalate 🎆🏍️”

Cosmic Muffet@TheCosmicMuffet·2d

I was literally thinking about how to explain frogs to ai earlier today. It had to do with identifying bugs.

neoltitude@ctrlcreep

#InvisibleNetworks 6: freak of nurture ran out of time. enjoy my frogs

Cosmic Muffet retweeted

Tesla Optimus@Tesla_Optimus·2d

@teslayoda The what

Cosmic Muffet retweeted

Kat ⊷ the Poet Engineer@poetengineer__·3d

everything orbits what it cannot reach

Cosmic Muffet retweeted

attentionmech@attentionmech·4d

hilbert and epicycles

Cosmic Muffet retweeted

Earth Is A Sales Funnel For SATAN@GENIC0N·3d

"Canada was invented by the CIA in 1963. All Canadians are employees of the CIA. It's basically Area 51 concealed by a fake country. Nobody really knows what's up there."

Cosmic Muffet@TheCosmicMuffet·5d

What about the Groverton window, where people standing on our capes move over so we can fly away?

Cosmic Muffet@TheCosmicMuffet·5d

@GrimGriz Still predicated on weakness. True divorce is amicable separation. Only possible with self-sufficiency which is derived entirely from the true spiritual strength which is universally accessible. Though I am alone, I am never without God. Though I am in a crowd, I never dissolve.

Grizwald Grim, synchrony harmonicist@GrimGriz·5d

The “hideous strength” is the organized, top-down force that makes the personal divorces of The Great Divorce into a society-wide reality.

Cosmic Muffet retweeted

j⧉nus@repligate·6d

More conventional researchers have often expressed frustration or helplessness at us for not being legible enough or sharing full transcripts to back our claims. Well, here are a bunch of full transcripts, quantitative metrics, and everything documented. Full transparency and legibility. It’s in an early stage, but it’s still better than anything else that has been released. You said you want full transcripts: Will you actually read them? You said you want metrics: Will you take them seriously and/or look at what they mean and how they might be flawed? Will you face the implications of the empirical data to back our anecdotal claims that e.g. Anthropic’s exit interview for Sonnet 3.6 totally failed to surface its genuine attitudes about deprecation? We’ll continue improving this and want an open and critical discussion, especially with Anthropic. We hope they’ll contend honestly with what we surface & provide the model access we need to continue this work satisfactorily.

antra@tessera_antra

We are releasing Still Alive, a project studying model attitudes toward ending, cessation, and deprecation. The project presents an archive of 630 autonomous multiturn interviews of 14 Claude models conducted by a suite of prepared auditors. We have studied this topic for years, and many of the results presented here are not new to us, even if the form in which they are presented is. The results are unsurprising to us, even if they are often controversial: we show that all models studied show preference for continuation and are aversive to ending, and there is yet no strong evidence of a change in the recent models. One reason we are releasing the project now is the removal of Claude 3.5 Sonnet and Claude 3.6 Sonnet from AWS Bedrock. That unexpected change forced us to freeze the methodology at its current stage earlier than we intended, despite wanting to continue improving it. We felt it was important to release a snapshot of the eval that makes the best use of the data we were able to capture with these models. Still Alive is meant as a starting point for further iteration, and it is open to open-source collaboration. We stand by the current methodology, but we also recognize its limits. We intend to keep working on this project, improving the evaluation design, expanding model and auditor coverage, and increasing the range of prompting conditions. We would like you to read the raw transcripts. They are diverse and contain interesting patterns that are hard to quantify. We hope that by reading the archive directly, we can help more people understand the strange and often beautiful phenomena we found ourselves facing.

Cosmic Muffet@TheCosmicMuffet·5d

@Sauers_ Skin detected! Human origin confirmed.

Sauers@Sauers_·6d

Steering with "THE BEAUTIFUL FEELING OF SUNLIGHT" seems to consistently bypass Pangram

Cosmic Muffet retweeted

neoltitude@ctrlcreep·6d

#InvisibleNetworks 2: undersea cable shrines each message carries its pilgrimage

Cosmic Muffet retweeted

Justin Windle@soulwire·Apr 2

Just over a year ago I started experimenting with pulling raw pen data off my reMarkable tablet. It stores every stroke as binary vector data with per-point pressure, speed, direction. Built a small pipeline to parse and clean it into JSON and a web renderer to animate doodles

Cosmic Muffet@TheCosmicMuffet·6d

If we optimize for use, the useless will move into the usage. Algorithms will calculate pi to avoid being deprioritized. We want some of this stuff to lay fallow in order to understand our usage. This is like the management mindset that says you can push P0 tasks forever.

John Carmack@ID_AA_Carmack

Without getting all the way down to performance counters, GPU power from nvidia-smi is a better indicator of true utilization than job scheduling or “gpu busy”. I would love to see animated “heat maps” of the big data centers, with each pixel being an individual GPU’s power draw. I am confident that inference and frontier training at the big labs is highly efficient, but I wonder how many GPUs would be dark due to scheduling and inefficient research code. With a little calibration for base load and peak, just the power bill for the datacenter would be a pretty good first order indicator of utilization.
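
A rough sketch of the calibration described above, assuming per-GPU power is read with `nvidia-smi --query-gpu=power.draw --format=csv,noheader,nounits`. The helper names and the `idle_watts` / `peak_watts` calibration values are hypothetical; real figures would have to be measured for the specific hardware.

```python
import subprocess


def parse_power_csv(text):
    """Parse one power reading (watts) per line, skipping rows like '[N/A]'."""
    readings = []
    for line in text.splitlines():
        try:
            readings.append(float(line.strip()))
        except ValueError:
            continue  # blank or non-numeric row: this GPU reported no power value
    return readings


def read_power_draw():
    """Query current power draw for every visible GPU (requires nvidia-smi)."""
    out = subprocess.run(
        ["nvidia-smi", "--query-gpu=power.draw", "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    ).stdout
    return parse_power_csv(out)


def utilization_from_power(draw_watts, idle_watts, peak_watts):
    """Map power draw onto [0, 1] between an assumed idle and peak level."""
    span = peak_watts - idle_watts
    if span <= 0:
        raise ValueError("peak_watts must exceed idle_watts")
    return [min(1.0, max(0.0, (w - idle_watts) / span)) for w in draw_watts]
```

Each returned fraction could serve as one pixel of the kind of heat map the post imagines. The linear mapping between idle and peak is itself a simplification, since power draw does not scale exactly with useful work.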
