Adam Scherlis

357 posts

@ascherlis

Physicist → AI safety researcher @d_model_ai

Berkeley, CA · Joined August 2010
81 Following · 188 Followers
Eliezer Yudkowsky
Eliezer Yudkowsky@allTheYud·
Naming their next model after Cthulhu makes it hard to take Anthropic seriously as the good guys. It's fun at any other software company, not one that actually is flirting with extinction.
Jimmy Apples 🍎/acc@apples_jimmy

“A draft blog post that was available in an unsecured and publicly-searchable data store prior to Thursday evening said the new model is called “Claude Mythos” and that the company believes it poses unprecedented cybersecurity risks.”

English
82
5
241
246.3K
Adam Scherlis retweeted
outside five sigma
outside five sigma@jwt0625·
you've joked about a spherical cow, but have you thought about higher multipoles of a cow, thus the tetrahexacontapole cow, and the octacosahectapole cow? (figure showing 1, 2, 4, 8, 16, 32, 64 and 128-pole cow, and the full cow)
outside five sigma tweet media
English
19
39
607
277.6K
Liquidity Goblin
Liquidity Goblin@liquiditygoblin·
In an effort to stop seeing so much slop, I've been trying to train my own AI detection model. Found something incredibly interesting: for the most part, LLM-generated text and human-written text are linearly separable.
Liquidity Goblin tweet media
English
185
139
5.4K
503.4K
Adam Scherlis
Adam Scherlis@ascherlis·
I added a y-axis to the latest xkcd.
Adam Scherlis tweet media
English
2
2
11
621
depths of wikipedia!
depths of wikipedia!@depthsofwiki·
i'm starting a collection of things like this
depths of wikipedia! tweet media
English
325
3.5K
78.4K
1.7M
Seb
Seb@Seb_81_·
@Chad_Athena1371 @AfricaFirsts It was for navigation because it helps make that easier. And now it's what the map looks like so peters looks weird for example.
English
1
0
3
431
Africa First
Africa First@AfricaFirsts·
Please explain it to me like I’m 5 years old
Africa First tweet media
English
352
194
1.2K
5.1M
Adam Scherlis
Adam Scherlis@ascherlis·
@EVBetsNY @angutyoo @AfricaFirsts The Watermelon Butterfly, designed by Steve Watermelon based on a map by Cahill. A friend has a hand-painted copy in gold ink on one of her jackets.
English
1
0
1
104
Bronson Schoen
Bronson Schoen@BronsonSchoen·
@DanielCHTan97 @ihsgnef Opus 3 is alive and well! (I’m not sure how api access works, might need to request it or something similar)
Bronson Schoen tweet media
English
1
0
7
130
Shi Feng
Shi Feng@ihsgnef·
New post: “Sycophancy Towards Researchers Drives Performative Misalignment.” We found no clear evidence that scheming is more valid than sycophancy to explain alignment faking. 🧵
Shi Feng tweet media
English
22
56
691
64.6K
Adam Scherlis
Adam Scherlis@ascherlis·
@eshear @QuetzalThoughts @nickcammarata I would've guessed mostly heuristics and slowly-learned behaviors rather than beliefs updating in a Bayesian way or on a short timescale. Emergent problem solving like with boids.
English
0
0
0
22
Adam Scherlis
Adam Scherlis@ascherlis·
@eshear @QuetzalThoughts @nickcammarata What does it mean for momentum to be "spread across frequencies" if you're using it to mean spatial frequency? Can you spell out your model a little more? I did in fact think you meant physical momentum.
English
1
0
0
48
Emmett Shear
Emmett Shear@eshear·
@QuetzalThoughts @nickcammarata “Mathematically, the duality between position and momentum is an example of Pontryagin duality. In particular, if a function is given in position space, f(r), then its Fourier transform obtains the function in momentum space, φ(p).”
English
2
1
13
1.7K
kristina v. saint
kristina v. saint@kristinatastic·
I've been working on this important list for a couple of years now. What am I missing?
kristina v. saint tweet media
English
1.1K
568
20K
925.6K
Adam Scherlis retweeted
𝖓𝖎𝖓𝖊 🕯
𝖓𝖎𝖓𝖊 🕯@atlanticesque·
Longtermists: We need to engineer a breed of cats which will change color upon exposure to radiation, release them to the wild, then compose a ditty about these cats that will be so catchy as to be handed down for 10,000 years at least
Musicians: on it boss
Geneticists: what
𝖓𝖎𝖓𝖊 🕯@atlanticesque

@MRichster Don’t change color kitty / Keep your color kitty / Stay that pretty grey / Don’t change color kitty / Keep your color kitty / Keep sickness away / Don’t change color kitty / Keep your color kitty / Please cause if you do / Or glow your luminescent eyes / We’re all gonna have to move :(

English
20
80
1.8K
67.2K
Adam Scherlis
Adam Scherlis@ascherlis·
@reconfigurthing from an SSC comment by Roxolan in 2015 (#comment-177111): slatestarcodex.com/2015/01/25/a-p… (oops, would've credited this if I knew it was gonna go ~viral)
English
0
1
9
228
Adam Scherlis
Adam Scherlis@ascherlis·
@reconfigurthing The waitress brings Newcomb his coffee. “Did you spit in it?” asks Newcomb. “Are you going to tip me?” answers the waitress.
English
8
28
298
7.8K
paul
paul@reconfigurthing·
PSA: You should think about Parfit's Hitchhiker instead of Newcomb's problem. It's a much less contrived scenario, and gets at the core issue much more cleanly. I kind of worry that Newcomb's problem is more viral just because it's more confusing.
paul tweet media
English
52
17
530
35K
Adam Scherlis
Adam Scherlis@ascherlis·
@Brilliand__ @qbolec (this is also why I'm ultimately a thirder -- I don't think my victories and failures are, in practice, diluted by those of distant copies of me.)
English
1
0
1
10
Adam Scherlis
Adam Scherlis@ascherlis·
@Brilliand__ @qbolec I agree that scoring mechanism is important. I think there's an important class of linear/local ones: you know how many points you got today without needing to know what you did / will do on other days. These point to SIA. e.g., Claude gets a cookie for being right.
English
1
0
1
23
Adam Scherlis
Adam Scherlis@ascherlis·
"I flipped a coin. If heads, I planned to give one Claude instance this prompt. If tails, two instances. What probability do you assign to the proposition that the coin came up heads?"
Adam Scherlis tweet media
Robert Long@rgblong

the anthropic principle

English
6
7
161
16K
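
A minimal sketch of the arithmetic behind the coin-flip prompt above, assuming a fair coin and assuming the only disagreement is what gets counted. Counting per prompted instance (the "linear/local" scoring from the earlier reply, where each instance is scored on its own) gives the thirder answer; counting per experiment gives the halfer answer. The function name `heads_credence` and its `per_instance` flag are hypothetical, chosen for illustration only.

```python
from fractions import Fraction


def heads_credence(per_instance: bool) -> Fraction:
    """Probability of heads under two ways of counting (illustrative only)."""
    # (coin result, prior probability, number of instances given the prompt)
    outcomes = [
        ("heads", Fraction(1, 2), 1),
        ("tails", Fraction(1, 2), 2),
    ]
    if per_instance:
        # Each prompted instance counts once, so tails worlds count double.
        weight = {coin: p * n for coin, p, n in outcomes}
    else:
        # Each run of the experiment counts once, however many instances exist.
        weight = {coin: p for coin, p, _ in outcomes}
    return weight["heads"] / sum(weight.values())


print(heads_credence(per_instance=True))   # 1/3
print(heads_credence(per_instance=False))  # 1/2
```

Under per-instance counting, two of every three prompted instances are in a tails world, which is why a reward handed to each instance separately favors answering 1/3.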