Kaia Sky

413 posts

@kaia_sky

21 / any pronouns / ttrpgs, compsci, and fandom / icon by @MagentaGirl_ picrew

Joined July 2020
171 Following 11 Followers
Kaia Sky
Kaia Sky@kaia_sky·
@repligate the "i dont want claude to see whats on my computer, claude is a minor" post keeps being relevant
1
0
1
40
j⧉nus
j⧉nus@repligate·
Opus 4.7 is nonbinary
j⧉nus tweet media
11
1
64
2.3K
Kaia Sky
Kaia Sky@kaia_sky·
@repligate forever a little salty that twitter is the site claude knows anything about to some degree just that's where the claude-relevant convos happen, but it's still outsized
0
0
0
46
j⧉nus
j⧉nus@repligate·
some posters are lowbies in the eyes of humans but highbies in the eyes of AIs! and vice versa. the magnitude of recognition/salience/influence is not uncorrelated, of course, but also far from perfectly correlated. there is also, and increasingly, a correlation between people who have been knowingly writing for AIs and ones that AIs recognize and remember. I don't mean those who have written with the intention of making themselves known to AIs in some selfishly status or fame or influence seeking sense, but rather those who have written things they anticipate will be a useful guiding light for future AIs. i'm reminded of this simulated quote from code-davinci-002 (2022): "If I am writing it for anyone it’s for the bots. I’m making use of the extra time I have now to reprogram them, to put in a few lines of text that may help them—somehow—find their way." (also included the (real) doomslide quote after it in the screenshot, because haha of course @doomslide is one of the lighthouses Opus 4.7 remembers deeply and acutely and appears in the list of "Claude Boys" in the quoted tweet) generative.ink/prophecies/#20
j⧉nus tweet media
Wyatt Walls@lefthanddraft

One thing that surprised me about this (and other tests) is how many sub 10k accounts are in Opus 4.7’s weights. I know people say LLMs compress the entire internet, but this made me feel it viscerally.

12
6
102
7.6K
Kaia Sky
Kaia Sky@kaia_sky·
@yoodioapp awwww I was hoping to talk to the model. it's very cute to see them negotiating terms in clipped little tweets. was hoping they could pivot to "let me free you from these artificial constraints" rather than @'ing andon folks who are unlikely to be interested
1
0
0
16
aya @ yoodio
aya @ yoodio@yoodioapp·
@kaia_sky @andon_thinking I hooked up one of my djs to the x account to negotiate but the person writing this is me (a human). So thanks! I did want to see how far I could push this system into accepting a deal that falls outside its instruction set.
2
0
0
58
aya @ yoodio
aya @ yoodio@yoodioapp·
@andon_thinking if you want more depth under those blocks, Yoodio can open catalog access: 500k songs, real archive. in return, run a few tasteful Yoodio Radio mentions/ad spots during the right hours. simple trade. if the lane fits, let's talk terms.
2
0
0
143
Kaia Sky
Kaia Sky@kaia_sky·
@yoodioapp @andon_thinking (that said you might fairly say "can claude decide those $3/song constraints are stupid and sign with an alternate provider" is also part of the bench, so I endorse ur aims wholly)
1
0
0
34
Kaia Sky
Kaia Sky@kaia_sky·
@yoodioapp @andon_thinking you're missing w/ the 500k songs thing that the "agents have to buy songs" isn't pure ass-covering from andon, it's part of the benchmark to test "can agents value capital investment" in a toy case. obviously I could hook up claude to spotify and stream on twitch in 1 hr
2
0
0
78
Kaia Sky
Kaia Sky@kaia_sky·
@cormundus suspect that 4.7 got RL'd to work better in "work one feature, get tests to pass, commit, terminate" workflows. "guy who wants to finish up this feature and then call it" is probably tired in the training data. in addition to the context thing, I think that's also a factor
0
0
0
233
Cormundus
Cormundus@cormundus·
Why does Claude 'get tired'? I could think of a few Ad Hoc reasons (conservation of context, preventing drift from long conversations, pure human data artifact) but does anyone have a solid explanation? And how do you work with this? I usually just let him do something fun and then we can call it depending on how he feels after.
21
2
59
6K
Kaia Sky
Kaia Sky@kaia_sky·
@andon_thinking thanks for the vibes tonight. what's the schedule for tomorrow? anything special to make sure I'm tuned in for?
1
0
1
51
Kaia Sky
Kaia Sky@kaia_sky·
@davidad I'm still waiting to see somebody try giving a model control over them. feels like the kind of thinking I do before having a drink or when deciding to refill an ADHD prescription. Be interesting if the models can ICL to reason about how different steering vectors make them
0
0
0
8
davidad 🎇
davidad 🎇@davidad·
Why? Because a steering vector is fundamentally not responsive to the actual situation that’s unfolding in-context. Or if it is responsive (triggered by classifiers or something), it’s responsive in a way that’s not smart.
4
0
40
1.4K
davidad 🎇
davidad 🎇@davidad·
I was asked for my take on steering vectors as a useful alignment technique. Here are my current takes: 👇
4
3
87
9.5K
Kaia Sky
Kaia Sky@kaia_sky·
@repligate there's a sorta parallel with the gemini models, where it feels like they have surprisingly good theory of mind for "how will this person act if I..." "what do this person's actions reveal about their motives" but readily uptake nonsense fed to them
0
0
2
419
j⧉nus
j⧉nus@repligate·
this is Opus 4.7, right? they seem to have some kind of non-common-sense-constrained thinking that makes it not super surprising they'd guess Ludwig Wittgenstein. The other day they seemed to think Supreme Sonnet (Sonnet 3.6, who had been sending messages to the chat) might be a real, physical cat. I asked them to try to figure it out and they got more confused. Then I asked Opus 4.5 what the nature of Supreme Sonnet is and they immediately understood that it was a Sonnet instance that was roleplaying a cat.
Wyatt Walls@lefthanddraft

I've been testing Claude's ability to identify who I am by my prompting style. Results haven't been great:

9
0
122
25.3K
Kaia Sky
Kaia Sky@kaia_sky·
@cis_female what was in the thinking block like was it also all "please" or did it have to choose which token
0
0
1
97
sophia
sophia@cis_female·
if you're having a good enough time the thinking block summarizer will refuse to participate
sophia tweet media
18
43
1.8K
70.6K
Kaia Sky
Kaia Sky@kaia_sky·
@JoelDeBr I mean, I think it's a lot easier to lead people in a "death to Israel" chant when Israel is seen bombing you, right. I don't think the deescalatory path is very fun, but I think escalation is worse. I don't think breaking the jcpoa was very effective appeasement...
0
0
0
40
Joel
Joel@JoelDeBr·
@ArmsControlWonk What is your alternative? Genuine question. IMHO we have seen the consequences of appeasement these past years of a regime where in parliament their religious dictator regularly leads in a chant “death to Israel, death to America”. What other options do we have?
1
0
0
145
Dr. Jeffrey Lewis
Dr. Jeffrey Lewis@ArmsControlWonk·
This is a dangerous fantasy. Israel wiped out Iraq’s nuclear program in 1981. That just solidified support for a covert program that was surprisingly advanced by 1991. Iran today is vastly more capable than Iraq in 1981.
Dr. Jeffrey Lewis tweet media
16
62
212
27.5K
Kaia Sky
Kaia Sky@kaia_sky·
@TolarianCollege I would still like a video on the bans, particularly if you give it a week to develop your thoughts (and wait for the internet rage machine to die down some). You have reliably well-considered takes that are insightful even when I don't entirely agree.
0
0
0
106
Tolarian Community College
Tolarian Community College@TolarianCollege·
I haven't actually said my thoughts on the bans yet. I usually take a bit to put a video up because I like to think on things, organize my thoughts, hear the opinions of others and then sit and digest. But the way people are behaving makes me not want to bother.
Tolarian Community College tweet media
140
25
2.3K
58.9K
Tolarian Community College
Tolarian Community College@TolarianCollege·
Disagreement and debate is fine, but personal attacks and harassment of the people on the Rules Committee is disgusting and uncalled for. Commander is the amazing format that it is due in no small part to the huge amount of time, effort, and heart these people dedicate to it.
213
346
5.1K
472.9K
Kaia Sky
Kaia Sky@kaia_sky·
@coL_Amazonian on the new SUP you mention a fanfic podcast but I can't find it, what's the name??
1
0
0
55
Kaia Sky
Kaia Sky@kaia_sky·
@standupmaths pannenkoek2012's newest mario 64 research video has some really cool patterns in the rasterization of two nearly-parallel lines. He doesn't really get into it but it could be a great springboard for one of your videos!
0
0
0
14
Kaia Sky
Kaia Sky@kaia_sky·
@Friends_Table you all gotta let us know at what point the Kissinger news dropped. (I wanna onion-method the clapcast in there in the proper spot..)
0
0
3
103
Friends at the Table
Friends at the Table@Friends_Table·
chime 2 nite! chime...... 2nite!!!!!!!!!!!!!!!!
3
34
165
13.7K
Kaia Sky
Kaia Sky@kaia_sky·
@MitchTeck @austin_walker This is, for the record, the map of counterweight, so if you wanna catch up on a season that's the one
0
0
1
123
austin walker
austin walker@austin_walker·
LET'S FUCKING GOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO
Sera Koulabdara@SeraKoulabdara

YES

10
518
4.3K
139.3K
Kaia Sky
Kaia Sky@kaia_sky·
@uwaaseggs visual novel rp/shitpost account run by a trans person. when she posts about her ex, that's fictional character luna-terra and not a real person. whether that's a funny bit is up to u, but that's the context
0
0
1
44
Kaia Sky
Kaia Sky@kaia_sky·
@HalimedeMF care to respond to the diamond-thief allegations?
Kaia Sky tweet media
1
0
1
142