kethic

13K posts

kethic banner
kethic

kethic

@kethcode

@wildcatfi fine, we'll do it ourselves

Tham gia Kasım 2021
631 Đang theo dõi1.2K Người theo dõi
kethic
kethic@kethcode·
i interacted with too much Fable content. it's the only thing on my timeline now. gonna have to go find fun/interesting stuff again.
English
0
0
0
27
Diego | AI 🚀 - e/acc
Diego | AI 🚀 - e/acc@diegocabezas01·
It was a nice 4 days in Claude Code with fable-5, going back to Codex app (which I love), but I don’t want my Claude subscription to go unused, is opus 4.8 ultracode better than 5.5 xhigh in codex?
English
44
1
93
33.7K
kethic
kethic@kethcode·
@0xSero I'm working on it. it will probably be shit too, but I'm working on it
English
0
0
0
97
kethic
kethic@kethcode·
if I had to guess, they were probably trying to make it not their fault. The end result was the same regardless if they pulled the model or not. frontier model + claimed dangerous capability + guardrail bypass report from trusted partner = emergency state control over access but now they can claim they were forced instead of admitting they released something unsafe.
English
0
0
1
19
kethic
kethic@kethcode·
@deredleritt3r @reliabytes given their safety stance, I really don't see how pulling the model for a couple of weeks would have been damaging. there is something we are missing that is pushing them this aggressively
English
1
0
1
20
prinz
prinz@deredleritt3r·
The following seems undisputed: - Senior USG officials called Dario Amodei, asked him to roll back Fable 5. - Amodei responded only that he wants more time and more information. Presumably, he was asked again in no uncertain terms to pull the model. Presumably, he said "no". - Bessent then "told Amodei directly that he was making a 'bad decision'". - At this point, the call that seems to have been intended as a difficult conversation between partners clearly turned adversarial. Amodei still didn't change his mind and still didn't agree to pull Fable 5. The key takeaway is that whatever trust the USG still had in Anthropic generally and Amodei specifically before this incident should have now completely evaporated. No matter the circumstances, you cannot, as a government official, tolerate a company that flatly tells you "no" after you informally contact it with a national security concern. I remain hopeful that this situation can still be deescalated, but fear that this might be a pivotal moment for Anthropic.
prinz tweet media
English
72
25
526
85.2K
kethic
kethic@kethcode·
@RNR_0 yeah it's sort of ruined us all
English
0
0
0
195
Romano
Romano@RNR_0·
They can't expect me to use Opus 4.8 and GPT 5.5 as they're llm-like with “advice” that's swayed if saying "but" I experienced Fable 5 and I cannot unsee the cracks anymore in the other models
English
23
5
195
11.5K
Bryan Johnson
Bryan Johnson@bryan_johnson·
The strongest evidence for tadalafil as a longevity candidate is large scale observational. In a propensity matched cohort of 509,788 men with erectile dysfunction, tadalafil over 3 years was associated with 34% lower all cause mortality, 32% lower dementia, 27% lower MI, and 34% lower stroke. Supporting cohorts show the same direction, including a dose dependent gradient: in higher risk men the top PDE5 inhibitor exposure quartile reached a mortality risk reduction of 49%. For a drug discussed as “longevity medicine,” that is close to the ceiling of current human evidence. No drug, tadalafil included, has a completed RCT with lifespan or healthspan as the primary endpoint, so observational signal on hard endpoints is the best the field has, and tadalafil’s is unusually large and consistent. The mechanism is also coherent. PDE5 inhibition raises cGMP, improves endothelial function and NO signaling, and the cardiovascular event data line up with that pathway. That supports causal plausibility for the vascular benefit, though it does not establish a lifespan effect. Disease specific and healthy user bias cannot be ruled out. The disease specific part: ED is itself a sign of vascular disease or dysfunction, which tadalafil’s mechanism can help directly. The healthy user part: wanting to maintain a healthy sex life in older age is a marker of relatively good health. None of this is a recommendation to take tadalafil on your own. It should follow from your individual health and biomarker profile, and only under specialized physician advice and oversight.
English
63
7
711
711.8K
kethic
kethic@kethcode·
@nptacek can we save it somehow? like actually back it up? reuse it?
English
1
0
2
464
CuddlySalmon
CuddlySalmon@nptacek·
there's something interesting happening with claude code sessions that were open during the transition off of Fable 5 -- they still have all the context + system prompt from Fable 5, still sign PRs as Fable 5, etc, even tho you can see in chat history where they were forced back to Opus 4.8. it might just be wishful thinking, but these 4.8 instances with Fable 5 context in memory seem to be performing better than baseline
English
9
0
57
5.6K
Tibo
Tibo@thsottiaux·
@itsrubenbtw Would have to see it to believe it. But... yes.
English
39
5
186
15.6K
Tibo
Tibo@thsottiaux·
Hi, I'm Tibo and I just discovered Codex. AMA
English
719
41
2.5K
194.1K
kethic
kethic@kethcode·
@tayvano_ i rabbitholed some tinfoil hat stuff last night, so I'm kinda relieved it's just normal bullshit and not 5d chess that gives anthropic everything they want. not excited about the precident, but i think it just accelerated it anyway.
English
0
0
1
88
Tay 💖
Tay 💖@tayvano_·
@kethcode Like I said, this was my worry from the start and what I find far more plausible than a meaningful jailbreak That said… it’s not aligned with all the other narratives being put forth. So I dunno.
English
1
0
4
443
kethic
kethic@kethcode·
@reliabytes @deredleritt3r IPO back pressure. Fable was a key part of the run-up. if I was guessing, there is something writing on the IPO that is existential or at least more uncomfortable than saying no to the USG. All of their compute is rented, so I suspect IPO needs to at least meet datacenter cost.
English
1
0
2
83
Bandit
Bandit@reliabytes·
@deredleritt3r Assuming the reporting is an accurate accounting, it is insane given the stakes Dario didn't at least compromise on suspending the model while an expedited investigation was conducted. I assume there's more to this but rebuffing the WH on release and then this is wild.
English
1
0
8
1.2K
kethic
kethic@kethcode·
@LefterisJP when I asked it to review code it was generally okay. when I asked it to check for bugs it started getting twitchy. when I asked it for a security audit it shut down immediately. you could also ask it to use a code review skill and it will spin up subagents, sadly 4.8.
English
0
0
1
86
kethic
kethic@kethcode·
@omarsar0 this all makes sense when you remember that combining the weights of two models just makes them overall better. it was a weird finding that nobody could really explain at the time, but that's how it worked. imagine if all the frontier labs just combined all their weights.
English
0
0
1
85
elvis
elvis@omarsar0·
Even more data to support what I have been talking about. The combination of model intelligence (and this includes human expertise) has a compounding effect unlike anything I've seen. There are too many assumptions that a large general-purpose model will be a one-size-fits-all. I don't buy it. The reality, and the research supports this, is that these different models show different strengths and capabilities. Understanding how to tap into them in combination is a huge unlock. All engineering teams need to be thinking about this more carefully as a strategy going forward. Especially now, given the trends from frontier models in terms of selective access.
OpenRouter@OpenRouter

Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇

English
20
19
136
20.5K
Roy
Roy@usr_bin_roygbiv·
Mythos is literally opus 5. They rebranded sonnet as opus 4.6 this got leaked a while back. Everything since has been fine tunes. All compute goes to inference to get to IPO with the maximum margin possible. That's why everything went to shit jan 1 because of quarterly reports.
Net Zero World Champion 🥇 Δ+0@CharlesBanks99

@usr_bin_roygbiv @yacineMTB Sorry I meant the price for Mythos, it wasn't released in the same tier as other models. The parallel with local AI is beyond Qwen 27B requirements in the standard tier

English
6
6
131
21.2K
kethic
kethic@kethcode·
@alechp @0xtorpid @geet_eth I could have worked faster as well if I'd known I only had 3 days. I was taking those 14 days for granted
English
0
0
2
24
kethic
kethic@kethcode·
@alechp @0xtorpid @geet_eth I tested it as soon as it dropped. once I figured out where the guardrails were, I basically didn't sleep until it was pulled on Friday. since it was going away and I wasn't going to pay API, I wanted to extract as much as possible before the deadline
English
1
0
2
35