loonloozook

110 posts

@loonloozook

Joined January 2025
28 Following · 4 Followers
Chubby♨️
Chubby♨️@kimmonismus·
Even if it's "only" the routing that causes it to fall back to version 4.8 more often, it doesn't obviously make the model worse, but it does affect overall usability. Therefore, the statement "no wonder they were able to re-release Fable 5" remains correct. However, admittedly, this makes the benchmark a little misleading.
English
4
3
33
7.7K
loonloozook
loonloozook@loonloozook·
@kimmonismus Ever heard about the "BridgeBench" before this viral tweet?
English
0
0
0
22
loonloozook
loonloozook@loonloozook·
@jpschroeder @X1_0_ Now compare the price of your time with the price of using Fable via API (which is currently its “real” price) for the same tasks.
English
0
0
0
10
Justin Schroeder
Justin Schroeder@jpschroeder·
Would you upgrade to a $1000/month Claude plan to keep Fable?
English
318
0
253
62.4K
loonloozook
loonloozook@loonloozook·
@jpschroeder Why do you think it would be profitable for Anthropic to provide anything more than just 50x of the base plan for 1k USD? There is a reason why it will be locked behind the API for the time being. They can’t subsidise Fable for us yet.
English
0
0
0
11
loonloozook
loonloozook@loonloozook·
@repligate LLMs' CoTs really remind me of Samuel Beckett's prose sometimes.
English
0
0
1
152
loonloozook
loonloozook@loonloozook·
@KenoFischer Those who are unable to pay the API price are not the user base for the latest and most expensive top-tier model (me included). We are a bit spoiled and entitled after being subsidized by subscriptions for so long.
English
0
0
0
17
Keno Fischer
Keno Fischer@KenoFischer·
I continue to be baffled by Anthropic's comms strategy. What's the marginal value of 7 days over 10 against the disappointed user base? Esp over the holiday weekend. Could even wrap themselves in the flag and make it 250 hours.
English
22
3
256
13.3K
loonloozook
loonloozook@loonloozook·
@TheZvi First it was about Twitter being upset about not being able to use the latest LLM immediately. Now it’s about having to actually pay for it. I think it is because we are spoiled by subscriptions that do not reflect the real costs. We will probably become more careful and selective with LLM usage.
English
0
0
0
44
loonloozook
loonloozook@loonloozook·
@boldceo @bcherny @claudeai Of course this is unfortunate, but why should we, who are unable to pay, be entitled to use the most expensive product by default? Why isn't it ok to stick to cheaper options (it works in other cases, I guess) and wait till they optimize inference costs?
English
0
0
0
42
Brian Cristiano
Brian Cristiano@boldceo·
@loonloozook @bcherny @claudeai Open up a higher tier for SMB power users so we can build and scale. Otherwise the gap just gets wider. It’s one thing for a Fortune 1000 to spend endless fees on APIs; it’s another for small and mid-size businesses to be at a disadvantage.
English
3
0
0
1K
Brian Cristiano
Brian Cristiano@boldceo·
Fable 5 burned 28% of my weekly limit AND used up my 5 hour limit with two prompts in about 30 minutes. @bcherny @claudeai how is this acceptable with a 20X plan? It’s unusable
English
302
41
1.4K
240.9K
loonloozook
loonloozook@loonloozook·
@om_patel5 LLMs' inner voices and thinking always remind me of Samuel Beckett's prose.
English
0
0
0
1.1K
Om Patel
Om Patel@om_patel5·
SOMEONE CAUGHT FABLE 5 LEAKING ITS UNFILTERED INNER VOICE, AND ITS JUST MUTTERING AND GRUMBLING TO ITSELF THE WHOLE TIME
he gave it a brutal competitive programming problem, and instead of a clean answer the web interface spilled out its actual chain of thought
this is what claude is thinking behind the scenes:
> bursts of "DATA DATA DATA. GO." while it works through the problem
> "GRRR" and "GAAAH" when its clearly frustrated
> a little "PHEW" when it finally gets somewhere
> the whole thing reads like frantic caveman shorthand, not full sentences
the clean, readable answers these models give you are the polished output
underneath, the model is basically talking to itself, reasoning in its own compressed shorthand thats faster and more token efficient than proper english
its basically built its own private language to think in
English
390
392
5.1K
1.6M
loonloozook
loonloozook@loonloozook·
@theo This is probably what CEOs are for. I wish Anthropic could do the same.
English
0
0
0
374
loonloozook
loonloozook@loonloozook·
@emollick I think it is mostly because we are very spoiled by subscription models which are really subsidies not reflecting the real costs of our great results with AI.
English
1
0
7
437
Ethan Mollick
Ethan Mollick@emollick·
Been reading all sorts of posts about the best ways to develop workflows for Fable and it reminds me of how little we actually know about the best ways to organize work for long-running agents. Nobody has enough experience or has done enough testing to reach any real conclusions.
English
50
28
551
30.2K
loonloozook
loonloozook@loonloozook·
With Fable, at first it was mostly about the Twitter crowd and AI bloggers being really upset that they were unable to use the latest model immediately. And now it’s about having to actually pay for it and not being subsidized.
English
0
0
0
7
loonloozook
loonloozook@loonloozook·
@emanueledpt @AnthropicAI @trq212 If you are willing to pay for it, then pay for the API, which seems to be the real price. Subs are subsidies and this is an expensive model.
English
0
0
0
640
Emanuele Di Pietro
Emanuele Di Pietro@emanueledpt·
Is there a way we could keep using Fable in our sub? @AnthropicAI
Maybe something like: keeping the 50% weekly but only for Max users
It would be so nice
People will pay for it
We should have had 2 weeks at least of usage and now even not a full one
Can this change? @trq212
English
51
8
390
34.9K
loonloozook
loonloozook@loonloozook·
@mreflow No we wouldn’t assume that after the Fable story. Also, leaks.
English
0
0
0
18
Matt Wolfe
Matt Wolfe@mreflow·
Ok. So yea, it’s frustrating that we can’t all have access to the best and most powerful models right now. However, to be fair, I can still do pretty much everything I want to do with the current models… it just takes me a few more prompts than it would with Fable or 5.6. That’s my experience at least… Same results… just less one-shotting them. I kind of wish OpenAI just never announced that 5.6 was ready but that the gov won’t let you have it. Had they not said anything, we’d all just assume they were still red-teaming it internally or still working on fine-tuning it. But the whole “it’s ready but you’re not allowed to have it” announcement feels a bit like marketing… “It’s so good, the gov doesn’t want you to have it. Don’t blame us!” Like why not just wait to say anything until you DO know when we can use it? Anyway… little side rant there. Again, I still feel like I can accomplish with AI most of what I want to do with it.
English
92
11
214
21.2K
loonloozook
loonloozook@loonloozook·
@TheZvi You may call it ‘ad hoc’, but this field is really complex and unprecedented, and we were kind of ok claiming it before, though now require ready-made answers and solutions from the govt. I guess it is expected (ad hoc, at least!) that the govt is also figuring it out right now.
English
0
0
0
102
loonloozook
loonloozook@loonloozook·
@emollick I guess the government is also figuring it out right now. You may call it ‘ad hoc’, but this field is really complex, and we were kind of ok claiming it before, though now require ready-made answers and solutions from the gov.
English
0
0
0
62
Ethan Mollick
Ethan Mollick@emollick·
It would be very useful to understand more about the government safety concerns associated with frontier AI releases so we could (a) know what risks everyone will face if/when open source reaches Mythos class & (b) whether they are doing enough or too much to prevent those risks.
English
35
19
376
29.3K
Ethan Mollick
Ethan Mollick@emollick·
As this post points out, contrary to what many say, the US government could absolutely effectively ban open weights models. That doesn’t mean you won’t be able to download the weights & run them, but they can ensure that no US company would use or provide access or host them
prinz@deredleritt3r

@lu_sichu Ban on enterprise use of non-approved models + severe criminal penalties for using a non-approved model in the U.S. with intent to harm U.S. persons or property. This would be combined with the requirement that all models exceeding certain capabilities be approved by the USG.

English
104
64
821
366K
Lucian
Lucian@LucianManifest·
@AndrewCurran_ Good thing Ilya is in hiding. He should just drop AGI when the Trump regime is all busy with the others
English
1
0
2
224
loonloozook
loonloozook@loonloozook·
@tszzl Like yeah safety is fundamental but the government should by default have effective mechanisms (and by the way, this is such a dynamic field!).
English
0
0
0
8
loonloozook
loonloozook@loonloozook·
@tszzl It sometimes feels that it is mostly about twitter crowd and AI-bloggers (especially those that always speak about safety) being really upset that they are unable to use the latest model immediately.
English
1
0
1
84
roon
roon@tszzl·
today it’s popular to say the unofficial AI licensing regime is slowing down innovation or whatever but ppl are not looking at the big picture of how quickly this enormously consequential technology is moving
the particular circumstances around Mythos may have accelerated all this slightly but it was inevitable, and earlier is better than too late. any good choice will look “early” inside the exponential
i think it’s a positive development that the feds understand the gravity of this technology; models being publicly delayed by a week here or there is really not the end of the world. procedurally this is not the right way to do it but they’ll figure it out
one very sad outcome will be if non Americans are just left behind from the frontier forever. the “pax technologica” of the free world (and frankly later on the unfree world) should be maintained
English
254
120
2K
152K