Louis

8.1K posts

Louis

Louis

@logicus

philosophy phd candidate, doing science with agents, views my own

San Diego, CA Katılım Ağustos 2015
489 Takip Edilen514 Takipçiler
Sabitlenmiş Tweet
Louis
Louis@logicus·
i'm bout to go thales on these milesians rn
English
0
0
2
184
Craig Weiss
Craig Weiss@craigzLiszt·
wifi is more essential than water
English
51
7
127
5.3K
Louis
Louis@logicus·
i always took “age of research” to mean attention is not all you need, but i don’t know now.
English
0
0
2
129
Jerry Tworek
Jerry Tworek@MillionInt·
The technology is already largely here, with consistent trend over many years, disrupting consumer and industry workflows. Demonstrating leaps of progress each year. Building largest companies in the world in the blink of an eye. And yet likely less than ten people in the world don’t think of it in too small terms.
English
29
13
357
34K
Louis
Louis@logicus·
trying to make sense of the model. i think the calculus is, access restriction is better than innovation ecosystem / broad adoption for preserving the capabilities edge, and all you need is chips and scaling to preserve that, not some complex feedback loop between performance and adoption.
English
0
0
0
63
Miles Brundage
Miles Brundage@Miles_Brundage·
Hardly the most important thing going on these days but FWIW I would have called this a blog post, analysis, article, etc., not a paper - undersell/overdeliver etc... bit of a stretch x.com/AnthropicAI/st…
Anthropic@AnthropicAI

We've published a paper that explains our views on AI competition between the US and China. The US and democratic allies hold the lead in frontier AI today. Read more on what it’ll take to keep that lead: anthropic.com/research/2028-…

English
8
1
66
8.1K
Louis
Louis@logicus·
if what’s bad is not having paid a cost to write a text then the cost to write the text can just be paid toward the cost of generating and curating the text well, toward engineering a good text, no offense. there are many cases where ai writing is exceptional with marginal human taste and touch applied to it. what’s bad is tasteless ai generated writing. the application of human taste to ai-generated writing might be low cost locally because it’s a dividend of being able to read and write or engineer well, and those skills cost a lot.
eigenrobot@eigenrobot

asking people to read ai-generated text is offensive. this is not because ai text is intrinsically bad. rather, the author has not paid a cost to write the text himself. this cost is a credible signal he finds its communication important. so: not paying that cost is telling

English
1
0
2
3.2K
Louis
Louis@logicus·
@kalomaze yeah. i still think it's risky for them. i suspect that consumers and a lot of enterprise will end up pissed for many of the same reasons and that enterprise is just slow to react.
English
0
0
4
211
Louis retweetledi
ChatGPT
ChatGPT@ChatGPTapp·
A preview for Pro users: a new personal finance experience in ChatGPT. Pro users in the U.S. can securely connect financial accounts, see where their money is going, and ask questions based on the information they choose to connect. Your full financial picture, now in ChatGPT.
English
1.1K
1.4K
21.9K
13.1M
Louis
Louis@logicus·
@morqon i canceled claude max which i'd had since last july. they probably are happy to lose claude max guys. their business must be high end customers who have a huge spend. given everything i know about ai... i think many of those orgs must just be slow to react.
English
0
0
0
26
morgan —
morgan —@morqon·
@logicus “prosumer peasant rebellion” is nice, and yes, they seem determined to test the limits of demand, so far so inelastic
English
1
0
1
18
morgan —
morgan —@morqon·
“to be clear, compute has never been a limiter in our rollout”
Logan Graham@logangraham

A lot of people have been wondering about Mythos, Glasswing, and the vulns we / our partners are fixing. Today, I’m excited for us to start sharing more. (For context, I lead Glasswing @AnthropicAI.) Two independent evaluations this week—from XBOW and the UK AISI—confirm what we've been seeing internally: Claude Mythos Preview is a step change in autonomous cybersecurity capabilities. We need to start preparing fast for a world of models with this level of capabilities. The UK AI Security Institute tested the model we shipped at the launch of Project Glasswing and found Mythos Preview is the first model to solve both of their end-to-end cyber ranges, including one (Cooling Tower) which no model had ever cleared. But attackers (and defenders) have sophistication & cost constraints – Mythos is also the only model that clears every one of their tasks estimated over 8 hours under their deliberately low 2.5M-token cap. XBOW tested it on their offensive security benchmarks, finding "token-for-token, unprecedented precision." It's the only model to succeed at subtle V8 sandbox work. Other Glasswing partners shared similar stories. In a few weeks of testing, Mythos Preview has helped them find many thousands of (estimated) high + critical severity vulnerabilities, sometimes double what they'd normally find in a year. I don't share this to boost Mythos. In fact, this is not about Mythos. It’s about preparing for the coming world of models being better, faster, cheaper, and more creative than some of the best human experts at dual use capabilities. Clearly, we need them supporting defenders as widely as can be done safely – and especially the least resourced ones. Within a year, Mythos will probably look quite dumb (relative to other new models). And others may release openly available or unguardrailed models of Mythos-level capabilities. We started Project Glasswing because capabilities like Mythos Preview's won't stay rare, or stay in careful hands. We are bringing it to defenders as fast as we responsibly can, while working to figure out, for example, the right safeguards and patching & disclosure processes. Also, to be clear, compute has never been a limiter in our rollout. Expect a fuller update on our Glasswing work in the coming days. XBOW report: xbow.com/blog/mythos-of… UK AISI report: aisi.gov.uk/blog/how-fast-…

English
4
0
14
2.9K
Louis
Louis@logicus·
hm, is anthropic is deliberately pissing off only the type of user it's happy to lose?
English
2
0
8
12.5K
Louis
Louis@logicus·
yeah, that’s a good point. glasswing is valid, and it could also work for that. most security reports have said mythos is very good though not always better and that it’s super expensive. maybe govt has to buy it. for ant i worry about the prosumer peasant rebellion that’s been bubbling. it just seems like they don’t care about users that much. except indirectly via the glasswing hardening pass.
English
1
0
1
40
morgan —
morgan —@morqon·
@logicus i take it as an incentive to become critical infrastructure
English
1
0
1
22
Louis
Louis@logicus·
@Kpaxs okay but also yes you can
English
0
0
0
47
Kpaxs
Kpaxs@Kpaxs·
You cannot logic someone out of a position that is currently regulating their fear, status, identity, or sense of belonging.
English
54
537
2.8K
69K