Louis

8.1K posts

Louis

@logicus

philosophy phd candidate, doing science with agents, views my own

San Diego, CA Katılım Ağustos 2015

489 Takip Edilen514 Takipçiler

Sabitlenmiş Tweet

Louis@logicus·1d

i'm bout to go thales on these milesians rn

English

184

Louis@logicus·9s

we are with ted on this x.com/TedCaseHill/st…

Ted Hill@TedCaseHill

@eigenrobot @logicus @grok Masterful

English

Louis@logicus·3m

before the dsm was, i am

eigenrobot@eigenrobot

@logicus @grok explain to him how hes missing the point on my tweet

English

Louis@logicus·13h

@craigzLiszt you die in 17 hours without wifi

English

Craig Weiss@craigzLiszt·20h

wifi is more essential than water

English

127

5.3K

Louis@logicus·13h

@shinboson can’t rt 🤷‍♂️

English

Louis@logicus·13h

i always took “age of research” to mean attention is not all you need, but i don’t know now.

English

129

Louis@logicus·13h

@MillionInt so tedai by 2030 or no?

English

281

Jerry Tworek@MillionInt·14h

The technology is already largely here, with consistent trend over many years, disrupting consumer and industry workflows. Demonstrating leaps of progress each year. Building largest companies in the world in the blink of an eye. And yet likely less than ten people in the world don’t think of it in too small terms.

English

357

34K

Louis@logicus·15h

trying to make sense of the model. i think the calculus is, access restriction is better than innovation ecosystem / broad adoption for preserving the capabilities edge, and all you need is chips and scaling to preserve that, not some complex feedback loop between performance and adoption.

English

Miles Brundage@Miles_Brundage·18h

Hardly the most important thing going on these days but FWIW I would have called this a blog post, analysis, article, etc., not a paper - undersell/overdeliver etc... bit of a stretch x.com/AnthropicAI/st…

Anthropic@AnthropicAI

We've published a paper that explains our views on AI competition between the US and China. The US and democratic allies hold the lead in frontier AI today. Read more on what it’ll take to keep that lead: anthropic.com/research/2028-…

English

8.1K

Louis@logicus·22h

if what’s bad is not having paid a cost to write a text then the cost to write the text can just be paid toward the cost of generating and curating the text well, toward engineering a good text, no offense. there are many cases where ai writing is exceptional with marginal human taste and touch applied to it. what’s bad is tasteless ai generated writing. the application of human taste to ai-generated writing might be low cost locally because it’s a dividend of being able to read and write or engineer well, and those skills cost a lot.

eigenrobot@eigenrobot

asking people to read ai-generated text is offensive. this is not because ai text is intrinsically bad. rather, the author has not paid a cost to write the text himself. this cost is a credible signal he finds its communication important. so: not paying that cost is telling

English

3.2K

Louis@logicus·1d

@kalomaze yeah. i still think it's risky for them. i suspect that consumers and a lot of enterprise will end up pissed for many of the same reasons and that enterprise is just slow to react.

English

211

kalomaze@kalomaze·1d

once you realize that their money lives predominantly in enterprise & the API, you realize that the consumer product heat they get on here is ~mostly void yelling honestly controlled consumer growth is what they want atm, they almost certainly can't handle OpenAI's consumer MAU

Louis@logicus

hm, is anthropic is deliberately pissing off only the type of user it's happy to lose?

English

194

12.2K

Louis retweetledi

ChatGPT@ChatGPTapp·1d

A preview for Pro users: a new personal finance experience in ChatGPT. Pro users in the U.S. can securely connect financial accounts, see where their money is going, and ask questions based on the information they choose to connect. Your full financial picture, now in ChatGPT.

English

1.1K

1.4K

21.9K

13.1M

Louis@logicus·1d

@morqon i canceled claude max which i'd had since last july. they probably are happy to lose claude max guys. their business must be high end customers who have a huge spend. given everything i know about ai... i think many of those orgs must just be slow to react.

English

morgan —@morqon·1d

@logicus “prosumer peasant rebellion” is nice, and yes, they seem determined to test the limits of demand, so far so inelastic

English

morgan —@morqon·2d

“to be clear, compute has never been a limiter in our rollout”

Logan Graham@logangraham

A lot of people have been wondering about Mythos, Glasswing, and the vulns we / our partners are fixing. Today, I’m excited for us to start sharing more. (For context, I lead Glasswing @AnthropicAI.) Two independent evaluations this week—from XBOW and the UK AISI—confirm what we've been seeing internally: Claude Mythos Preview is a step change in autonomous cybersecurity capabilities. We need to start preparing fast for a world of models with this level of capabilities. The UK AI Security Institute tested the model we shipped at the launch of Project Glasswing and found Mythos Preview is the first model to solve both of their end-to-end cyber ranges, including one (Cooling Tower) which no model had ever cleared. But attackers (and defenders) have sophistication & cost constraints – Mythos is also the only model that clears every one of their tasks estimated over 8 hours under their deliberately low 2.5M-token cap. XBOW tested it on their offensive security benchmarks, finding "token-for-token, unprecedented precision." It's the only model to succeed at subtle V8 sandbox work. Other Glasswing partners shared similar stories. In a few weeks of testing, Mythos Preview has helped them find many thousands of (estimated) high + critical severity vulnerabilities, sometimes double what they'd normally find in a year. I don't share this to boost Mythos. In fact, this is not about Mythos. It’s about preparing for the coming world of models being better, faster, cheaper, and more creative than some of the best human experts at dual use capabilities. Clearly, we need them supporting defenders as widely as can be done safely – and especially the least resourced ones. Within a year, Mythos will probably look quite dumb (relative to other new models). And others may release openly available or unguardrailed models of Mythos-level capabilities. We started Project Glasswing because capabilities like Mythos Preview's won't stay rare, or stay in careful hands. We are bringing it to defenders as fast as we responsibly can, while working to figure out, for example, the right safeguards and patching & disclosure processes. Also, to be clear, compute has never been a limiter in our rollout. Expect a fuller update on our Glasswing work in the coming days. XBOW report: xbow.com/blog/mythos-of… UK AISI report: aisi.gov.uk/blog/how-fast-…

English

2.9K

Louis@logicus·1d

hm, is anthropic is deliberately pissing off only the type of user it's happy to lose?

English

12.5K

Louis@logicus·1d

@47fucb4r8c69323 $10k is a lot of money

English

47fucb4r8curb4fc8f8r4bfic8r@47fucb4r8c69323·1d

No one has things as figured out as you think they do.

English

975

Louis@logicus·1d

yeah, that’s a good point. glasswing is valid, and it could also work for that. most security reports have said mythos is very good though not always better and that it’s super expensive. maybe govt has to buy it. for ant i worry about the prosumer peasant rebellion that’s been bubbling. it just seems like they don’t care about users that much. except indirectly via the glasswing hardening pass.

English

morgan —@morqon·1d

@logicus i take it as an incentive to become critical infrastructure

English

Louis@logicus·1d

@Kpaxs okay but also yes you can

English

Kpaxs@Kpaxs·1d

You cannot logic someone out of a position that is currently regulating their fear, status, identity, or sense of belonging.

English

537

2.8K

69K

Keşfet

@craigzLiszt @shinboson @MillionInt @kalomaze @morqon @47fucb4r8c69323 @elonmusk @BarackObama