Dominik Lukes

15.8K posts

Dominik Lukes

@techczech

Exploring schemas and propositions about language models of all kinds on https://t.co/GU07uzb7Ud and on https://t.co/UfdxBd7jvK.

UK Katılım Nisan 2009

796 Takip Edilen2K Takipçiler

Sabitlenmiş Tweet

Dominik Lukes@techczech·14 Ağu

@kepano Civilisation is built on delegating understanding. Doing your own understanding needs to be very strategic. In most situations, it becomes the equivalent of growing your own vegetables. It's enjoyable but does not meaningfully contribute to your nutrition.

English

4.1K

Dominik Lukes@techczech·7h

No respect for intellectual heritage ...

Nathaniel Rakich@baseballot

ABC News has now taken all FiveThirtyEight articles completely offline. They now redirect to abcnews dot com/politics. A needless erasure of thousands of pages of knowledge.

English

Dominik Lukes@techczech·7h

@akoustov Now let's do life time bans for correct references that clearly support the opposite of what the author claims they do or are about something entirely irrelevant.

English

486

Alexander Kustov@akoustov·1d

Good idea. Folks at other journals should be doing the same. Enforcement of professional norms matters.

Andrew White 🐦‍⬛@andrewwhite01

hallucinated references will land you a 1-year ban from arxiv now. wow

English

129

19.4K

Dominik Lukes@techczech·7h

Bitter lesson comes for the X recommendation algorithm: "We have eliminated every single hand-engineered feature and most heuristics from the system. The Grok-based transformer does all the heavy lifting by understanding your engagement history (what you liked, replied to, shared, etc.) and using that to determine what content is relevant to you."

Elon Musk@elonmusk

The latest 𝕏 algorithm has been published to GitHub github.com/xai-org/x-algo…

English

Dominik Lukes@techczech·10h

Yes, funnily enough the diffs you get in the developer view are very useful for non developers.

Ethan Mollick@emollick

Codex is very good, but it is still a very "developer coded" interface for an everything app. And it continues the somewhat annoying AI perspective that non-coders are just not as competent and need stuff hidden from them, as opposed to requiring a different form of complexity.

English

Dominik Lukes@techczech·15h

@Makuh90 @UIEnthusiasts Windows just makes some system level things harder. In many ways, it is better than MacOS but having a Unix core is just better in the agentic era. Also, the hardware fragmentation does not help.

English

297

Merk@Makuh90·18h

@UIEnthusiasts In what sense?

English

3.6K

Merk@Makuh90·1d

Sam. Windows users are getting pretty sick of being second tier.

Sam Altman@sama

Codex in the ChatGPT mobile app!

English

199

799

93.9K

Dominik Lukes@techczech·16h

My quick verdict on @raycast v2 beta: overall improvement, can't notice any speed loss on M4 Macbook Air (and I have loads of extensions) - perhaps a tiny ms dely on ⌘ + K but that could be an illusion. - cloudsync can't come soon enough - new settings are great - new file search a huge improvement - snippet tagging was long awaited Well done @thomaspaulmann and team.

English

Dominik Lukes@techczech·16h

@thomaspaulmann @ps73yk @raycast I don't mind rounded corners but when I want an application window to fill the desktop space (without going full screen) I want it to fill the whole thing, not leave distracting pixels in the top corners.

English

Thomas Paul Mann@thomaspaulmann·16h

@techczech @ps73yk @raycast The different corner radii are a bit annoying. I do like the more rounded windows, though.

English

Thomas Paul Mann@thomaspaulmann·1d

One app, two platforms, four programming languages. The things that look the simplest are often the hardest to build. @raycast is one of them. Here's a technical deep dive on how we built v2 👉 ray.so/v2-deep-dive

English

514

72.9K

Dominik Lukes@techczech·16h

@ps73yk @thomaspaulmann @raycast Wish there were step 2 to get rid of the annoying rounded corners.

English

Dominik Lukes@techczech·16h

@ps73yk @thomaspaulmann @raycast Tahoe Step 1

English

Dominik Lukes@techczech·16h

@Dimillian Only runs in default permissions - needs autoreview mode.

English

Thomas Ricouard@Dimillian·21h

It’s time. Now you’ve tried our Codex mobile, tell me the top missing feature for your workflow! And yes, we’re aware of the current bugs and shortcomings. We’re working hard on it!

English

287

527

35.6K

Dominik Lukes@techczech·1d

@thomaspaulmann @raycast Looking forward to it. Raycast makes MacOS tolerable for me. It's my main interface to the system.

English

Thomas Paul Mann@thomaspaulmann·1d

@techczech @raycast Thanks! While the first version of Tahoe had its issues, the latest ones are good. I'm sure you won't regret it, and hopefully we play a major part in helping you feel at home! Let me know how it goes...

English

429

Dominik Lukes@techczech·1d

The first thing an art critic has to learn is how not to feel shame.

Jediwolf@Jediwolf

What happens when you post a real Monet and say it’s AI? The coolest art social experiment I’ve seen in a while. Thank you @SHL0MS

English

162

Dominik Lukes@techczech·1d

I still remember how magical o3 felt. Now, I wouldn't touch it: "Within a year, Mythos will probably look quite dumb (relative to other new models). And others may release openly available or unguardrailed models of Mythos-level capabilities."

Logan Graham@logangraham

A lot of people have been wondering about Mythos, Glasswing, and the vulns we / our partners are fixing. Today, I’m excited for us to start sharing more. (For context, I lead Glasswing @AnthropicAI.) Two independent evaluations this week—from XBOW and the UK AISI—confirm what we've been seeing internally: Claude Mythos Preview is a step change in autonomous cybersecurity capabilities. We need to start preparing fast for a world of models with this level of capabilities. The UK AI Security Institute tested the model we shipped at the launch of Project Glasswing and found Mythos Preview is the first model to solve both of their end-to-end cyber ranges, including one (Cooling Tower) which no model had ever cleared. But attackers (and defenders) have sophistication & cost constraints – Mythos is also the only model that clears every one of their tasks estimated over 8 hours under their deliberately low 2.5M-token cap. XBOW tested it on their offensive security benchmarks, finding "token-for-token, unprecedented precision." It's the only model to succeed at subtle V8 sandbox work. Other Glasswing partners shared similar stories. In a few weeks of testing, Mythos Preview has helped them find many thousands of (estimated) high + critical severity vulnerabilities, sometimes double what they'd normally find in a year. I don't share this to boost Mythos. In fact, this is not about Mythos. It’s about preparing for the coming world of models being better, faster, cheaper, and more creative than some of the best human experts at dual use capabilities. Clearly, we need them supporting defenders as widely as can be done safely – and especially the least resourced ones. Within a year, Mythos will probably look quite dumb (relative to other new models). And others may release openly available or unguardrailed models of Mythos-level capabilities. We started Project Glasswing because capabilities like Mythos Preview's won't stay rare, or stay in careful hands. We are bringing it to defenders as fast as we responsibly can, while working to figure out, for example, the right safeguards and patching & disclosure processes. Also, to be clear, compute has never been a limiter in our rollout. Expect a fuller update on our Glasswing work in the coming days. XBOW report: xbow.com/blog/mythos-of… UK AISI report: aisi.gov.uk/blog/how-fast-…

English

Dominik Lukes@techczech·1d

The new AI generated code is solving old problems that were previously not economically valuable enough to devote scarce development resources to. The productivity impact of these will be gradual and cumulative. But also what software is will change.

François Chollet@fchollet

The quantity of code that devs ship has roughly 10xed. But net developer productivity (value created by unit of time) is only up by a bit, if at all. Part of it is that the additional code is solving more incremental problems. A bigger part is that the new code is creating problems of its own.

English

Dominik Lukes@techczech·1d

@CCguerilla What I meant is that it's not automatically a reasonable objection. But most importantly, it's often a political objection masquerading as an ethical one. Plus, everyone knows that this is true because we routinely do not accept all ethical objections as reasonable.

English

Kane Murdoch@CCguerilla·3d

@techczech Hmmmm, disagree. In my mind it's the best example of a reasonable objection.

English

Dominik Lukes@techczech·3d

Ethical objection does not mean a reasonable objection.

English

Dominik Lukes@techczech·1d

@arckollect @ClaudeDevs I cancelled my Claude sub I had dedicated to Nanoclaw last month. Not worth it.

English

150

Arck@arckollect·2d

@ClaudeDevs breaking down what this actually means here x.com/arckollect/sta…

Arck@arckollect

x.com/i/article/2054…

English

47.3K

ClaudeDevs@ClaudeDevs·2d

Starting June 15, paid Claude plans can claim a dedicated monthly credit for programmatic usage. The credit covers usage of: - Claude Agent SDK - claude -p - Claude Code GitHub Actions - Third-party apps built on the Agent SDK

English

1.3K

12.4K

10M

Dominik Lukes@techczech·2d

Nobody seems to be talking about cognitive decoupling anymore, but I suspect differences in levels of willingness to engage in it are behind some of the biggest disagreements around AI, today.

English

Dominik Lukes retweetledi

Séb Krier@sebkrier·3d

If anyone builds it, everyone thrives. Over the past decade, a lot of important work on AI alignment has focused on avoiding harm. But freedom from harm isn't the same as freedom to flourish. In this paper, we introduce 'Positive Alignment'. A positively aligned agent is one that helps us navigate our own value trade-offs, builds our resilience, and acts as a scaffold for human flourishing. Doing this without slipping into top-down, technocratic paternalism is the great design challenge of our time. We think a lot more research is now needed to explore this frontier: how do we align models that actively help us thrive? Amazing work by @RubenLaukkonen, @drmichaellevin, @weballergy, @verena_rieser, @AdamCElwood, @996roma, @FranklinMatija, @shamilch, @_fernando_rosas, @scychan_brains, @matybohacek, @sudoraohacker, and others. arxiv.org/abs/2605.10310

English

224

1.1K

299.9K

Dominik Lukes@techczech·2d

Good riddance.

Ethan Mollick@emollick

The silent removal of Study Mode from ChatGPT is a big mistake (both Claude and Gemini still have theirs) We have enough evidence that using AI in assistant mode to study can hurt learning because it just gives you answers, making students think they learned when they have not. You can prompt the model to be a very good tutor, but most people don't know to do that. Study mode was an easy option that parents and teachers could suggest to mitigate negative effects, even if it wasn't perfect. OpenAI still has a page about it, and the link activates study mode but otherwise there seems to be no way to select it from a menu for most accounts. openai.com/index/chatgpt-… (Deleted this by accident, sorry, so reposted!)

English

113

Keşfet

@akoustov @Makuh90 @UIEnthusiasts @raycast @thomaspaulmann @ps73yk @Dimillian @elonmusk