Stefan Heidekrüger eu/acc 🇪🇺

658 posts

Stefan Heidekrüger eu/acc 🇪🇺

@hdkrgr

AI | Algorithmic Game Theory | Multi-Agent RL . eu/acc

Munich, Bavaria Katılım Eylül 2014

1.6K Takip Edilen206 Takipçiler

Stefan Heidekrüger eu/acc 🇪🇺 retweetledi

Michael Straka@mstrakastrak·5h

An important and underrated skill - learning Latin so you can mog your coworkers by prompting Claude like a wizard

English

693

35.4K

Stefan Heidekrüger eu/acc 🇪🇺 retweetledi

Aaron Levie@levie·2d

CEOs are uniquely prone to AI psychosis because they’re sufficiently distant from the last mile of work that still has to happen to generate most value with AI. So when they play with AI, they see the happy path results, often not considering the next 10 or 20 things that have to happen to get sustainable results from agents. “Look I made this awesome product prototype”. Yes but you didn’t have to review the code before it went into production and fix a bunch of issues. “Look I generated a contract”. Yes but you didn’t verify all the terms before it goes out to the counterparty and didn’t have to wire up all the past contracts to work with. The best thing you can do as a CEO is to use AI a *ton* to figure out the real implications of agents in the enterprise, and come out the other side with an appreciation for both the upside and the real work that goes into them.

Michal Malewicz@michalmalewicz

CEOs are the most delusional about AI. Detached from reality.

English

289

707

6.7K

1.1M

Stefan Heidekrüger eu/acc 🇪🇺 retweetledi

Kane 謝凱堯@kane·3d

This is the author who thrust AI water hysteria into the mainstream by overestimating data center water use by 100,000% in her book Empire of AI by mixing up units. She is a source of wild misinformation. Imagine writing an Econ book on the premise that minimum wage is $7,250/hr

Karen Hao@_KarenHao

On the one-year anniversary of EMPIRE OF AI, I am so, so excited to announce The AI Resist List, a new project that documents examples of resistance to the AI empires around the world. airesistlist.org

English

120

646

4.9K

464.5K

Stefan Heidekrüger eu/acc 🇪🇺 retweetledi

Lucas Beyer (bl16)@giffmana·4d

Can't believe we're getting this before GTA 6

CHRIS FIRST@chrisfirst

I uploaded a screenshot of Google Maps to Gemini Omni with a route drawn on it. Then I prompted it to create a first person view of someone driving a taxi cab along the route in the reference image. Pretty close to the real thing.

English

1.6K

222.3K

Stefan Heidekrüger eu/acc 🇪🇺 retweetledi

will depue@willdepue·5d

bro it isn’t generally intelligent bro its only read every book and paper ever written and just making connections between them bro. its only thinking for twenty hours bro it’s just brute force thinking bro. its only solving erdos problems bro it could never be an accountant bro

English

146

559

7.8K

531.2K

Stefan Heidekrüger eu/acc 🇪🇺@hdkrgr·18 May

@mrexits No tip???

English

prayingforexits 🏴‍☠️@mrexits·18 May

Vibe coded my first project today! Let me know what you guys think ❤️

English

641

39.9K

2.2M

Stefan Heidekrüger eu/acc 🇪🇺@hdkrgr·13 May

@reach_vb 🙋‍♂️

QME

Vaibhav (VB) Srivastav@reach_vb·13 May

putting together a group chat for Codex power users in London / Europe who are the biggest ballers around?

English

251

313

105.5K

Stefan Heidekrüger eu/acc 🇪🇺@hdkrgr·25 Nis

@Schmidtlepp

Stefan Heidekrüger eu/acc 🇪🇺 tweet media

QME

Christopher Lauer™@Schmidtlepp·24 Nis

Ein Fall für Aiman Abdallah und das Galileo-Mystery-Team.

DER SPIEGEL@derspiegel

Preisanstiege zwischen 14 und mehr als 16 Cent pro Liter: Am Freitag sind die Kosten für Benzin und Diesel so stark gewachsen wie bei keinem Mittagssprung zuvor. Die Preisexplosion droht den Tankrabatt aufzufressen. #ref=rss" target="_blank" rel="nofollow noopener">spiegel.de/mobilitaet/spr…

Deutsch

452

22.6K

Stefan Heidekrüger eu/acc 🇪🇺@hdkrgr·24 Nis

@johnjhorton I suspect that something similar is going on with the auctioneer, the bidders, the intended viewer of the image, and the person promoting the LLM. When told to “submit a bid”, the perspective is clear and actions are rational.

English

Stefan Heidekrüger eu/acc 🇪🇺@hdkrgr·24 Nis

@johnjhorton …: especially when developing agentic systems themselves, the coding agents often write amazing code, but completely fail to reason about _who_ a certain artifact (code comment, copy in reports, docs) is for. (The user? The developer? Other agents? Future developer agents?) /2

English

John Horton@johnjhorton·23 Nis

If you have AI agents simulate bidders in an art auction, you'll get the auction theory predictions almost exactly (and has been true for years); asking it to one-shot draw the progression of such an auction, it gets the details wrong

English

9.7K

Stefan Heidekrüger eu/acc 🇪🇺@hdkrgr·22 Nis

@justalexoki My Claude is equally stuck, but has absolutely zero problems letting the competition take over …

English

167

taoki@justalexoki·21 Nis

i once got hard stuck with claude so i asked him to write a brief i could send to gpt instead and he lead with "HA, brutal!" and then he was super catty for the rest of the convo

English

168

6.6K

Stefan Heidekrüger eu/acc 🇪🇺 retweetledi

Chubby♨️@kimmonismus·17 Nis

I've now spent several hours using Opus 4.7 and comparing it to 4.6, and it's like night and day for me. Opus 4.7 feels like a disgruntled employee whose results you can't judge and have to check afterward. The trust you had with 4.6 is gone. It's like hiring a new employee who had excellent grades in their application but is totally sloppy and disgruntled in practice and doesn't follow instructions. The consequence: fire them. So, for now, I'm going back to 4.6. Seriously: did not expect such release from Anthropic. Biggest win for OpenAI was Anthropics Opus release.

Chubby♨️@kimmonismus

ok wtf, i say it. give me back 4.6 what the heck is this sh*t. The more i use 4.7 the more annoyed i am. this is such a rushed release.

English

1.1K

147.7K

Stefan Heidekrüger eu/acc 🇪🇺@hdkrgr·13 Nis

@benedictk__ @Muhtasham9 Looking forward to round 2! Great vibes last month

English

187

Benedict Kerres@benedictk__·13 Nis

Munich builders ☕️🇩🇪 Meet OpenAI and Codex at our event for coffee, conversation, and coworking. Show your work, meet the team, and spend the day with us. Potential bonus: Branded Codex coffee beans as gifts (collector item)! cc. @Muhtasham9 for organizing you are a rock star!

English

155

14.3K

Stefan Heidekrüger eu/acc 🇪🇺@hdkrgr·3 Mar

@kanavtwt

QME

398

kanav@kanavtwt·2 Mar

incredible things are happening on linkedin

English

240

548

11.3K

1.2M

Stefan Heidekrüger eu/acc 🇪🇺 retweetledi

Chubby♨️@kimmonismus·2 Mar

Claude found out about iran in real time. (but does it know it was him that was used in the attacks?)

English

235

8.6K

364.5K

Stefan Heidekrüger eu/acc 🇪🇺 retweetledi

Roberto@RobJ02·28 Şub

@tszzl’s tweets, now deleted, seemingly minutes before learning of OpenAI’s deal with the DoD. See specifically the second

English

129

285.5K

Stefan Heidekrüger eu/acc 🇪🇺 retweetledi

Chubby♨️@kimmonismus·25 Şub

I love Karpathy's posts because they're so on point. He's not only a leading expert in his field, but he also manages to capture the zeitgeist with his statements. But this post is particularly impactful. Since December, (agentic) coding has undergone a significant transformation, one could even say a qualitative leap. Before, it was a matter of iterative improvements, but since the end of last year, it has demonstrated its true value in a completely different way. Or, in Karpathy's words: "It is hard to communicate how much programming has changed due to AI in the last 2 months: not gradually and over time in the "progress as usual" way, but specifically this last December. There are a number of asterisks but imo coding agents basically didn’t work before December and basically work since. (...) As a result, programming is becoming unrecognizable. You’re not typing computer code into an editor like the way things were since computers were invented, that era is over. You're spinning up AI agents, giving them tasks *in English* and managing and reviewing their work in parallel." Two points on this that need to be repeated again and again because they are often still misunderstood. 1) The very basic truth: this is the worst it will ever be. From here on out, things will get better. Even if the status quo were to remain as it is, it would be serious. But what we have today is the worst it will ever be. 2) The pace of progress is constantly increasing. It is exponential. And that's the crucial point: from December to February, more happened than in a very long time. And this trajectory will likely (almost certainly) continue. If points 1) and 2) are true, it is simply impossible to foresee and predict how this will affect society and all essential areas. As much as I welcome and approve of this, the near future is unpredictable. That's all I wanted to say.

Andrej Karpathy@karpathy

It is hard to communicate how much programming has changed due to AI in the last 2 months: not gradually and over time in the "progress as usual" way, but specifically this last December. There are a number of asterisks but imo coding agents basically didn’t work before December and basically work since - the models have significantly higher quality, long-term coherence and tenacity and they can power through large and long tasks, well past enough that it is extremely disruptive to the default programming workflow. Just to give an example, over the weekend I was building a local video analysis dashboard for the cameras of my home so I wrote: “Here is the local IP and username/password of my DGX Spark. Log in, set up ssh keys, set up vLLM, download and bench Qwen3-VL, set up a server endpoint to inference videos, a basic web ui dashboard, test everything, set it up with systemd, record memory notes for yourself and write up a markdown report for me”. The agent went off for ~30 minutes, ran into multiple issues, researched solutions online, resolved them one by one, wrote the code, tested it, debugged it, set up the services, and came back with the report and it was just done. I didn’t touch anything. All of this could easily have been a weekend project just 3 months ago but today it’s something you kick off and forget about for 30 minutes. As a result, programming is becoming unrecognizable. You’re not typing computer code into an editor like the way things were since computers were invented, that era is over. You're spinning up AI agents, giving them tasks *in English* and managing and reviewing their work in parallel. The biggest prize is in figuring out how you can keep ascending the layers of abstraction to set up long-running orchestrator Claws with all of the right tools, memory and instructions that productively manage multiple parallel Code instances for you. The leverage achievable via top tier "agentic engineering" feels very high right now. It’s not perfect, it needs high-level direction, judgement, taste, oversight, iteration and hints and ideas. It works a lot better in some scenarios than others (e.g. especially for tasks that are well-specified and where you can verify/test functionality). The key is to build intuition to decompose the task just right to hand off the parts that work and help out around the edges. But imo, this is nowhere near "business as usual" time in software.

English

1.1K

223.1K

Stefan Heidekrüger eu/acc 🇪🇺@hdkrgr·20 Şub

@atelicinvest Also: - Read that “90% of GenAI use cases fail” headline. - IT said we can’t send our data to an API, but we could host llama 3.1 internally. Cool,how much worse can it be? - employees prefer ChatGPT w/o to llama w/ company context - I guess AI isn’t there yet.

English

895

Unemployed Capital Allocator@atelicinvest·20 Şub

Your median decision maker, btw - uses chatgpt to look up stuff - may have tried Claude cowork if they're adventurous - swamped at their job - been told that this ai stuff is gonna solve all our problems - see random posts of ppl supposedly doing cool stuff on LinkedIn / twitter / reddit - barely had enough time to read the posts - confused by what is even possible and what's available - what is this claw thing everyone is talking about - why arent my employees 10x more productive like they're supposed to - God damnit I really need to check this out + Told Steve to do some research on it and write up a report. He gave me something that's like 50 pages and I don't have time to go through it - wsj says goldman got bunch of ppl from Claude to put their back office out of work - are all my competitors adopting this? - am I gonna get out gunned in 6 months? etc

English

1.4K

136.3K

Keşfet

@mrexits @reach_vb @Schmidtlepp @johnjhorton @justalexoki @benedictk__ @Muhtasham9 @kanavtwt