Stefan Heidekrüger eu/acc 🇪🇺

658 posts

Stefan Heidekrüger eu/acc 🇪🇺

Stefan Heidekrüger eu/acc 🇪🇺

@hdkrgr

AI | Algorithmic Game Theory | Multi-Agent RL . eu/acc

Munich, Bavaria Katılım Eylül 2014
1.6K Takip Edilen206 Takipçiler
Stefan Heidekrüger eu/acc 🇪🇺 retweetledi
Michael Straka
Michael Straka@mstrakastrak·
An important and underrated skill - learning Latin so you can mog your coworkers by prompting Claude like a wizard
Michael Straka tweet media
English
15
33
693
35.4K
Stefan Heidekrüger eu/acc 🇪🇺 retweetledi
Aaron Levie
Aaron Levie@levie·
CEOs are uniquely prone to AI psychosis because they’re sufficiently distant from the last mile of work that still has to happen to generate most value with AI. So when they play with AI, they see the happy path results, often not considering the next 10 or 20 things that have to happen to get sustainable results from agents. “Look I made this awesome product prototype”. Yes but you didn’t have to review the code before it went into production and fix a bunch of issues. “Look I generated a contract”. Yes but you didn’t verify all the terms before it goes out to the counterparty and didn’t have to wire up all the past contracts to work with. The best thing you can do as a CEO is to use AI a *ton* to figure out the real implications of agents in the enterprise, and come out the other side with an appreciation for both the upside and the real work that goes into them.
Michal Malewicz@michalmalewicz

CEOs are the most delusional about AI. Detached from reality.

English
289
707
6.7K
1.1M
Stefan Heidekrüger eu/acc 🇪🇺 retweetledi
Kane 謝凱堯
This is the author who thrust AI water hysteria into the mainstream by overestimating data center water use by 100,000% in her book Empire of AI by mixing up units. She is a source of wild misinformation. Imagine writing an Econ book on the premise that minimum wage is $7,250/hr
Kane 謝凱堯 tweet media
Karen Hao@_KarenHao

On the one-year anniversary of EMPIRE OF AI, I am so, so excited to announce The AI Resist List, a new project that documents examples of resistance to the AI empires around the world. airesistlist.org

English
120
646
4.9K
464.5K
Stefan Heidekrüger eu/acc 🇪🇺 retweetledi
will depue
will depue@willdepue·
bro it isn’t generally intelligent bro its only read every book and paper ever written and just making connections between them bro. its only thinking for twenty hours bro it’s just brute force thinking bro. its only solving erdos problems bro it could never be an accountant bro
English
146
559
7.8K
531.2K
Vaibhav (VB) Srivastav
Vaibhav (VB) Srivastav@reach_vb·
putting together a group chat for Codex power users in London / Europe who are the biggest ballers around?
English
251
4
313
105.5K
Stefan Heidekrüger eu/acc 🇪🇺
@johnjhorton I suspect that something similar is going on with the auctioneer, the bidders, the intended viewer of the image, and the person promoting the LLM. When told to “submit a bid”, the perspective is clear and actions are rational.
English
0
0
1
25
Stefan Heidekrüger eu/acc 🇪🇺
@johnjhorton …: especially when developing agentic systems themselves, the coding agents often write amazing code, but completely fail to reason about _who_ a certain artifact (code comment, copy in reports, docs) is for. (The user? The developer? Other agents? Future developer agents?) /2
English
1
0
1
44
John Horton
John Horton@johnjhorton·
If you have AI agents simulate bidders in an art auction, you'll get the auction theory predictions almost exactly (and has been true for years); asking it to one-shot draw the progression of such an auction, it gets the details wrong
John Horton tweet mediaJohn Horton tweet media
English
2
1
25
9.7K
taoki
taoki@justalexoki·
i once got hard stuck with claude so i asked him to write a brief i could send to gpt instead and he lead with "HA, brutal!" and then he was super catty for the rest of the convo
English
5
1
168
6.6K
Stefan Heidekrüger eu/acc 🇪🇺 retweetledi
Chubby♨️
Chubby♨️@kimmonismus·
I've now spent several hours using Opus 4.7 and comparing it to 4.6, and it's like night and day for me. Opus 4.7 feels like a disgruntled employee whose results you can't judge and have to check afterward. The trust you had with 4.6 is gone. It's like hiring a new employee who had excellent grades in their application but is totally sloppy and disgruntled in practice and doesn't follow instructions. The consequence: fire them. So, for now, I'm going back to 4.6. Seriously: did not expect such release from Anthropic. Biggest win for OpenAI was Anthropics Opus release.
Chubby♨️@kimmonismus

ok wtf, i say it. give me back 4.6 what the heck is this sh*t. The more i use 4.7 the more annoyed i am. this is such a rushed release.

English
95
57
1.1K
147.7K
Benedict Kerres
Benedict Kerres@benedictk__·
Munich builders ☕️🇩🇪 Meet OpenAI and Codex at our event for coffee, conversation, and coworking. Show your work, meet the team, and spend the day with us. Potential bonus: Branded Codex coffee beans as gifts (collector item)! cc. @Muhtasham9 for organizing you are a rock star!
English
21
6
155
14.3K
kanav
kanav@kanavtwt·
incredible things are happening on linkedin
kanav tweet media
English
240
548
11.3K
1.2M
Stefan Heidekrüger eu/acc 🇪🇺 retweetledi
Chubby♨️
Chubby♨️@kimmonismus·
Claude found out about iran in real time. (but does it know it was him that was used in the attacks?)
Chubby♨️ tweet media
English
77
235
8.6K
364.5K
Stefan Heidekrüger eu/acc 🇪🇺 retweetledi
Roberto
Roberto@RobJ02·
@tszzl’s tweets, now deleted, seemingly minutes before learning of OpenAI’s deal with the DoD. See specifically the second
Roberto tweet mediaRoberto tweet media
English
25
129
2K
285.5K
Stefan Heidekrüger eu/acc 🇪🇺 retweetledi
Chubby♨️
Chubby♨️@kimmonismus·
I love Karpathy's posts because they're so on point. He's not only a leading expert in his field, but he also manages to capture the zeitgeist with his statements. But this post is particularly impactful. Since December, (agentic) coding has undergone a significant transformation, one could even say a qualitative leap. Before, it was a matter of iterative improvements, but since the end of last year, it has demonstrated its true value in a completely different way. Or, in Karpathy's words: "It is hard to communicate how much programming has changed due to AI in the last 2 months: not gradually and over time in the "progress as usual" way, but specifically this last December. There are a number of asterisks but imo coding agents basically didn’t work before December and basically work since. (...) As a result, programming is becoming unrecognizable. You’re not typing computer code into an editor like the way things were since computers were invented, that era is over. You're spinning up AI agents, giving them tasks *in English* and managing and reviewing their work in parallel." Two points on this that need to be repeated again and again because they are often still misunderstood. 1) The very basic truth: this is the worst it will ever be. From here on out, things will get better. Even if the status quo were to remain as it is, it would be serious. But what we have today is the worst it will ever be. 2) The pace of progress is constantly increasing. It is exponential. And that's the crucial point: from December to February, more happened than in a very long time. And this trajectory will likely (almost certainly) continue. If points 1) and 2) are true, it is simply impossible to foresee and predict how this will affect society and all essential areas. As much as I welcome and approve of this, the near future is unpredictable. That's all I wanted to say.
Andrej Karpathy@karpathy

It is hard to communicate how much programming has changed due to AI in the last 2 months: not gradually and over time in the "progress as usual" way, but specifically this last December. There are a number of asterisks but imo coding agents basically didn’t work before December and basically work since - the models have significantly higher quality, long-term coherence and tenacity and they can power through large and long tasks, well past enough that it is extremely disruptive to the default programming workflow. Just to give an example, over the weekend I was building a local video analysis dashboard for the cameras of my home so I wrote: “Here is the local IP and username/password of my DGX Spark. Log in, set up ssh keys, set up vLLM, download and bench Qwen3-VL, set up a server endpoint to inference videos, a basic web ui dashboard, test everything, set it up with systemd, record memory notes for yourself and write up a markdown report for me”. The agent went off for ~30 minutes, ran into multiple issues, researched solutions online, resolved them one by one, wrote the code, tested it, debugged it, set up the services, and came back with the report and it was just done. I didn’t touch anything. All of this could easily have been a weekend project just 3 months ago but today it’s something you kick off and forget about for 30 minutes. As a result, programming is becoming unrecognizable. You’re not typing computer code into an editor like the way things were since computers were invented, that era is over. You're spinning up AI agents, giving them tasks *in English* and managing and reviewing their work in parallel. The biggest prize is in figuring out how you can keep ascending the layers of abstraction to set up long-running orchestrator Claws with all of the right tools, memory and instructions that productively manage multiple parallel Code instances for you. The leverage achievable via top tier "agentic engineering" feels very high right now. It’s not perfect, it needs high-level direction, judgement, taste, oversight, iteration and hints and ideas. It works a lot better in some scenarios than others (e.g. especially for tasks that are well-specified and where you can verify/test functionality). The key is to build intuition to decompose the task just right to hand off the parts that work and help out around the edges. But imo, this is nowhere near "business as usual" time in software.

English
29
63
1.1K
223.1K
Stefan Heidekrüger eu/acc 🇪🇺
@atelicinvest Also: - Read that “90% of GenAI use cases fail” headline. - IT said we can’t send our data to an API, but we could host llama 3.1 internally. Cool,how much worse can it be? - employees prefer ChatGPT w/o to llama w/ company context - I guess AI isn’t there yet.
English
0
1
3
895
Unemployed Capital Allocator
Unemployed Capital Allocator@atelicinvest·
Your median decision maker, btw - uses chatgpt to look up stuff - may have tried Claude cowork if they're adventurous - swamped at their job - been told that this ai stuff is gonna solve all our problems - see random posts of ppl supposedly doing cool stuff on LinkedIn / twitter / reddit - barely had enough time to read the posts - confused by what is even possible and what's available - what is this claw thing everyone is talking about - why arent my employees 10x more productive like they're supposed to - God damnit I really need to check this out + Told Steve to do some research on it and write up a report. He gave me something that's like 50 pages and I don't have time to go through it - wsj says goldman got bunch of ppl from Claude to put their back office out of work - are all my competitors adopting this? - am I gonna get out gunned in 6 months? etc
English
28
46
1.4K
136.3K