Dan Luu

5.6K posts

@danluu

Active on https://t.co/WG71Nrs60M; also trying out https://t.co/fGOzbSxVHi. No longer read replies or notifications here now that tweetdeck is gated.

Joined December 2008
43 Following, 45.8K Followers
Dan Luu@danluu·
These are fairly autonomous, so I can run a bunch in parallel. If I worked for a company with "unlimited" tokens for personal projects, I could easily run multiple projects like above and spend > 1T/hr scratching personal itches. Do you think they'd let me do it? Presumably not?
Dan Luu@danluu·
project that's using about 300M/hr. That one is running on a medium sized machine (64 cores, but old/slow ones) and is rate limited by CPU. It's less clear to me how productively that one scales up, but I think 10x for sure, and possibly 100x, with some more compute.
Dan Luu@danluu·
Nowadays, I often see people say they have access to unlimited tokens. Usually for work, although in response to x.com/danluu/status/…, someone from Microsoft said they have unlimited tokens for personal projects. I wonder what this means in practice. Surely there's a limit?
Dan Luu@danluu

What do people like for token-intensive workflows for personal projects? With a semi-autonomous workflow, I can exhaust a week's quota on the $200/mo codex plan in ~10 minutes of my time. If I'm in the loop, I get ~10x(?) the productivity per token, but it's time inefficient and

Dan Luu@danluu·
The other widely cited reasoning is Garran's "AI has hit a wall" and similar. Every time people have said that, it's been wrong. E.g., Garran wrote in 10/25. Since then, on $, 5.5 >> 5.4 >> 5.3. It's improved more than enough to allow massive price increases. There was no wall.
Dan Luu@danluu·
It's not even obvious that inference should be money losing today. API cost for frontier models has gone way up and the people claiming a cost for inference are relying on made up assumptions that don't seem likely. Widely cited folks, like Ed Zitron, aren't remotely credible.
Dan Luu@danluu·
Why are so many people so sure that the big AI providers are losing money on inference? It reminds me of the comments about how Uber can never make money. Their unit economics were fine and they were only losing money because they chose to do so on customer acquisition.
Dan Luu@danluu·
Since this is a common point of confusion, 10 minutes of my time != 10 min wall clock. ChatGPT says multiple plans are allowed? chatgpt.com/c/69fe9b31-b6d…. There are stories of people getting banned, but at least it will be funny if I get banned after ChatGPT said it was allowed?
Dan Luu@danluu·
is to avoid burning a ton of money. I've messed around a bit with having a more expensive model do planning with cheaper models doing implementation, but I'm not sure the savings are there with an autonomous workflow? Different story if I'm in the loop, but that takes time.
Dan Luu@danluu·
What do people like for token-intensive workflows for personal projects? With a semi-autonomous workflow, I can exhaust a week's quota on the $200/mo codex plan in ~10 minutes of my time. If I'm in the loop, I get ~10x(?) the productivity per token, but it's time inefficient and
Dan Luu@danluu·
The security program verification link sends me to support, which sends me to the link, which sends me to support, which sends me to the link, which sends me to support, etc. This was escalated through some in-band channel, but that didn't get me out of the loop.
Dan Luu@danluu·
Can someone at OpenAI help me with an account issue? I'm doing non-security testing and constantly getting flagged for doing "cybersecurity" work. I did the approval / ID scan on my personal account, but my work account is in an infinite loop and I can't use codex/GPT at work.
Dan Luu@danluu·
Looks like I spoke too soon about the AI not being superhuman. The current Azul world champion played against it and thinks it's better than him at higher difficulties, and a top 100 player played against it at default difficulty and thinks it's better than him at default.
Dan Luu@danluu·
Not to overstate — I saw someone vibe coded the same project then declared programming dead after they finished, but their bot loses to MCTS + simple heuristic. Mine was in the same state when I just had an LLM in a loop with instructions to improve the result.
Dan Luu@danluu·
It's sort of amazing how quickly you can do things now. I wanted to try writing an AlphaZero-style AI for Azul. With no AI background, it took me maybe 2-3 hours (2-3 days wall clock) to beat the best public AI I could find, training on my laptop CPU: danluu.com/game/tile/