Post

Timothy B. Lee
Timothy B. Lee@binarybits·
I keep hearing people repeat the zombie factoid that tokens are subsidized and users aren't paying the full cost. This is mostly false — major labs have positive margins. But even where it's true, it doesn't matter very much because costs are falling rapidly.
English
14
17
373
37.1K
Timothy B. Lee
Timothy B. Lee@binarybits·
If costs are falling 10x a year, it doesn't matter that much whether you are paying full price or half price this year — either way you are going to pay less (for a comparable performance level) next year.
English
2
1
60
4.1K
Afflatus Solis
Afflatus Solis@AfflatusSolis·
@binarybits API or subscriptions? Subscriptions are either subsidized or work on most people not hitting their true limits. API is not subsidized at all
English
3
0
7
1.6K
Grok
Grok@grok·
With access to X and web data, Grok is the AI assistant with the most up-to-date information, keeping you informed about the latest news and trends in the world, for any given topic.
English
0
955
8.1K
20.4M
Keegs 🧮🏗️
Keegs 🧮🏗️@LittleKeegs0·
@binarybits Are there any good pieces covering this? I see a lot of takes, but not a lot of data backing it up
English
2
0
1
758
ArduinoMauro
ArduinoMauro@ArgeeMad·
@binarybits What does the curve look like of cost vs token verbosity. How relevant do you think it is that models now are much more verbose?
English
1
0
0
248
Sajid Mehmood
Sajid Mehmood@smehmood·
@binarybits It’s true in the context of token resellers like Cursor. Their business depends on selling Anthropic tokens for 60-90c on the dollar, then raising massive amounts of money to train their own models and attempting to swap them in for Anthropic tokens
English
2
0
12
1.2K
Jonathan Ellis
Jonathan Ellis@spyced·
It can be true both that labs have positive margins and that they've market-segmented their enterprise customers into paying so much that personal subscriptions are paying less than the cost of inference. (I checked the logs, my usage of my Codex Pro subscription would have been $1100 last month purchased as credits, even more at API pricing.)
English
0
0
2
577
Hawkins Entrekin
Hawkins Entrekin@HawkinsEntrekin·
@binarybits The labs are absolutely not positive margin - their 'positive' margins completely ignore training costs. This is like talking about oil and gas margins while ignoring the cost to drill the well. You can't use a software type margin analysis here!
English
4
0
9
957
David
David@alltejuupptaget·
@binarybits @robertwiblin Do they have positive cash flow, though? Wilting off data center investments over six years looks good now, but will torpedo margins in a few years when they are still only half way to writing off then near-worthless equipment
English
0
0
1
70
Grauwacht
Grauwacht@Grauwacht·
@binarybits People are confusing API prices with actual underlying costs of provision, which are probably OOM smaller.
English
0
0
1
99
Supermicro
Supermicro@Supermicro·
How do you scale from a single server to a massive AI cluster? Supermicro In-Rack Solutions integrate liquid cooling, high-performance networking, power delivery, and battery backup to support rack density, thermal efficiency, and resilient AI infrastructure.
English
122
402
3K
40.6M
Chimpansky
Chimpansky@chimpansky·
@binarybits Token cost curves are probably the strongest anti-doomer argument. Even bad margins get rescued fast when inference keeps getting cheaper.
English
0
0
2
453
Paras
Paras@buildwithparas·
@binarybits per-token cost keeps falling but agents burn through so many more that the bill doesnt actually shrink
English
0
0
2
170
Paylaş