ohnotheirtokenlimit

749 posts

ohnotheirtokenlimit

ohnotheirtokenlimit

@tokenlimitto0

Melbourne, Victoria Katılım Mart 2025
73 Takip Edilen11 Takipçiler
ohnotheirtokenlimit
ohnotheirtokenlimit@tokenlimitto0·
@J_Von_Random @danluu Based on understanding the architecture of a transformer model, and seeing the sheer amount of recursion that is needed to have an LLM operate.
English
0
0
0
13
ThePrimeagen
ThePrimeagen@ThePrimeagen·
Honestly why stop at 100x engineer? Just use more agents, you literally could be 1000x, 10000x, 100000x just by scaling You could what you use to in an entire year in one second
English
229
97
3K
71.4K
viktorg
viktorg@viktorg475·
@danluu Yes its obvious its very profitable to sell inference. There are literal publicly traded bitcoin mining companies rebranding as inference providers... **because its more profitable to sell inference than mine shitcoins**
English
2
0
2
275
Dan Luu
Dan Luu@danluu·
Why are so many people so sure that the big AI providers are losing money on inference? It reminds me of the comments about how Uber can never make money. Their unit economics were fine and they were only losing money because they chose to do so on customer acquisition.
Dan Luu tweet media
English
55
14
274
19.6K
Thomas Ip
Thomas Ip@_thomasip·
@danluu Inference is firmly profitable. The costs comes from training larger and better models to stay ahead of the competition.
English
1
0
1
325
Ryan Topps
Ryan Topps@RyanJTopps·
@danluu Open AI’s operating margin is -122% so yeah it’s confirmed at this point
English
1
0
1
98
ohnotheirtokenlimit
ohnotheirtokenlimit@tokenlimitto0·
@CosmicSenate @Fooney_ @SocialistMormon @danluu I’ll tell you now, enterprise customers are billed higher, but the cost is still heavily subsidised. They need enterprise to have no choice but use LLMs, THEN the prices go to what they need for a heavy profit margin.
English
0
0
0
4
ohnotheirtokenlimit
ohnotheirtokenlimit@tokenlimitto0·
@jintaeghong @danluu This is the raw details that they don’t look into. Once you build even a small model (not run one, BUILD one with PyTorch), you realise just how many calcs are being ran for inferring one token. Then look at the number of tokens being spat out in LLMs and agent flows. It’s insane
English
0
0
0
5
28291odjwk
28291odjwk@jintaeghong·
@danluu If you know the mechanism of the transformer model, you'll understand why it cost more than human when it comes to replacing human labor
English
1
0
1
284
LandonCryptoExplr
LandonCryptoExplr@LandonExplr·
@danluu AWS was 'unprofitable' for years too. Inference unit economics are fine at reasonable utilization. The loss claims always price hardware per query instead of amortizing capex over years.
English
1
0
2
742
ohnotheirtokenlimit
ohnotheirtokenlimit@tokenlimitto0·
And at that point, general estimates seem to be suggesting the costs will 40x to consumers of tokens. They are running them cheap, like Uber did, to capture market and make it hard to leave. This is also why they try to stop ppl becoming engineers; it makes LLMs the only choice
English
0
0
0
6
ohnotheirtokenlimit
ohnotheirtokenlimit@tokenlimitto0·
“How can they be so sure they are losing money on inference? Uber lost money too, to capture market” Two points: 1. You say they cant be losing money but then explained that they are, but its for a reason 2. We agree with you. Once they get market share the token subsidy vanishes
Dan Luu@danluu

Why are so many people so sure that the big AI providers are losing money on inference? It reminds me of the comments about how Uber can never make money. Their unit economics were fine and they were only losing money because they chose to do so on customer acquisition.

English
1
0
0
24
TradingCardNV
TradingCardNV@TradingCardNV·
@Polymarket I want andreesen to get liquidated sooo fucking bad. Wipe that smug smile off his egg head face.
English
1
0
2
1.4K
Polymarket
Polymarket@Polymarket·
NEW: Marc Andreessen declares AGI was achieved three months ago.
English
499
386
4.9K
1M