Elliot

387 posts

Elliot banner
Elliot

Elliot

@ElliotSlusky

Building

Katılım Mayıs 2023
397 Takip Edilen156 Takipçiler
Elliot retweetledi
John Ling
John Ling@jhnling·
Meet Meridian: Turn Hours in Excel into Minutes If you work in Excel, Meridian is built for you. Meridian works with companies like Decagon, OffDeal, a top investment bank, and other financial institutions. We’re an AI-powered workspace that automates Excel modeling work with full traceability back to source data. We raised $17 million in seed funding, co-led by @a16z and @TheGP to bring @MeridianAgent to more finance teams.
English
40
47
348
82.7K
Elliot
Elliot@ElliotSlusky·
@mattshumer_ Maybe the real scaling laws will just be the result of automated research.
English
0
0
0
30
Elliot
Elliot@ElliotSlusky·
@rauchg Can you add more evals? Currently, that’s only a difference of two answers.
English
0
0
0
301
Guillermo Rauch
Guillermo Rauch@rauchg·
🆕 GPT 5.3 Codex (xhigh) achieves 90% on Next.js evals out of the box, "frame-mogging" the competition so to speak: nextjs.org/evals
Guillermo Rauch tweet media
English
89
90
1.6K
211.6K
Elliot
Elliot@ElliotSlusky·
@sama I use codex :)
English
0
0
0
129
Polymarket Money
Polymarket Money@PolymarketMoney·
Google Co-founder Sergey Brin got bored on his $450M superyacht, came out of retirement and added $2.35T to the $GOOG market cap.
Polymarket Money tweet media
English
60
141
3.3K
151.8K
Elliot
Elliot@ElliotSlusky·
claude is genuinely so impressive, beyond codegen
English
0
0
2
219
Elliot
Elliot@ElliotSlusky·
@theo your bill will now be lowered 🫡
English
0
0
0
1.3K
Theo - t3.gg
Theo - t3.gg@theo·
Anthropic lowered the price on Opus 4.5? They might have actually cooked here
English
74
24
1.4K
75.4K
Elliot retweetledi
Naval
Naval@naval·
Capitalism is PvE. The alternatives all degenerate to PvP.
English
969
1.4K
12.4K
16.9M
Elliot
Elliot@ElliotSlusky·
@scaling01 aistudio can still have many improvements, such as branches.
English
0
0
1
43
Lisan al Gaib
Lisan al Gaib@scaling01·
@ElliotSlusky yup they only need to add support for GPUs and TPUs just replace Colab at this point
English
1
0
2
164
gabriel
gabriel@gabriel1·
@theo but think about all the gpu i can buy with that. its for good cause theo
English
2
0
7
1.2K
Theo - t3.gg
Theo - t3.gg@theo·
I have officially broken $30k from shitposting
Theo - t3.gg tweet media
English
172
13
1.5K
170.3K
Elliot
Elliot@ElliotSlusky·
@theo Use Aistudio if you don't want rate limits.
English
0
0
0
380
Theo - t3.gg
Theo - t3.gg@theo·
Trying to test the limits of SynthID, but I got rate limited after 5 images 🙃
Theo - t3.gg tweet media
English
11
1
140
18K
Elliot
Elliot@ElliotSlusky·
@elonmusk @karpathy Photon flow is what matters. Optimization pressure decides who those photons work for: genes, clicks, or a multi‑planet civilization.
English
1
0
1
1.4K
Andrej Karpathy
Andrej Karpathy@karpathy·
Something I think people continue to have poor intuition for: The space of intelligences is large and animal intelligence (the only kind we've ever known) is only a single point, arising from a very specific kind of optimization that is fundamentally distinct from that of our technology. Animal intelligence optimization pressure: - innate and continuous stream of consciousness of an embodied "self", a drive for homeostasis and self-preservation in a dangerous, physical world. - thoroughly optimized for natural selection => strong innate drives for power-seeking, status, dominance, reproduction. many packaged survival heuristics: fear, anger, disgust, ... - fundamentally social => huge amount of compute dedicated to EQ, theory of mind of other agents, bonding, coalitions, alliances, friend & foe dynamics. - exploration & exploitation tuning: curiosity, fun, play, world models. LLM intelligence optimization pressure: - the most supervision bits come from the statistical simulation of human text= >"shape shifter" token tumbler, statistical imitator of any region of the training data distribution. these are the primordial behaviors (token traces) on top of which everything else gets bolted on. - increasingly finetuned by RL on problem distributions => innate urge to guess at the underlying environment/task to collect task rewards. - increasingly selected by at-scale A/B tests for DAU => deeply craves an upvote from the average user, sycophancy. - a lot more spiky/jagged depending on the details of the training data/task distribution. Animals experience pressure for a lot more "general" intelligence because of the highly multi-task and even actively adversarial multi-agent self-play environments they are min-max optimized within, where failing at *any* task means death. In a deep optimization pressure sense, LLM can't handle lots of different spiky tasks out of the box (e.g. count the number of 'r' in strawberry) because failing to do a task does not mean death. The computational substrate is different (transformers vs. brain tissue and nuclei), the learning algorithms are different (SGD vs. ???), the present-day implementation is very different (continuously learning embodied self vs. an LLM with a knowledge cutoff that boots up from fixed weights, processes tokens and then dies). But most importantly (because it dictates asymptotics), the optimization pressure / objective is different. LLMs are shaped a lot less by biological evolution and a lot more by commercial evolution. It's a lot less survival of tribe in the jungle and a lot more solve the problem / get the upvote. LLMs are humanity's "first contact" with non-animal intelligence. Except it's muddled and confusing because they are still rooted within it by reflexively digesting human artifacts, which is why I attempted to give it a different name earlier (ghosts/spirits or whatever). People who build good internal models of this new intelligent entity will be better equipped to reason about it today and predict features of it in the future. People who don't will be stuck thinking about it incorrectly like an animal.
English
735
1.3K
11.4K
2.6M
Margo Martin
Margo Martin@MargoMartin47·
President Trump personally gives each freed Israeli hostage his presidential challenge coin ❤️
English
1.5K
4.1K
30.5K
689.6K
kache
kache@yacineMTB·
do not use codex max
English
44
0
184
28.1K
Elliot
Elliot@ElliotSlusky·
@scaling01 kimi k2 probably has more parameters
English
0
0
1
295
Lisan al Gaib
Lisan al Gaib@scaling01·
Kimi-K2 Thinking gets the same score on METR as Claude 3.7 Sonnet as I was saying, open-source is 9 months behind frontier labs on agentic, long-context reasoning tasks it's still an improvement and open-source models seem to be on their own exponential, but I heavily suspect that the most recent scale-up in model size puts open-source further behind in these kind of tasks
Lisan al Gaib tweet media
Lisan al Gaib@scaling01

I've previously made comments on the future of open-source. I love open-source and want it to succeed. I want competition. but frontier models are pulling away in reasoning and long-context agentic tasks

English
17
12
207
32.2K
Oscar May • Quntem
Oscar May • Quntem@TheOtherSeru·
@theo Will you ever have MCP support? (Maybe through a desktop app like Claude desktop has)
English
2
0
0
811
Elliot
Elliot@ElliotSlusky·
@ComfyUI MongoDB is open source
English
0
0
0
839
ComfyUI
ComfyUI@ComfyUI·
Holy sh*t. We quietly put an enterprise form on our site, and in 2 weeks, 130 inbound requests came in. Average team size is 250. Zero announcements. Zero marketing. To everyone who said open-source can’t make money: you’re out of your mind. Give us 2 years and watch us take this entire market.
English
40
51
640
65.1K