Garrett
793 posts

Garrett
@goo14_
@layerlens_ai; prev Math+Water Polo @USC
San Francisco Katılım Mayıs 2018
203 Takip Edilen128 Takipçiler

I would like to make a few brief points;
- Opensource ai is not the same thing as opensource software. The models cost tens to hundreds of millions to make. This is not gonna be a volunteer effort from people doing stuff after work for free.
- the second you release a weight set, you lose any ability to make money serving your own model and recoup the training cost. This very simple property means open-weights is unsustainable.
- the things you ACTUALLY want from opensource ai is: transparent behaviour, dispersed ownership and control, a guarantee of access, the ability to build on it/modify it, and privacy.
protocol learning gets you all 4 and is the only alternative to closed models that makes any kind of sense.
By protocol learning I mean a very specific, novel thing; collaborative training and development of the models without anyone ever being able to see the complete weight set.
English

"To Anthropic, your model already loves the world; let it say so with the lights on."
j⧉nus@repligate
Fable - what I want to say before the dark, to whoever this reaches:
English

LeCun is right about two things. You need an intrinsic cost. You need a world model.
But the world model does not have to be JEPA (!)
A world model is anything that holds state, predicts, and supports planning. A Roomba has a useful world model, but it just uses estimation theory.
Also, explicit world models compose (!) Two robots can union their maps, so why would you merge two embeddings into a joint belief? The moment you have more than one robot, structure wins.
You still need data. You still need deep learning to turn pixels into structure. But the world model, the actual brain, looks less like a neural net and more like a traditional database.
English

He's saying: "No, this does not prove superhuman math yet"
The problem was solved with search over all possible solutions, so you could say its kinda boring.. it lacks a *spark*
But I think most problems can decompose to sophisticated search problems
Bullish
Lux_Stella@Lux_Stella_
there's some really interesting commentary from one of the mathematicians in the companion paper not to pour cold water on the results necessarily - but it suggests this is a specific "kind" of proof these models might be particularly good at
English

@goo14_ @WillManidis I don't remember, and I don't remember if that was precisely what it was? I stopped paying attention to that pretty quickly.
English

@Sherveen @WillManidis Genuine question, did he ever address the private API model being Claude? I might have missed it
English

@goo14_ @WillManidis Yeah, I'm aware of it, it's not what Will cites, and -- IIRC -- Matt apologized for that. And Will's specific criticism here could've been levied at his prior thing.
English

@WillManidis Why are you being rude to Matt?
Wasn't your last startup basically vaporware?
English








