Chess Stetson

63 posts

@ChessStetson

Joined July 2015
66 Following · 87 Followers
Chess Stetson retweeted
Andrej Karpathy@karpathy·
In era of pretraining, what mattered was internet text. You'd primarily want a large, diverse, high quality collection of internet documents to learn from.
In era of supervised finetuning, it was conversations. Contract workers are hired to create answers for questions, a bit like what you'd see on Stack Overflow / Quora, or etc., but geared towards LLM use cases.
Neither of the two above are going away (imo), but in this era of reinforcement learning, it is now environments. Unlike the above, they give the LLM an opportunity to actually interact - take actions, see outcomes, etc. This means you can hope to do a lot better than statistical expert imitation. And they can be used both for model training and evaluation. But just like before, the core problem now is needing a large, diverse, high quality set of environments, as exercises for the LLM to practice against.
In some ways, I'm reminded of OpenAI's very first project (gym), which was exactly a framework hoping to build a large collection of environments in the same schema, but this was way before LLMs. So the environments were simple academic control tasks of the time, like cartpole, ATARI, etc. The @PrimeIntellect environments hub (and the `verifiers` repo on GitHub) builds the modernized version specifically targeting LLMs, and it's a great effort/idea. I pitched that someone build something like it earlier this year: x.com/karpathy/statu…
Environments have the property that once the skeleton of the framework is in place, in principle the community / industry can parallelize across many different domains, which is exciting.
Final thought - personally and long-term, I am bullish on environments and agentic interactions but I am bearish on reinforcement learning specifically. I think that reward functions are super sus, and I think humans don't use RL to learn (maybe they do for some motor tasks etc, but not intellectual problem solving tasks). Humans use different learning paradigms that are significantly more powerful and sample efficient and that haven't been properly invented and scaled yet, though early sketches and ideas exist (as just one example, the idea of "system prompt learning", moving the update to tokens/contexts not weights and optionally distilling to weights as a separate process a bit like sleep does).
Prime Intellect@PrimeIntellect

Introducing the Environments Hub. RL environments are the key bottleneck to the next wave of AI progress, but big labs are locking them down. We built a community platform for crowdsourcing open environments, so anyone can contribute to open-source AGI.
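A minimal sketch of what "environments in the same schema" could look like, assuming a reset/step interface with text observations and text actions. Every name here (Environment, GuessNumberEnv, run_episode, make_bisect_policy) is a hypothetical illustration, not the API of gym, the `verifiers` repo, or the Environments Hub.

```python
from dataclasses import dataclass
from typing import Callable, Protocol


class Environment(Protocol):
    """Hypothetical shared schema: each environment exposes reset() and step()."""

    def reset(self) -> str: ...

    def step(self, action: str) -> tuple[str, float, bool]: ...


@dataclass
class GuessNumberEnv:
    """Toy environment (an assumption for illustration): guess a hidden integer."""

    target: int = 7
    max_turns: int = 5
    turns: int = 0

    def reset(self) -> str:
        self.turns = 0
        return "Guess an integer between 1 and 10."

    def step(self, action: str) -> tuple[str, float, bool]:
        # Returns (observation, reward, done) so the agent sees the outcome of its action.
        self.turns += 1
        out_of_turns = self.turns >= self.max_turns
        try:
            guess = int(action.strip())
        except ValueError:
            return "Not an integer.", 0.0, out_of_turns
        if guess == self.target:
            return "Correct.", 1.0, True
        hint = "higher" if guess < self.target else "lower"
        return f"Try {hint}.", 0.0, out_of_turns


def make_bisect_policy(low: int = 1, high: int = 10) -> Callable[[str], str]:
    """Stand-in for an LLM: narrows the range using the hint in each observation."""
    state = {"low": low, "high": high, "last": low}

    def policy(observation: str) -> str:
        if "higher" in observation:
            state["low"] = state["last"] + 1
        elif "lower" in observation:
            state["high"] = state["last"] - 1
        state["last"] = (state["low"] + state["high"]) // 2
        return str(state["last"])

    return policy


def run_episode(env: Environment, policy: Callable[[str], str]) -> float:
    """Interaction loop: act, observe the outcome, accumulate reward until done."""
    observation = env.reset()
    total_reward, done = 0.0, False
    while not done:
        action = policy(observation)
        observation, reward, done = env.step(action)
        total_reward += reward
    return total_reward
```

Because run_episode depends only on the shared interface, the same loop could serve both training and evaluation across many different environments, which is the parallelization the tweet points at.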

Chess Stetson@ChessStetson·
Please record more of it for me to work out to. So that _I_ can get shredded.
Chess Stetson@ChessStetson·
@thefolake I couldn't believe your sound at the @GEANCOFDN gala last night! Like King Sunny Ade mixed with Animals as Leaders. You shredded.
Shaun Maguire@shaunmmaguire·
With my new boss
Chess Stetson@ChessStetson·
As we watch whatever Tesla plans to show us this evening, keep in mind that Teslas do better than humans in scenarios where drivers are inattentive, but rather worse in scenarios that need human-level perception and intuition. Try this analysis yourself with conode.ai #drisk_ai
Chess Stetson@ChessStetson·
Tesseract projection into 3D as a minimum energy soap membrane -- mad props to the Caltech CPA
Chess Stetson@ChessStetson·
@chamath The US definitely needs to focus more on scientific achievement, but science is a friendly competition. I'm at CVPR right now and seeing the amazing things Chinese (among other) researchers are doing pumps me up.
Chamath Palihapitiya@chamath·
We need to get our priorities straight. China is absolutely destroying us in discovering the future.
Chess Stetson@ChessStetson·
Oh, and happy Revenge of the 5th and Revenge of the 6th too... that's what this whole last bank holiday was for, right?
Chess Stetson@ChessStetson·
May the 4th be with you. Finally got the definitive answer to the eternal question of who's the best Star Wars character (in this cool AI tool, currently called "edge").
Chess Stetson@ChessStetson·
@wdavidmarx @TheAtlantic What you write about cultural arbitrage is, I think, what people used to just call "trade," albeit just in cool stuff in your case. But you may be hinting at something bigger: that all trade could just be in information.
W. David Marx@wdavidmarx·
I wrote a piece for @TheAtlantic about how, just like we've seen with financial arbitrage, the internet makes it a lot harder to engage in "cultural arbitrage" and why that may be contributing to the feeling of cultural stasis theatlantic.com/culture/archiv…
Chess Stetson@ChessStetson·
This was cool. The UK is a great place to build bombproof AI.
Chess Stetson@ChessStetson·
Big "Big AI" meetup today at King's Row in Old Town. Come chat about techniques for deploying AI on real business data, and socialize with practitioners. Can't wait to see ya. meetup.com/pasadena-big-d…
Chess Stetson@ChessStetson·
@nithyavraman Nice to have a roof over one's head; too bad not everyone does
Nithya Raman@nithyavraman·
In the last 24 hours, Downtown LA has seen 6.2 inches of rain — more than we’ve had in some entire years recently. And the rain is expected to continue for at least 24 hours more. This is a state of emergency — we all need to be extremely vigilant in navigating the city today.
Chess Stetson@ChessStetson·
@nithyavraman @latimes Big fan of what you're doing. IMO a focus on fixing homelessness in LA will also help us improve everything from housing affordability to mental health. You've got my vote.