Jesse Zhou

856 posts

Jesse Zhou banner
Jesse Zhou

Jesse Zhou

@_jezhou

@meet_cocoon engineering, ex-@gustohq. INFP-T (thoughts and views are my own)

San Francisco, CA Katılım Temmuz 2011
430 Takip Edilen116 Takipçiler
Jesse Zhou retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
My pleasure to come on Dwarkesh last week, I thought the questions and conversation were really good. I re-watched the pod just now too. First of all, yes I know, and I'm sorry that I speak so fast :). It's to my detriment because sometimes my speaking thread out-executes my thinking thread, so I think I botched a few explanations due to that, and sometimes I was also nervous that I'm going too much on a tangent or too deep into something relatively spurious. Anyway, a few notes/pointers: AGI timelines. My comments on AGI timelines looks to be the most trending part of the early response. This is the "decade of agents" is a reference to this earlier tweet x.com/karpathy/statu… Basically my AI timelines are about 5-10X pessimistic w.r.t. what you'll find in your neighborhood SF AI house party or on your twitter timeline, but still quite optimistic w.r.t. a rising tide of AI deniers and skeptics. The apparent conflict is not: imo we simultaneously 1) saw a huge amount of progress in recent years with LLMs while 2) there is still a lot of work remaining (grunt work, integration work, sensors and actuators to the physical world, societal work, safety and security work (jailbreaks, poisoning, etc.)) and also research to get done before we have an entity that you'd prefer to hire over a person for an arbitrary job in the world. I think that overall, 10 years should otherwise be a very bullish timeline for AGI, it's only in contrast to present hype that it doesn't feel that way. Animals vs Ghosts. My earlier writeup on Sutton's podcast x.com/karpathy/statu… . I am suspicious that there is a single simple algorithm you can let loose on the world and it learns everything from scratch. If someone builds such a thing, I will be wrong and it will be the most incredible breakthrough in AI. In my mind, animals are not an example of this at all - they are prepackaged with a ton of intelligence by evolution and the learning they do is quite minimal overall (example: Zebra at birth). Putting our engineering hats on, we're not going to redo evolution. But with LLMs we have stumbled by an alternative approach to "prepackage" a ton of intelligence in a neural network - not by evolution, but by predicting the next token over the internet. This approach leads to a different kind of entity in the intelligence space. Distinct from animals, more like ghosts or spirits. But we can (and should) make them more animal like over time and in some ways that's what a lot of frontier work is about. On RL. I've critiqued RL a few times already, e.g. x.com/karpathy/statu… . First, you're "sucking supervision through a straw", so I think the signal/flop is very bad. RL is also very noisy because a completion might have lots of errors that might get encourages (if you happen to stumble to the right answer), and conversely brilliant insight tokens that might get discouraged (if you happen to screw up later). Process supervision and LLM judges have issues too. I think we'll see alternative learning paradigms. I am long "agentic interaction" but short "reinforcement learning" x.com/karpathy/statu…. I've seen a number of papers pop up recently that are imo barking up the right tree along the lines of what I called "system prompt learning" x.com/karpathy/statu… , but I think there is also a gap between ideas on arxiv and actual, at scale implementation at an LLM frontier lab that works in a general way. I am overall quite optimistic that we'll see good progress on this dimension of remaining work quite soon, and e.g. I'd even say ChatGPT memory and so on are primordial deployed examples of new learning paradigms. Cognitive core. My earlier post on "cognitive core": x.com/karpathy/statu… , the idea of stripping down LLMs, of making it harder for them to memorize, or actively stripping away their memory, to make them better at generalization. Otherwise they lean too hard on what they've memorized. Humans can't memorize so easily, which now looks more like a feature than a bug by contrast. Maybe the inability to memorize is a kind of regularization. Also my post from a while back on how the trend in model size is "backwards" and why "the models have to first get larger before they can get smaller" x.com/karpathy/statu… Time travel to Yann LeCun 1989. This is the post that I did a very hasty/bad job of describing on the pod: x.com/karpathy/statu… . Basically - how much could you improve Yann LeCun's results with the knowledge of 33 years of algorithmic progress? How constrained were the results by each of algorithms, data, and compute? Case study there of. nanochat. My end-to-end implementation of the ChatGPT training/inference pipeline (the bare essentials) x.com/karpathy/statu… On LLM agents. My critique of the industry is more in overshooting the tooling w.r.t. present capability. I live in what I view as an intermediate world where I want to collaborate with LLMs and where our pros/cons are matched up. The industry lives in a future where fully autonomous entities collaborate in parallel to write all the code and humans are useless. For example, I don't want an Agent that goes off for 20 minutes and comes back with 1,000 lines of code. I certainly don't feel ready to supervise a team of 10 of them. I'd like to go in chunks that I can keep in my head, where an LLM explains the code that it is writing. I'd like it to prove to me that what it did is correct, I want it to pull the API docs and show me that it used things correctly. I want it to make fewer assumptions and ask/collaborate with me when not sure about something. I want to learn along the way and become better as a programmer, not just get served mountains of code that I'm told works. I just think the tools should be more realistic w.r.t. their capability and how they fit into the industry today, and I fear that if this isn't done well we might end up with mountains of slop accumulating across software, and an increase in vulnerabilities, security breaches and etc. x.com/karpathy/statu… Job automation. How the radiologists are doing great x.com/karpathy/statu… and what jobs are more susceptible to automation and why. Physics. Children should learn physics in early education not because they go on to do physics, but because it is the subject that best boots up a brain. Physicists are the intellectual embryonic stem cell x.com/karpathy/statu… I have a longer post that has been half-written in my drafts for ~year, which I hope to finish soon. Thanks again Dwarkesh for having me over!
Dwarkesh Patel@dwarkesh_sp

The @karpathy interview 0:00:00 – AGI is still a decade away 0:30:33 – LLM cognitive deficits 0:40:53 – RL is terrible 0:50:26 – How do humans learn? 1:07:13 – AGI will blend into 2% GDP growth 1:18:24 – ASI 1:33:38 – Evolution of intelligence & culture 1:43:43 - Why self driving took so long 1:57:08 - Future of education Look up Dwarkesh Podcast on YouTube, Apple Podcasts, Spotify, etc. Enjoy!

English
577
2K
16.9K
4.1M
Jesse Zhou retweetledi
Chris Bakke
Chris Bakke@ChrisJBakke·
*open app* "We've just raised a $50M pre-seed to help your toaster talk to your microwave." "We just raised a $230M pre-pre seed to agenticly agent your AI agents." "I'm 4 and I just dropped out of preschool to go all-in on AI -enabled candles." *close app*
English
209
465
6.8K
254.3K
dunkey
dunkey@vgdunkey·
Our first videogame is out today! It's good i promise!
dunkey tweet media
English
439
1.9K
38.1K
1.2M
Jesse Zhou
Jesse Zhou@_jezhou·
@REI your coupons aren't working for the MEMBER24 sale and I'm not going to check out until the price reflects the applied coupon on the web page :/
English
1
0
2
70
Jesse Zhou
Jesse Zhou@_jezhou·
@perplexity_ai @arcinternet the arc + perplexity search engine default is broken... every search query just brings me to the perplexity homepage. just fyi
English
1
0
1
81
Jesse Zhou
Jesse Zhou@_jezhou·
@browsercompany datadog not working in arc but working in chrome... i just updated arc. not acceptable for a work browser
Jesse Zhou tweet media
English
0
0
0
87
Jesse Zhou retweetledi
A24
A24@A24·
💔
A24 tweet media
QME
97
6.1K
47.2K
1.6M
Eric Jang
Eric Jang@ericjang11·
My book, "AI is Good for You", is finally out! I've been working on this (slowly) for the last 3 years. It covers the last decade of progress in AI and 6 ingredients I think are important to build towards increasingly general AI systems. evjang.com/book/
Eric Jang tweet media
English
76
131
1K
218.1K
Jesse Zhou
Jesse Zhou@_jezhou·
super fascinating in-depth look at the writer's strike. worth watching if you want to see the deep ripples that technology has on jobs/careers and the creative industry as a whole youtube.com/watch?v=ILaU78…
YouTube video
YouTube
English
0
0
0
81
Jesse Zhou
Jesse Zhou@_jezhou·
Going to start thinking about advertising with other platforms because clearly using every means I have available to try to contact a real person to help me fix this is not possible. Except maybe complaining publicly on Twitter. We'll see
English
0
0
0
48
Jesse Zhou
Jesse Zhou@_jezhou·
Also all of your help articles just tell me to go back to account quality, which I can never get past due to the 500s. I have no idea how to contact your support team because there's just layers of layers of help articles / redirections to account quality. Can't figure it out
English
1
0
0
64
Jesse Zhou
Jesse Zhou@_jezhou·
@facebook your account quality feature is so broken - I've been stuck on this step for days and whatever endpoint you guys use to go to the next part of the flow 500s. Very frustrating - literally trying to buy ads / give Meta money but it seems like yall don't want it
Jesse Zhou tweet mediaJesse Zhou tweet media
English
3
0
0
149
Jesse Zhou
Jesse Zhou@_jezhou·
pretty sure facebook for business + trying to figure out why my account is restricted from ad spending is the worst online experience i've had in my life
English
0
0
0
79
Jesse Zhou
Jesse Zhou@_jezhou·
It was really fun to try out a small side project that wasn't related to coding and more on the XFN aspects of business that I'm not really good at. Needless to say, I'm still gonna try to make this a thing, so expect to see more of this lolbrand soon. Please buy a shirt!! Thx 😄
English
0
0
0
50
Jesse Zhou
Jesse Zhou@_jezhou·
I made a joke at work awhile ago and I thought it was so dumb and good that I tried this weekend to see if I could spin-up a lifestyle brand around it with merch. It turns out Etsy + POD services like Printify are scarily good and now it's a thing. etsy.com/shop/Dehydrati…
English
1
0
1
89