Official Avonews

518 posts

Official Avonews banner
Official Avonews

Official Avonews

@avonewsonline

Student reporting on all things Avonworth High School

Pittsburgh, PA Katılım Mart 2012
40 Takip Edilen168 Takipçiler
Official Avonews
Official Avonews@avonewsonline·
@HenningSittler @karpathy Just be careful in Pittsburgh with the vernacular here. JI instead of AI might spawn a lovingly ironic T-Shirt or two with some locals (Jagoffs)
English
0
0
0
6
Andrej Karpathy
Andrej Karpathy@karpathy·
Jagged Intelligence The word I came up with to describe the (strange, unintuitive) fact that state of the art LLMs can both perform extremely impressive tasks (e.g. solve complex math problems) while simultaneously struggle with some very dumb problems. E.g. example from two days ago - which number is bigger, 9.11 or 9.9? Wrong. x.com/karpathy/statu… or failing to play tic-tac-toe: making non-sensical decisions: x.com/polynoamial/st… or another common example, failing to count, e.g. the number of times the letter "r" occurs in the word "barrier", ChatGPT-4o claims it's 2: x.com/karpathy/statu… The same is true in other modalities. State of the art LLMs can reasonably identify thousands of species of dogs or flowers, but e.g. can't tell if two circles overlap: x.com/fly51fly/statu… Jagged Intelligence. Some things work extremely well (by human standards) while some things fail catastrophically (again by human standards), and it's not always obvious which is which, though you can develop a bit of intuition over time. Different from humans, where a lot of knowledge and problem solving capabilities are all highly correlated and improve linearly all together, from birth to adulthood. Personally I think these are not fundamental issues. They demand more work across the stack, including not just scaling. The big one I think is the present lack of "cognitive self-knowledge", which requires more sophisticated approaches in model post-training instead of the naive "imitate human labelers and make it big" solutions that have mostly gotten us this far. For an example of what I'm talking about, see Llama 3.1 paper section on mitigating hallucinations: x.com/karpathy/statu… For now, this is something to be aware of, especially in production settings. Use LLMs for the tasks they are good at but be on a lookout for jagged edges, and keep a human in the loop.
Andrej Karpathy tweet mediaAndrej Karpathy tweet mediaAndrej Karpathy tweet mediaAndrej Karpathy tweet media
English
213
396
3.4K
407.8K
Official Avonews
Official Avonews@avonewsonline·
#ReadAcrossAmerica @spj_tweets Read Across America - Avonews style! Reading through decades of student journalism with today's HS students. 2005: AHS Pirates fans say they can't win with Bob Nutting 2026: AHS Pirates fans say they can't win with Bob Nutting
Official Avonews tweet media
English
0
0
1
40
Official Avonews
Official Avonews@avonewsonline·
@thegarrettscott How much electricity does one of these replies require and is there a systematic way to replenish the natural resources this program uses so that there is an equal or larger amount of them at the end of a given day?
English
0
0
0
17
Garrett Scott 🕳
Garrett Scott 🕳@thegarrettscott·
I just subscribed to OpenAI's $200/month subscription. Reply with questions to ask it and I will repost them in this thread.
Garrett Scott 🕳 tweet media
English
774
473
10.5K
2.1M
Official Avonews
Official Avonews@avonewsonline·
@keeradwulit @AvonworthHigh Thank you Catrina, Laurel, Matthew, and Rohini. That's a quality last school day ever at AHS... finding out your staff earned First Place/Special Merit and Most Outstanding from High School Digital Newspaper {American Scholastic Press Association}!
Official Avonews tweet media
English
1
0
6
116