Sumjit

4.6K posts

Sumjit banner
Sumjit

Sumjit

@sumjitg

22 y/o. learning about ai agents and experimenting!

Katılım Kasım 2023
87 Takip Edilen552 Takipçiler
Sabitlenmiş Tweet
Sumjit
Sumjit@sumjitg·
Perplexity just launched persistent memory across sessions: the feature users begged for, but also the one that makes everyone nervous. The idea: it remembers your preferences, interests, and past convos so you stop re-explaining yourself. You can view/delete memories in settings, but some users opened theirs and found a long list of inferred habits they never explicitly shared. Useful? Yes. Creepy? Also yes. What research says: Memory-augmented LLMs aren't new. Systems like Mem0 show ~26% better accuracy and 90% token savings with good memory design. But the privacy trade-offs: aggregated memories can reveal sensitive stuff users didn't mean to expose. Perplexity's Team: Encrypted, avoidable (incognito mode exists), but deleted memories linger in logs for ~30 days and feed model training unless you opt out. Do you turn memory ON or keep it OFF?
Sumjit tweet media
Aravind Srinivas@AravSrinivas

We've been testing Memory (short-term and long-term) on Perplexity for a while. The results are great, and we are rolling it out widely. You can ask personalized questions, questions about past chats, and use any model or search mode with personal context (both apps and web).

English
6
9
175
21.4K
Sumjit
Sumjit@sumjitg·
No practice, no trial and error. It’s already there, pre-wired by evolution. Then he flips it to AI: our models start from completely random weights. Zero instincts. Everything they “know” comes from copying massive piles of internet text during training. No biology, no real-world grounding, just imitation. Karpathy basically calls LLMs “ghosts” more than animals. this whole new kind of intelligence that doesn’t grow the way living things do. Idk, it just makes me pause and think: we’re not really building digital brains that work like ours. We’re summoning something… else. Kinda eerie, kinda cool. Anyone else feel that way?
English
0
0
1
12
Sumjit
Sumjit@sumjitg·
Just watched this short clip from Dwarkesh’s podcast with Andrej Karpathy and it honestly stuck with me. He was saying how a huge chunk of animal smarts isn’t even learned: it’s straight-up baked into their DNA. Like, a baby zebra can stand up and run minutes after birth.
Sumjit tweet media
Dwarkesh Patel@dwarkesh_sp

A large portion of animal intelligence doesn't require any learning, claims @karpathy: it's baked into DNA. AI models, by contrast, start from random weights. They have to learn their intelligence, mostly by imitating the internet. This is so different that Andrej thinks it's a fundamentally different kind of intelligence: LLMs are more like ghosts than animals.

English
1
0
2
28
Sumjit
Sumjit@sumjitg·
Then the bigger question he left hanging: once we get past AGI, what keeps humans feeling useful? Creativity, judgment, meaning- those things don’t disappear, but we’ll have to rethink how we use them.
English
0
0
1
9
Sumjit
Sumjit@sumjitg·
At the same time, multimodal models and everyday tools are already getting noticeably better. The hardest part was medicine!! He talked about compressing drug discovery from years down to months or even weeks. Starting with cancers and immunology. If that actually happens, it changes everything for a lot of families.
English
1
0
1
20
Sumjit
Sumjit@sumjitg·
Just watched Dario Amodei talk about why he actually left OpenAI to start Anthropic. Two things really hit him hard back then. First, he was already seeing the scaling laws work in real life with GPT-2 in 2019. You give these models way more data and compute and boom the jumps in capability are huge. A lot of people inside and outside the company thought it was nonsense at the time, but he and a few others kept pushing the leadership: “This is gonna be massive.” Second, and this feels more personal- he realized these models were heading toward human-brain level power. If that’s happening, you can’t just talk about “doing it responsibly.” You need real, serious conviction about safety, not just nice words on a slide deck. He wasn’t convinced OpenAI had that deep-down commitment as things got serious. It’s interesting because both bets seem to be playing out now. Scaling delivered, and the culture/safety question is still the big one everyone’s wrestling with.
Sumjit tweet media
Jaynit Makwana@JaynitMakwana

Dario Amodei reveals the two convictions that made him leave OpenAI to start Anthropic "The first conviction was the scaling laws" "If you scale up models, give them more data and more compute, you find incredible increases in performance. I was finding that in 2019 with GPT-2" "There were a lot of folks inside and outside who didn't believe it at all. We made the case to leadership, this is going to be a big deal" "The second was, if these models are going to match the capability of the human brain, we better get this right" "Despite a lot of language about doing it in the right way, I was just not convinced there was a real and serious conviction to do it the right way"

English
0
1
2
64
Sumjit retweetledi
Sumjit
Sumjit@sumjitg·
Just saw Demis Hassabis push back hard on all the recent Erdős problem hype. He basically said: sure, today’s AIs are knocking out some of these tough math problems, but that’s nowhere near real AGI. It doesn’t come close to the kind of raw creative invention someone like
Sumjit tweet media
NIK@ns123abc

🚨 Google DeepMind CEO Sir Demis Hassabis: “Today’s systems, are nowhere near [AGI]. Doesn’t matter how many Erdős problems you solve… I think it’s far, far from what a true invention or someone like a Ramanujan would have been able to do” it’s over for the Erdős hype

English
23
5
63
19.4K
therootboss
therootboss@TherootB42158·
@sumjitg No blue checkmark just posting outdated bullshit for the love of the game
English
1
0
1
23
Sumjit
Sumjit@sumjitg·
@DealsForge Yess! I am also optimistic about open model for the future
English
0
0
2
12
Ucav
Ucav@DealsForge·
@sumjitg exactly. these boxes are not “kill your OpenAI bill”, they’re “stop sending every small dumb task to a frontier model”. great sidekick, bad miracle machine!!
English
1
0
1
22
Sumjit
Sumjit@sumjitg·
Worth buying if you’re into tinkering? Probably yes. Thinking it’s going to 1:1 replace ChatGPT for serious daily work? Not yet.
English
0
0
1
29
Sumjit
Sumjit@sumjitg·
Great little device if you want to experiment, keep things private, run stuff offline or build small projects/automation. It’s fun and impressive for the price. But don’t expect it to make your cloud subscription disappear in a few weeks unless your usage is mostly light tasks.
English
1
0
1
29
Sumjit
Sumjit@sumjitg·
crazy ideas one week and grinding to make it actually work the next. But I thought if calling everything “engineering” makes us focus too much on quick wins and shipping, and less on the deep questions that actually move the field forward in big ways. What do you guys think?
English
0
0
1
31
Sumjit
Sumjit@sumjitg·
them properly and publish. A lot of what we’re using today came from research done years ago. Elon: forget the fancy labels, it’s all engineering. I’ve been thinking about this and the line has blurred a lot. These days most good teams have people doing both- experimenting with
Sumjit tweet media
English
1
0
1
40