Shikhar Gupta

47 posts

@shik1470

Applied Scientist at Amazon. LinkedIn: https://t.co/doKsmKQ2Jy

Seattle, USA · Joined November 2015
538 Following · 250 Followers
Shikhar Gupta retweeted
Jeremy Howard @jeremyphoward
For those that hope (or worry) that LLMs will do breakthrough scientific research, I've got good (or bad) news: LLMs are particularly, exceedingly, marvellously ill-suited to this task. (if you're a researcher, you'll have noticed this already) Here's why🧵
Shikhar Gupta retweeted
Andrej Karpathy @karpathy
New 3h31m video on YouTube: "Deep Dive into LLMs like ChatGPT"
This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It covers the full training stack of how the models are developed, along with mental models of how to think about their "psychology", and how to get the best use of them in practical applications.
We cover all the major stages:
1. pretraining: data, tokenization, Transformer neural network I/O and internals, inference, GPT-2 training example, Llama 3.1 base inference examples
2. supervised finetuning: conversations data, "LLM Psychology": hallucinations, tool use, knowledge/working memory, knowledge of self, models need tokens to think, spelling, jagged intelligence
3. reinforcement learning: practice makes perfect, DeepSeek-R1, AlphaGo, RLHF.
I designed this video for the "general audience" track of my videos, which I believe are accessible to most people, even without technical background. It should give you an intuitive understanding of the full training pipeline of LLMs like ChatGPT, with many examples along the way, and maybe some ways of thinking around current capabilities, where we are, and what's coming.
(Also, I have one "Intro to LLMs" video already from ~a year ago, but that is just a re-recording of a random talk, so I wanted to loop around and do a much more comprehensive version of this topic. They can still be combined, as the talk goes a lot deeper into other topics, e.g. LLM OS and LLM Security.)
Hope it's fun & useful! youtube.com/watch?v=7xTGNN…
RPO Ghaziabad @rpoghaziabad
@shik1470 @passportsevamea You are required to visit the Passport Office (Hapur Chungi, Kamla Nehru Nagar, Ghaziabad) from Monday to Thursday between 10 AM and 1 PM and meet the PA section.
Shikhar Gupta @shik1470
@rpoghaziabad @passportsevamea The passport renewal application for Umang Gupta, with file number GZ1078709139623, was submitted on 19/10/2023 and, unfortunately, it has been under review since then. Could you provide an update on when we can receive the passport?
Shikhar Gupta retweeted
Terence Parr @the_antlr_guy
Working on slides to teach #machinelearning this Fall for @usfca_msds. Here is the (1D) visual difference between the effects of L1 and L2 regularization on the training loss function. Regularization pushes the loss up and the minimizing parameter (beta) towards the origin. That's why betas are constrained.
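A rough way to reproduce the 1D picture numerically is sketched below. This is not the code behind the original GIF: the quadratic loss, the target value 2.0, the penalty weight `lam`, and all function names are assumptions chosen for illustration.

```python
# Toy 1D illustration of L1 vs L2 regularization (hypothetical numbers).

def loss(beta, target=2.0):
    """Unregularized training loss: a parabola with its minimum at `target`."""
    return (beta - target) ** 2

def penalized_loss(beta, lam, kind):
    """Loss plus an L1 (lam * |beta|) or L2 (lam * beta**2) penalty."""
    if kind == "l1":
        penalty = abs(beta)
    elif kind == "l2":
        penalty = beta ** 2
    else:
        raise ValueError("kind must be 'l1' or 'l2'")
    return loss(beta) + lam * penalty

def argmin_on_grid(objective, lo=-4.0, hi=4.0, steps=8001):
    """Brute-force minimizer over an evenly spaced grid of beta values."""
    grid = [lo + i * (hi - lo) / (steps - 1) for i in range(steps)]
    return min(grid, key=objective)

if __name__ == "__main__":
    lam = 1.0
    beta_plain = argmin_on_grid(loss)
    beta_l1 = argmin_on_grid(lambda b: penalized_loss(b, lam, "l1"))
    beta_l2 = argmin_on_grid(lambda b: penalized_loss(b, lam, "l2"))
    print("no penalty:", round(beta_plain, 3))
    print("L1 penalty:", round(beta_l1, 3))
    print("L2 penalty:", round(beta_l2, 3))
```

In this toy setup the penalty term is never negative, so the penalized curve sits at or above the original loss everywhere, and with `lam = 1.0` the minimizing beta moves from about 2.0 to about 1.5 (L1) or about 1.0 (L2), that is, towards the origin.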
Shikhar Gupta retweeted
Chip Huyen @chipro
I'm working on a book on machine learning interviews so I've been spending the last few months talking to companies about their hiring process for ML roles. This thread is a summary of what I've learned. It will be updated as the book progresses. (1/n)
Shikhar Gupta retweeted
Kerem Turgutlu @KeremTurgutlu
medium.com/@keremturgutlu… Here, I tried to explain the building blocks of the SOTA ULMFiT model. What is an AWD-LSTM? How is dropout used everywhere? What is a QRNN and why might it be better? ...I also used Excel spreadsheets to simplify things in a different way :)
Shikhar Gupta retweeted
Jeremy Howard @jeremyphoward
Practical Deep Learning for Coders, 2019 edition, will be released tomorrow. It's looking amazing.
Shikhar Gupta retweeted
🔭 Rajat @rajat9393
Anyone who has studied at Bansal Classes (Kota) would agree that their cycle parking needed this!