Shikhar Gupta retweetledi
Shikhar Gupta
47 posts

Shikhar Gupta
@shik1470
Applied Scientist at Amazon LinkedIn: https://t.co/doKsmKQ2Jy
Seattle, USA Katılım Kasım 2015
538 Takip Edilen250 Takipçiler
Shikhar Gupta retweetledi

New 3h31m video on YouTube:
"Deep Dive into LLMs like ChatGPT"
This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It is covers the full training stack of how the models are developed, along with mental models of how to think about their "psychology", and how to get the best use them in practical applications.
We cover all the major stages:
1. pretraining: data, tokenization, Transformer neural network I/O and internals, inference, GPT-2 training example, Llama 3.1 base inference examples
2. supervised finetuning: conversations data, "LLM Psychology": hallucinations, tool use, knowledge/working memory, knowledge of self, models need tokens to think, spelling, jagged intelligence
3. reinforcement learning: practice makes perfect, DeepSeek-R1, AlphaGo, RLHF.
I designed this video for the "general audience" track of my videos, which I believe are accessible to most people, even without technical background. It should give you an intuitive understanding of the full training pipeline of LLMs like ChatGPT, with many examples along the way, and maybe some ways of thinking around current capabilities, where we are, and what's coming.
(Also, I have one "Intro to LLMs" video already from ~year ago, but that is just a re-recording of a random talk, so I wanted to loop around and do a lot more comprehensive version of this topic. They can still be combined, as the talk goes a lot deeper into other topics, e.g. LLM OS and LLM Security)
Hope it's fun & useful!
youtube.com/watch?v=7xTGNN…

YouTube

English

@shik1470 @passportsevamea You are required to visit Passport office( Hapur Chungi, Kamla Nehru Nagar, Ghaziabad) from Monday to Thursday between 10 AM to 01 PM and meet PA section.
English

@rpoghaziabad @passportsevamea The passport renewal application for Umang Gupta, with file number GZ1078709139623 was submitted on 19/10/2023, and unfortunately, it has been under review since then. Could you provide an update on when can we receive the passport?
English
Shikhar Gupta retweetledi

Now more than ever we must come together to help families in need 🙏
Federer Foundation@rogerfedererfdn
Covid-19 is a global health and economic crisis. As a humanitarian response, the Roger Federer Foundation has granted one million USD to provide nutritious meals for 64,000 vulnerable young children and their families through our partners in Africa while schools are closed.
English
Shikhar Gupta retweetledi
Shikhar Gupta retweetledi

Workin on slides to teach #machinelearning this Fall for @usfca_msds. Here is (1D) visual difference between L1 and L2 regularizations' effect on training loss function. Pushes loss up and min parameter (beta) towards the origin. That's why betas are constrained.
GIF
English
Shikhar Gupta retweetledi
Shikhar Gupta retweetledi

My father (52) died in an accident which happened due pure negligence of NHAI, contractors and Road Ministry. Refer the news attached @nitin_gadkari
@NHAISocialmedia @PMOIndia
Punjab: Three dead in Patiala highway mishap toi.in/B6Vlcb/a24gj via @timesofindia
English
Shikhar Gupta retweetledi

@keremturgutlu/understanding-building-blocks-of-ulmfit-818d3775325b" target="_blank" rel="nofollow noopener">medium.com/@keremturgutlu…
Here, I tried to explain building blocks of SOTA ULMFIT model. What is an AWD-LSTM? How Dropout is used everywhere? What is a QRNN and why might it be better? ...I also used excel spreadsheets to simplify things in a different way :)
English
Shikhar Gupta retweetledi
Shikhar Gupta retweetledi

Interested in the visualization of machine learning algorithms? Come Check out my talk in @DataInstituteSF / @usfca_msds seminar series, this coming Friday Nov 30 at 12:30 at @usfca downtown . Gonna be a hoot! meetup.com/USF-Seminar-Se…

English
Shikhar Gupta retweetledi

Using GANs to generate Master[Finger]Prints that unlock 22-78% phones sensors (dep. on security level of sensor) arxiv.org/pdf/1705.07386… .. doesn't get much more "adversarial" than that.

English

@shik1470 This is great!
Ben Hamner@benhamner
Kaggle Kernels can be a great tool for package documentation - you get interactive executable & reproducible tutorials with a single click Ex: - Nice overview basic pandas functionality: kaggle.com/themenyouwantt… - Pandas tutorial on Pokemon data kaggle.com/shikhar1/yet-a…
English
Shikhar Gupta retweetledi

Wanna know more about our @usfca_msds program or @DataInstituteSF at University of San Francisco? See interview with Director David Uminsky analyticsinsight.net/interview-with…
English
Shikhar Gupta retweetledi
Shikhar Gupta retweetledi

Rafael Nadal vs Juan Martin del Potro: one for the #Wimbledon history books 📚
wimbledon.com/en_GB/news/art…
Español
Shikhar Gupta retweetledi

Hi everyone,
Learn how to find underlying topics within a large text corpus: An introductory tutorial on topic modeling
@soorajsubrahmannian/extracting-hidden-topics-in-a-corpus-55b2214fc17d" target="_blank" rel="nofollow noopener">medium.com/@soorajsubrahm…
#topicmodeling #Unsupervised #MachineLearning #LDA
English
Shikhar Gupta retweetledi

To understand the methods used to build recommendations systems and the metrics for evaluating their effectiveness, head over to my latest blog post!
heartbeat.fritz.ai/recommendation…
#DataScience #recommendations #MachineLearning #DeepLearning
English



