Parth Thaker

4.2K posts

Parth Thaker banner
Parth Thaker

Parth Thaker

@ParthThaker15

PhD in Aerospace Engineering, IIT Kharagpur

Kharagpur, India Katılım Aralık 2018
226 Takip Edilen102 Takipçiler
Parth Thaker
Parth Thaker@ParthThaker15·
@flappyairplanes Congrats guys. This!!! Would love to contribute in paid/unpaid ways if remote work is acceptable.
English
0
0
0
18
Flapping Airplanes
Flapping Airplanes@flappyairplanes·
Announcing Flapping Airplanes! We’ve raised $180M from GV, Sequoia, and Index to assemble a new guard in AI: one that imagines a world where models can think at human level without ingesting half the internet.
GIF
English
339
258
3.6K
2.1M
Parth Thaker retweetledi
Pau Labarta Bajo
Pau Labarta Bajo@paulabartabajo_·
Real-time audio transcription. Entirely on-device. This hands-on tutorial shows you how to build it from scratch. No cloud dependencies, no API calls, complete privacy, using LFM2-Audio-1.5B by @liquidai Enjoy ↓ github.com/Liquid4All/coo…
English
17
116
783
50.5K
Sam Bhagwat 🇸🇬 AIE Singapore
icymi we wrote a new agents book: patterns for building ai agents it has everything you need to take your agents from prototype to production, like agent design patterns, the basics of security, etc reply to this tweet with BOOK and we'll dm you so you can get a copy
Sam Bhagwat 🇸🇬 AIE Singapore tweet media
English
5.3K
748
8K
1.1M
Pavel S Lelyukh
Pavel S Lelyukh@PavsProjects·
@prajdabre You don’t necessarily need recommendations. There’s a tutoring platform and if you can game it you can make $70/h but yea Ivy League probably required at that level.
English
4
0
23
14.5K
Raj Dabre
Raj Dabre@prajdabre·
Ok since many of you asked here is some more info: 1. You can't become a tutor if you're a rando. They almost always accept IVY League students/graduates or the equivalent. Preference is for native English speaking people but flexible if you have top credentials. So unless you are from the top 0.01-0.1% of your class/peer group, it's best if you don't daydream. 2. UAE and Saudi are not the only places. China also has its fair share of this market. Singapore too. Basically any country with ungodly rich people will have this situation. 3. You don't simply tap into the market. It's mostly based on recommendations. Also it's a heavily gatekept market. Good luck!
Raj Dabre@prajdabre

There's an obscure market involving tutoring rich kids from UAE and Saudi. You can work 10 hours a week and make as much money as an entry level SWE in FAANG.

English
17
45
1K
313.5K
Rakshit (chessiro.com)
Rakshit (chessiro.com)@Ra1kshit·
Super interested to put together a group of people that understand chess and tech well. Comment below if you are one, or i know someone in this intersection.
English
529
32
1.3K
115.6K
Parth Thaker retweetledi
Gukesh D
Gukesh D@DGukesh·
What a moment for Indian cricket! Huge congratulations to the Indian Women’s Team on winning the World Cup. Absolute champions! 🏆👏
Gukesh D tweet media
English
98
1.3K
22.9K
163K
Parth Thaker retweetledi
Ashwin 🇮🇳
Ashwin 🇮🇳@ashwinravi99·
From being left out of the team earlier in the World Cup to taking your team to the final. Take a Bow Jemimah 👏
English
270
4K
54K
511.5K
Parth Thaker
Parth Thaker@ParthThaker15·
This repo has everything - tokenization - pretraining - finetuning - evaluation - inference - web serving. Thanks to @karpathy
Andrej Karpathy@karpathy

Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single, dependency-minimal codebase. You boot up a cloud GPU box, run a single script and in as little as 4 hours later you can talk to your own LLM in a ChatGPT-like web UI. It weighs ~8,000 lines of imo quite clean code to: - Train the tokenizer using a new Rust implementation - Pretrain a Transformer LLM on FineWeb, evaluate CORE score across a number of metrics - Midtrain on user-assistant conversations from SmolTalk, multiple choice questions, tool use. - SFT, evaluate the chat model on world knowledge multiple choice (ARC-E/C, MMLU), math (GSM8K), code (HumanEval) - RL the model optionally on GSM8K with "GRPO" - Efficient inference the model in an Engine with KV cache, simple prefill/decode, tool use (Python interpreter in a lightweight sandbox), talk to it over CLI or ChatGPT-like WebUI. - Write a single markdown report card, summarizing and gamifying the whole thing. Even for as low as ~$100 in cost (~4 hours on an 8XH100 node), you can train a little ChatGPT clone that you can kind of talk to, and which can write stories/poems, answer simple questions. About ~12 hours surpasses GPT-2 CORE metric. As you further scale up towards ~$1000 (~41.6 hours of training), it quickly becomes a lot more coherent and can solve simple math/code problems and take multiple choice tests. E.g. a depth 30 model trained for 24 hours (this is about equal to FLOPs of GPT-3 Small 125M and 1/1000th of GPT-3) gets into 40s on MMLU and 70s on ARC-Easy, 20s on GSM8K, etc. My goal is to get the full "strong baseline" stack into one cohesive, minimal, readable, hackable, maximally forkable repo. nanochat will be the capstone project of LLM101n (which is still being developed). I think it also has potential to grow into a research harness, or a benchmark, similar to nanoGPT before it. It is by no means finished, tuned or optimized (actually I think there's likely quite a bit of low-hanging fruit), but I think it's at a place where the overall skeleton is ok enough that it can go up on GitHub where all the parts of it can be improved. Link to repo and a detailed walkthrough of the nanochat speedrun is in the reply.

English
0
0
0
24
Parth Thaker
Parth Thaker@ParthThaker15·
@paraschopra Working (in terms of Sadhana) on realising the nature of consciousness (experientially, not as a concept or perception). Happy to discuss.
English
0
0
0
47
Parth Thaker retweetledi
ChessBase India
ChessBase India@ChessbaseIndia·
Twelve Indian athletes, including Indian Chessboxing Champion Sneha Waykar, were selected to represent India at the World Chessboxing Championship in Serbia, which starts tomorrow. But they are facing a major hurdle just a day before the tournament. Their visa applications are still “under process.” They have received no response from the authorities!
ChessBase India tweet media
English
11
43
323
21.6K
neural nets.
neural nets.@cneuralnetwork·
public question to all i want to go down two paths in the future. 1. a pre-doc 2. a masters (outside india) i mainly work in llm posttraining + machine translation in low resource settings have 1 A paper rn + (2 more this year upcoming) can someone guide me on some good options for me + when should i ideally start applying?
English
15
7
262
23.8K
Parth Thaker retweetledi
Sadhguru
Sadhguru@SadhguruJV·
Mukti is not for those who seek an escape. Mukti is for those who are overflowing with Fulfillment. #SadhguruQuotes
Sadhguru tweet media
English
207
986
2.7K
33.7K
Parth Thaker retweetledi
Ananda
Ananda@_anandaonly·
Stop imagining
Ananda tweet media
English
5
55
446
8.5K
Parth Thaker
Parth Thaker@ParthThaker15·
@miniapeur Force is rate of change of momentum. (NS is also derived using this)
English
0
0
0
62
Mathieu
Mathieu@miniapeur·
What is the most interesting differential equation? And why?
English
41
4
168
22.5K
Parth Thaker
Parth Thaker@ParthThaker15·
@cneuralnetwork Hey Make sure table has depth also, so that monitor is at enough of a distance. Otherwise it will invite headaches (atleast it happened in my case) Then I bought a 1400 Rs table from local shop (2.5*4 feet)
English
0
0
2
461
neural nets.
neural nets.@cneuralnetwork·
i need tables like this (<6k) ~ will host my monitor,laptop, keyboard, mouse suggest
neural nets. tweet media
English
39
4
211
21.2K
Parth Thaker
Parth Thaker@ParthThaker15·
@cneuralnetwork If coloring is random then accuracy should drop. We are adding an additional feature (color) which has no useful information. So Model should struggle. If model is very good - at best it may learn to ignore that feature and acc dips just a little? interesting experiment.
English
0
0
0
67
neural nets.
neural nets.@cneuralnetwork·
weird idea so the core concept of MNIST is that it kind of understands the pattern the number is in, like circle and a line might be a 9 what if we make the task harder like instead of black and white we introduce random coloured pictures like some numbers will be in red some numbers will be green and so on will it affect the accuracy in any aspect really fun idea at the middle of the night let's try this tomorrow morning I'll let you know the result
English
12
0
81
9.2K