Parth Thaker

4.2K posts

Parth Thaker

@ParthThaker15

PhD in Aerospace Engineering, IIT Kharagpur

Kharagpur, India Katılım Aralık 2018

226 Takip Edilen102 Takipçiler

Parth Thaker@ParthThaker15·29 Oca

@flappyairplanes Congrats guys. This!!! Would love to contribute in paid/unpaid ways if remote work is acceptable.

English

Flapping Airplanes@flappyairplanes·28 Oca

Announcing Flapping Airplanes! We’ve raised $180M from GV, Sequoia, and Index to assemble a new guard in AI: one that imagines a world where models can think at human level without ingesting half the internet.

GIF

English

339

258

3.6K

2.1M

Parth Thaker retweetledi

Pau Labarta Bajo@paulabartabajo_·17 Ara

Real-time audio transcription. Entirely on-device. This hands-on tutorial shows you how to build it from scratch. No cloud dependencies, no API calls, complete privacy, using LFM2-Audio-1.5B by @liquidai Enjoy ↓ github.com/Liquid4All/coo…

English

116

783

50.5K

Parth Thaker@ParthThaker15·11 Kas

@viditchess Happy to teach you ML if you like.

English

905

Vidit Gujrathi@viditchess·11 Kas

Peaceful morning!

English

737

73.6K

Parth Thaker@ParthThaker15·11 Kas

@calcsam BOOK

Português

Sam Bhagwat 🇸🇬 AIE Singapore@calcsam·10 Kas

icymi we wrote a new agents book: patterns for building ai agents it has everything you need to take your agents from prototype to production, like agent design patterns, the basics of security, etc reply to this tweet with BOOK and we'll dm you so you can get a copy

Sam Bhagwat 🇸🇬 AIE Singapore tweet media

English

5.3K

748

1.1M

Parth Thaker@ParthThaker15·10 Kas

@PavsProjects @prajdabre Platform name please

English

857

Pavel S Lelyukh@PavsProjects·10 Kas

@prajdabre You don’t necessarily need recommendations. There’s a tutoring platform and if you can game it you can make $70/h but yea Ivy League probably required at that level.

English

14.5K

Raj Dabre@prajdabre·10 Kas

Ok since many of you asked here is some more info: 1. You can't become a tutor if you're a rando. They almost always accept IVY League students/graduates or the equivalent. Preference is for native English speaking people but flexible if you have top credentials. So unless you are from the top 0.01-0.1% of your class/peer group, it's best if you don't daydream. 2. UAE and Saudi are not the only places. China also has its fair share of this market. Singapore too. Basically any country with ungodly rich people will have this situation. 3. You don't simply tap into the market. It's mostly based on recommendations. Also it's a heavily gatekept market. Good luck!

Raj Dabre@prajdabre

There's an obscure market involving tutoring rich kids from UAE and Saudi. You can work 10 hours a week and make as much money as an entry level SWE in FAANG.

English

313.5K

Parth Thaker@ParthThaker15·4 Kas

@Ra1kshit Yup 🙋🏻‍♂️

Rakshit (chessiro.com)@Ra1kshit·3 Kas

Super interested to put together a group of people that understand chess and tech well. Comment below if you are one, or i know someone in this intersection.

English

529

1.3K

115.6K

Parth Thaker retweetledi

Gukesh D@DGukesh·2 Kas

What a moment for Indian cricket! Huge congratulations to the Indian Women’s Team on winning the World Cup. Absolute champions! 🏆👏

English

1.3K

22.9K

163K

Parth Thaker retweetledi

Ashwin 🇮🇳@ashwinravi99·30 Eki

From being left out of the team earlier in the World Cup to taking your team to the final. Take a Bow Jemimah 👏

English

270

54K

511.5K

Parth Thaker@ParthThaker15·14 Eki

This repo has everything - tokenization - pretraining - finetuning - evaluation - inference - web serving. Thanks to @karpathy

Andrej Karpathy@karpathy

Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single, dependency-minimal codebase. You boot up a cloud GPU box, run a single script and in as little as 4 hours later you can talk to your own LLM in a ChatGPT-like web UI. It weighs ~8,000 lines of imo quite clean code to: - Train the tokenizer using a new Rust implementation - Pretrain a Transformer LLM on FineWeb, evaluate CORE score across a number of metrics - Midtrain on user-assistant conversations from SmolTalk, multiple choice questions, tool use. - SFT, evaluate the chat model on world knowledge multiple choice (ARC-E/C, MMLU), math (GSM8K), code (HumanEval) - RL the model optionally on GSM8K with "GRPO" - Efficient inference the model in an Engine with KV cache, simple prefill/decode, tool use (Python interpreter in a lightweight sandbox), talk to it over CLI or ChatGPT-like WebUI. - Write a single markdown report card, summarizing and gamifying the whole thing. Even for as low as ~$100 in cost (~4 hours on an 8XH100 node), you can train a little ChatGPT clone that you can kind of talk to, and which can write stories/poems, answer simple questions. About ~12 hours surpasses GPT-2 CORE metric. As you further scale up towards ~$1000 (~41.6 hours of training), it quickly becomes a lot more coherent and can solve simple math/code problems and take multiple choice tests. E.g. a depth 30 model trained for 24 hours (this is about equal to FLOPs of GPT-3 Small 125M and 1/1000th of GPT-3) gets into 40s on MMLU and 70s on ARC-Easy, 20s on GSM8K, etc. My goal is to get the full "strong baseline" stack into one cohesive, minimal, readable, hackable, maximally forkable repo. nanochat will be the capstone project of LLM101n (which is still being developed). I think it also has potential to grow into a research harness, or a benchmark, similar to nanoGPT before it. It is by no means finished, tuned or optimized (actually I think there's likely quite a bit of low-hanging fruit), but I think it's at a place where the overall skeleton is ok enough that it can go up on GitHub where all the parts of it can be improved. Link to repo and a detailed walkthrough of the nanochat speedrun is in the reply.

English

Parth Thaker@ParthThaker15·24 Eyl

@paraschopra Working (in terms of Sadhana) on realising the nature of consciousness (experientially, not as a concept or perception). Happy to discuss.

English

Paras Chopra@paraschopra·24 Eyl

Hit me up if you can't stop thinking about any one of these!

Lossfunk@lossfunk

Here's what we're interested in as a lab.

English

437

38.7K

Parth Thaker retweetledi

ChessBase India@ChessbaseIndia·22 Eyl

Twelve Indian athletes, including Indian Chessboxing Champion Sneha Waykar, were selected to represent India at the World Chessboxing Championship in Serbia, which starts tomorrow. But they are facing a major hurdle just a day before the tournament. Their visa applications are still “under process.” They have received no response from the authorities!

English

323

21.6K

Parth Thaker@ParthThaker15·1 Eyl

@modricadrian12 @cneuralnetwork Done

English

modricadrain@modricadrian12·1 Eyl

@ParthThaker15 @cneuralnetwork Your dm is closed. Plz dm me🙏

English

neural nets.@cneuralnetwork·1 Eyl

public question to all i want to go down two paths in the future. 1. a pre-doc 2. a masters (outside india) i mainly work in llm posttraining + machine translation in low resource settings have 1 A paper rn + (2 more this year upcoming) can someone guide me on some good options for me + when should i ideally start applying?

English

262

23.8K

Parth Thaker@ParthThaker15·1 Eyl

@modricadrian12 @cneuralnetwork Sure!

English

modricadrain@modricadrian12·1 Eyl

@ParthThaker15 @cneuralnetwork Bro i need some guidance. Can I dm u?

English

Parth Thaker retweetledi

Sadhguru@SadhguruJV·10 Ağu

Mukti is not for those who seek an escape. Mukti is for those who are overflowing with Fulfillment. #SadhguruQuotes

English

207

986

2.7K

33.7K

Parth Thaker retweetledi

Ananda@_anandaonly·6 Ağu

Stop imagining

English

446

8.5K

Parth Thaker@ParthThaker15·24 Tem

@miniapeur Force is rate of change of momentum. (NS is also derived using this)

English

Mathieu@miniapeur·24 Tem

What is the most interesting differential equation? And why?

English

168

22.5K

Parth Thaker@ParthThaker15·12 Tem

@cneuralnetwork Hey Make sure table has depth also, so that monitor is at enough of a distance. Otherwise it will invite headaches (atleast it happened in my case) Then I bought a 1400 Rs table from local shop (2.5*4 feet)

English

461

neural nets.@cneuralnetwork·12 Tem

i need tables like this (<6k) ~ will host my monitor,laptop, keyboard, mouse suggest

English

211

21.2K

Parth Thaker@ParthThaker15·11 Tem

@cneuralnetwork If coloring is random then accuracy should drop. We are adding an additional feature (color) which has no useful information. So Model should struggle. If model is very good - at best it may learn to ignore that feature and acc dips just a little? interesting experiment.

English

neural nets.@cneuralnetwork·11 Tem

weird idea so the core concept of MNIST is that it kind of understands the pattern the number is in, like circle and a line might be a 9 what if we make the task harder like instead of black and white we introduce random coloured pictures like some numbers will be in red some numbers will be green and so on will it affect the accuracy in any aspect really fun idea at the middle of the night let's try this tomorrow morning I'll let you know the result

English

9.2K

Parth Thaker@ParthThaker15·8 Tem

@maharshii youtu.be/fkkRiUjTTzo?si…