GlowLy

84 posts

@glowly29

Software Engineer at @vllm_project

Hanoi, Vietnam · Joined October 2021
3K Following · 78 Followers
GlowLy retweeted
LIFE AI @LifeNetwork_AI
1/ Life AI Testnet is coming! We’re opening access to the first OG members who will help shape the future of personalized, proactive healthcare. OG Role = Exclusive benefits, early access, and special rewards throughout our journey. Here’s EXACTLY how to secure your spot 👇
Replies 2.1K · Reposts 19.7K · Likes 22.5K · Views 576.9K
GlowLy retweeted
Andrej Karpathy @karpathy
Excited to release new repo: nanochat! (it's among the most unhinged I've written).

Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single, dependency-minimal codebase. You boot up a cloud GPU box, run a single script and in as little as 4 hours later you can talk to your own LLM in a ChatGPT-like web UI.

It weighs ~8,000 lines of imo quite clean code to:

- Train the tokenizer using a new Rust implementation
- Pretrain a Transformer LLM on FineWeb, evaluate CORE score across a number of metrics
- Midtrain on user-assistant conversations from SmolTalk, multiple choice questions, tool use.
- SFT, evaluate the chat model on world knowledge multiple choice (ARC-E/C, MMLU), math (GSM8K), code (HumanEval)
- RL the model optionally on GSM8K with "GRPO"
- Efficient inference the model in an Engine with KV cache, simple prefill/decode, tool use (Python interpreter in a lightweight sandbox), talk to it over CLI or ChatGPT-like WebUI.
- Write a single markdown report card, summarizing and gamifying the whole thing.

Even for as low as ~$100 in cost (~4 hours on an 8XH100 node), you can train a little ChatGPT clone that you can kind of talk to, and which can write stories/poems, answer simple questions. About ~12 hours surpasses GPT-2 CORE metric. As you further scale up towards ~$1000 (~41.6 hours of training), it quickly becomes a lot more coherent and can solve simple math/code problems and take multiple choice tests. E.g. a depth 30 model trained for 24 hours (this is about equal to FLOPs of GPT-3 Small 125M and 1/1000th of GPT-3) gets into 40s on MMLU and 70s on ARC-Easy, 20s on GSM8K, etc.

My goal is to get the full "strong baseline" stack into one cohesive, minimal, readable, hackable, maximally forkable repo. nanochat will be the capstone project of LLM101n (which is still being developed). I think it also has potential to grow into a research harness, or a benchmark, similar to nanoGPT before it. It is by no means finished, tuned or optimized (actually I think there's likely quite a bit of low-hanging fruit), but I think it's at a place where the overall skeleton is ok enough that it can go up on GitHub where all the parts of it can be improved.

Link to repo and a detailed walkthrough of the nanochat speedrun is in the reply.
[Image attached]
Replies 690 · Reposts 3.4K · Likes 24.2K · Views 5.8M
GlowLy @glowly29
RT @durov: Telegram and xAI have agreed to a 1-year partnership to distribute Grok to Telegram’s billion+ users and integrate it into its a…
Replies 0 · Reposts 335 · Likes 0 · Views 6
GlowLy retweeted
Elon Musk @elonmusk
Tesla vs other electric car companies
[Image attached]
Replies 5.2K · Reposts 10.7K · Likes 94.4K · Views 25.1M
GlowLy retweeted
Jeff Dean @JeffDean
Someone just reminded me of this lecture I gave in 2009 that described the evolution of Google Search from 1999 to 2009. People who are interested in how our search systems work might find this interesting. It touches on disk-based serving systems, in-memory indices, compression schemes for inverted indices, latency issues due to interference from background processes, queries of death, evolution of hardware, and more..
Video: videolectures.net/videos/wsdm09_…
Slides: static.googleusercontent.com/media/research…
Replies 39 · Reposts 257 · Likes 2.4K · Views 280K
Jeff Dean @JeffDean
My wife found some artwork of now-adult @vdean314 of the two of us and my longtime colleague Sanjay.
[Image attached]
Replies 30 · Reposts 12 · Likes 697 · Views 65.6K
GlowLy retweeted
Orbit Labs @orbit__labs
🚀 Hello #LUNC Community! Following the successful approval of the v3.4.0 software upgrade proposal (#12157), we are now actively working on the Wasmd unforking as our next priority. Link below: 👇 📌 What’s Next? After completing the Wasmd unforking, our next major milestone will be upgrading to Cosmos SDK 50. This upgrade will bring significant improvements to the Terra Classic ecosystem. Stay tuned for more updates! 🚀🔥
Replies 13 · Reposts 32 · Likes 145 · Views 5.2K
GlowLy @glowly29
I just claimed my spot for @MachidotXYZ liquidity bootstrapping event. Come check your eligibility at machi.xyz
Replies 0 · Reposts 0 · Likes 1 · Views 38
bread.mega @bread_
The entire memecoin lifecycle is going to be productized. All developed levels are absolutely printing. Currently waiting for the next level:
[Image attached]
Replies 22 · Reposts 4 · Likes 57 · Views 8.1K
GlowLy retweeted
Đỗ Hùng Việt @dohungviet
The entire nation mourning the loss of its leader - Comrade Nguyen Phu Trong, General-Secretary of the Communist Party of Viet Nam 🇻🇳
[Image attached]
Replies 35 · Reposts 81 · Likes 692 · Views 63.8K
GlowLy retweeted
Avail @AvailProject
Mainnet is coming. Turn on your notification. 23-07-24
Replies 351 · Reposts 860 · Likes 3.7K · Views 459.6K
GlowLy @glowly29
@Sickde_One @obumnwabude I have the same error. It works when I deploy and test on localnet, but I get the error when I run on testnet/devnet.
Replies 1 · Reposts 0 · Likes 2 · Views 29
Obum @obumnwabude
"Access violation in stack frame 5 at address 0x2000051f0 of size 8"
Replies 3 · Reposts 0 · Likes 1 · Views 924
GlowLy retweeted
Andrew Huang ⚡️⛓️ @KAndrewHuang
Chains in service of great apps — love to see some of the best builders in the space get it!
Replies 2 · Reposts 4 · Likes 33 · Views 7.6K
luxe @luxetheluxe
I have $1000, drop me the so-called trader to follow to make this grow 10X in the next few months
Replies 1 · Reposts 0 · Likes 3 · Views 226
Kevin 🇺🇦 @dj_d_sol
@hdevalence By a rough count I have ~16 commits to repos in that list, is that enough? UI says not eligible
Replies 2 · Reposts 0 · Likes 2 · Views 390