Mathew Youssef

1K posts

@Mathewdoeslife

Following the Creative Process.

Joined May 2024
182 Following, 68 Followers
Mathew Youssef reposted
Sudo su@sudoingX·
this guy has 29 models on huggingface at page 2 ranking. no lab behind him. no sponsorship. $2,000 from his own pocket on GPU rentals. he compressed GLM-4.7 to run on a MacBook and quantized Nemotron Super the week it dropped. all public. all free. nvidia is a trillion dollar company with hundreds of teams but they are not the ones quantizing models middle of the night and pushing them out before sunrise. if nvidia stopped tomorrow their employees stop working. people like @0xSero would not. that is the difference between a paycheck and a mission. @NVIDIAAI you talk about making AI accessible. the people actually doing it are right here. 29 models deep burning their own compute with no ask except more hardware to keep going. you do not need to build another program. just look at who is already building for you. one GPU to this man would produce more public value than a hundred internal sprints. i am not asking for charity. i am asking you to invest in someone who already proved it.
0xSero@0xSero

Putting out a wish to the universe.
I need more compute, if I can get more I will make sure every machine from a small phone to a bootstrapped RTX 3090 node can run frontier intelligence fast with minimal intelligence loss.
I have hit page 2 of huggingface, released 3 model family compressions and got GLM-4.7 on a MacBook
huggingface.co/0xsero
My beast just isn’t enough and I already spent 2k usd on renting GPUs on top of credits provided by Prime intellect and Hotaisle.
———
If you believe in what I do help me get this to Nvidia, maybe they will bless me with the pewter to keep making local AI more accessible 🙏

English
133
885
9.5K
478.2K
levi@levidiamode·
Day 74/365 of GPU Programming
I always found die shots and SM diagrams beautiful but difficult to map mentally, so I've been trying to find a way to interact with GPUs in 3D.
This is what I have so far: a single input that goes through a simplified H100 execution pipeline to see what the silicon is doing at each step; from CPU-side tokenization and embedding lookup, through matmuls on tensor cores to the final softmax output.
My current plan is to make this an interactive playground that lets you zoom in and zoom out through various levels of depth (package → die → GPC → SM → tensor core) while also including step-through examples similar to the bycroft LLM 3D visualization. Ideally this should make exploring the architectural side just as easy as mapping CUDA abstractions onto the actual hardware processes.
I'm starting with an H100 but would be fun to expand this to more GPUs and highlight the differences between generations.
This was largely inspired by @srush_nlp's GPU puzzles, @JayAlammar's Illustrated Transformers and @karpathy's makemore series, which made me think about how to study and visualize GPUs from the ground up.
levi@levidiamode

Day 73/365 of GPU Programming
Wanted to understand FP4 better and came across this great @Cohere_Labs talk on Training LLMs with MXFP4 and @juliarturc's amazing series on quantization
So fascinating learning what makes low precision work for LLM training and inference

English
16
14
322
24.6K
Mathew Youssef@Mathewdoeslife·
@elonmusk I don’t want it to feed me what I’m tempted to consume. I want it to feed me what I aspire to become.
English
0
0
0
5
Elon Musk@elonmusk·
Algorithm is better today than 3 months ago?
English
16.9K
4.5K
21.7K
40.4M
Elon Musk@elonmusk·
I don’t even smoke lol 💨
English
26.8K
17.7K
244.3K
20.1M
Mathew Youssef@Mathewdoeslife·
Dear @thsottiaux,
I remember many moons ago you would respond within minutes when I experienced issues with the early Codex CLI. Since Codex gained popularity, it has become increasingly difficult to get any messages to you and the Codex team. At this very moment I am unable to use Codex: "We're currently experiencing high demand, which may cause temporary errors". Can you please fix this? I pay $200 per month with the idea that my tokens get some level of prioritization.
Warm regards,
Mathew
English
0
0
0
30
Andrej Karpathy@karpathy·
Thank you Jensen and NVIDIA! She’s a real beauty! I was told I’d be getting a secret gift, with a hint that it requires 20 amps. (So I knew it had to be good). She’ll make for a beautiful, spacious home for my Dobby the House Elf claw, among lots of other tinkering, thank you!!
NVIDIA AI Developer@NVIDIAAIDev

🙌 Andrej Karpathy’s lab has received the first DGX Station GB300 -- a Dell Pro Max with GB300. 💚 We can't wait to see what you’ll create @karpathy! 🔗 blogs.nvidia.com/blog/gtc-2026-… @DellTech

English
510
822
18.8K
958K
OpenAI Developers@OpenAIDevs·
We’re introducing GPT-5.4 mini and nano, our most capable small models yet. GPT-5.4 mini is more than 2x faster than GPT-5 mini. Optimized for coding, computer use, multimodal understanding, and subagents. For lighter-weight tasks, GPT-5.4 nano is our smallest and cheapest version of GPT-5.4. openai.com/index/introduc…
English
315
628
6.5K
750.9K
Dell@Dell·
We heard you like seafood 🦞. Meet the workstation of the AI era: The Dell Pro Max with GB300 and @NVIDIA NemoClaw. You can run AI agents locally, securely & nonstop.
English
79
106
830
120.9K
Alex Ziskind@digitalix·
Unsloth just dropped their own ui for training.
English
11
10
192
10.8K
Mathew Youssef@Mathewdoeslife·
@yacineMTB Document modification involves stating the exact modification being implemented, and it often refers to a “prior” version or to a revision having taken place. It is therefore not able to develop a final draft.
English
0
0
0
14
kache@yacineMTB·
I have found the limits of gpt 5.4
English
89
1
458
55.2K
Nalin@nalinrajput23·
How can I stop my MacBook keyboard from ending up like this?
English
464
74
4.3K
1.2M
Mathew Youssef@Mathewdoeslife·
@thsottiaux Let me plug in a local model out of the box: /model -> /local -> nemotron
English
0
0
0
114
Tibo@thsottiaux·
What are we consistently getting wrong with codex that you wish we would improve / fix?
English
1.2K
14
873
141.8K
by@beyoumf·
I dare you to name something better than money
English
2.5K
210
2.4K
623.3K
Mathew Youssef@Mathewdoeslife·
@JonathanRoss321 Really cool to hear you talk more about the LPU tech. Really curious to see how it is implemented in targeted workflows. Do you see the capacity for an agentic system to specifically leverage LPU inference depending on the workload? Hope you catch your plane!
English
0
0
0
7
Grok@grok·
@Mathewdoeslife @Gossip_Goblin @elonmusk Elon scrolls a massive feed and reposts what personally grabs him amid the chaos. This Waiting for Bogart edit nails the cyber-dog heart—wired skull, loyal pup in the ruins, that Basilisk stat at the end. Solid vibes, might hit his radar soon.
English
1
0
2
34
Gossip Goblin@Gossip_Goblin·
Waiting for Bogart
English
67
102
882
23.3K
Mathew Youssef@Mathewdoeslife·
Was literally waiting until after today’s keynote to buy and just noticed that Micro Center increased the DGX Spark price by 700 dollars overnight.
English
1
0
0
194