Safwan Shaheer

338 posts

Safwan Shaheer banner
Safwan Shaheer

Safwan Shaheer

@devorein

Admirer of human intelligence, open source and hard money Building: @undeniablehealth Prev: @CharmVerse, @scoutgamexyz, @madscriptai & @pitchmebro

شامل ہوئے Ekim 2020
459 فالونگ69 فالوورز
Safwan Shaheer ری ٹویٹ کیا
Tom Turney
Tom Turney@no_stp_on_snek·
Google dropped the TurboQuant paper yesterday morning. 36 hours later it's running in llama.cpp on Apple Silicon, faster than the baseline it replaces. the numbers: - 4.6x KV cache compression - 102% of q8_0 speed (yes, faster, smaller cache = less memory bandwidth) - PPL within 1.3% of baseline (verified, not vibes) the optimization journey: 739 > starting point (fp32 rotation) 1074 > fp16 WHT 1411 > half4 vectorized butterfly 2095 > graph-side rotation (the big one) 2747 > block-32 + graph WHT. faster than q8_0. 3.72x speedup in one day. from a paper I read at dinner last night. what I learned along the way: - the paper's QJL residual stage is unnecessary. multiple implementations confirmed this independently - Metal silently falls back to CPU if you mess up shader includes. cost me hours - "coherent text" output means nothing. I shipped PPL 165 thinking it worked. always run perplexity - ggml stores column-major. C arrays are row-major. this will ruin your afternoon everything is open source. the code, the benchmarks, the speed investigation logs, the debugging pain, all of it. github.com/TheTom/turboqu… paper to parity in 36 hours. what a time to be alive.
Tom Turney tweet media
English
51
104
1.2K
61.6K
Safwan Shaheer ری ٹویٹ کیا
Windscribe
Windscribe@windscribecom·
Show your ID to protect kids Show your ID to protect kid Show your ID to protect ki Show your ID to protect k Show your ID to protect Show your ID to protec Show your ID to prote Show your ID to prot Show your ID to pro Show your ID to pr Show your ID to p Show your ID to Show your ID t Show your ID Show your I Show your Show you Show yo Show y Show Sho Sh S Su Suc Suck Suck m Suck my Suck my b Suck my ba Suck my bal Suck my ball Suck my balls Suck my balls P Suck my balls Pa Suck my balls Pal Suck my balls Pala Suck my balls Palan Suck my balls Palant Suck my balls Palanti Suck my balls Palantir
English
249
3.9K
48K
2.2M
Safwan Shaheer ری ٹویٹ کیا
bubble boi
bubble boi@bubbleboi·
Noooooo your telling me that the Google Geniuses were able to compress KV Cach without losing quality by *checks notes* using polar coordinates ?! It was just.. *gasp* simple trigonometry? And wait all four of them are Iranian? Two out of four from Sharif University ? 😂😂😂
English
34
195
4.2K
196.8K
Safwan Shaheer ری ٹویٹ کیا
Noah Kagan
Noah Kagan@noahkagan·
Hot take: OpenClaw acquisition will go down as one of the worst acquisitions of all time. It’s insanely buggy and Claude Code can do nearly 80% of functionality without constant maintenance.
English
322
34
1.6K
149K
Safwan Shaheer ری ٹویٹ کیا
nixCraft 🐧
nixCraft 🐧@nixcraft·
Tim Cook must me laughing right now as he avoided spending on LLM and just keep selling his iPhones and computers and made real profit. Meanwhile, the AI batshit crazy Microslop is 37% down from the ATH. OpenAI running out of money and shutting down Sora app. Lmao
English
85
553
9.7K
288.4K
Safwan Shaheer ری ٹویٹ کیا
Hugging Models
Hugging Models@HuggingModels·
Meet a reasoning powerhouse: Qwen3.5-9B distilled with Claude 4.6 Opus reasoning! This GGUF model brings elite chain-of-thought capabilities to a compact 9B parameter package. Perfect for developers wanting reasoning smarts without massive compute.
Hugging Models tweet media
English
16
68
786
45.1K
Safwan Shaheer ری ٹویٹ کیا
Hugging Models
Hugging Models@HuggingModels·
Meet a game-changer in AI: Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled. This model can understand BOTH images AND text, then generate thoughtful responses. It's like giving your AI reasoning superpowers. The community is buzzing about its potential!
Hugging Models tweet media
English
13
73
682
40.4K
Safwan Shaheer ری ٹویٹ کیا
Wise
Wise@trikcode·
the generation that refused to accept cookies. is now giving AI access to their desktops, files, and bank accounts.
English
292
1.7K
12.7K
233.8K
Safwan Shaheer ری ٹویٹ کیا
Advait Paliwal
Advait Paliwal@advaitpaliwal·
I built Feynman, Claude Code for research. I gave it a question and it came back 30 minutes later with a cited meta analysis. It can also replicate experiments on Runpod, audit claims against code, and simulate peer review. Open source & MIT license, link below
English
102
300
3.5K
209.2K
Safwan Shaheer ری ٹویٹ کیا
ngrok
ngrok@ngrokHQ·
Quantization can make an LLM 4x smaller and 2x faster, with barely any quality loss. But what *is* it? @samwhoo crafted a beautiful interactive essay explaining it from first principles, aimed at coders, not mathematicians. ngrok.com/blog/quantizat…
English
15
135
1.1K
415.7K
Safwan Shaheer ری ٹویٹ کیا
Andi Marafioti
Andi Marafioti@andimarafioti·
OpenAI's latest repo has an interesting 3rd top contributor.
Andi Marafioti tweet media
English
31
41
2.6K
187K
Safwan Shaheer ری ٹویٹ کیا
Gergely Orosz
Gergely Orosz@GergelyOrosz·
Congrats on having fun and building a vibe coded WYSIWYG editor Patiently waiting for when they’ll realize things like permissions, tagging, backups+disaster recovery, search, exports, tables, mobile app, integrations w Slack+Linear+others need to be built… 🍿
Gergely Orosz tweet media
English
171
51
1.5K
187.3K
Safwan Shaheer ری ٹویٹ کیا
SUN YOUNG HWANG ᯅ 🇰🇷
Guys.. this model is just crazy. If you have just less than 48gb vram, just try the 8q gguf format. Feels just like opus! Tool calling is working smoothly!! Appreciate for this! (Hf and qwen!!) huggingface.co/Jackrong/Qwen3…
English
90
234
2.8K
183.2K
Safwan Shaheer ری ٹویٹ کیا
Riley Walz
Riley Walz@rtwlz·
made my computer dramatically play BBC news music before every meeting
English
588
6.3K
71.4K
4.1M
Safwan Shaheer ری ٹویٹ کیا
Larsen Cundric
Larsen Cundric@larsencc·
The gap between "works in a demo" and "works at scale" is about 4,000 commits.
English
106
459
5.7K
235.5K
Safwan Shaheer ری ٹویٹ کیا
Thomas Frank
Thomas Frank@TomFrankly·
Currently 892 hours into automating a 30-second task I do 4 times a year It's gonna be so worth it once I get everything working
English
101
523
16.6K
481.1K
Safwan Shaheer ری ٹویٹ کیا
Elvin
Elvin@elvin_not_11·
it's beautiful that I can traverse through 25 years of UI design history by clicking 3 times on Windows 11.
Elvin tweet media
English
279
2.4K
46.6K
1.1M