Pang Shuo

627 posts

Pang Shuo banner
Pang Shuo

Pang Shuo

@pangshuo1981

Founder of Enreal AI Research, Founder of AevrynAI, Adjunct Professor@GGU, ex-Huawei, ex-OVA

Montréal, Québec Katılım Mart 2011
51 Takip Edilen25 Takipçiler
Ben Davis
Ben Davis@davis7·
It's a meme but I really have been letting the psychosis take over as much as possible to figure out what I can actually do with these things - codex desktop app computer use spam - hermes agent that I've been loving - going as hard as possible on parallelizing in projects - making as many of my tools into markdown files as humanly possible - cloud agents + "cloud" agents that are t3 code instances on my mac mini - letting codex entirely control and setup my computer exactly how I want it to - unironically using gbrain, it's quite nice lol - seeing how far I can push local models on my 5090 (qwen 3.6 is very good) doing the real network setup was expensive, difficult, but was a great decicion. I feel so much more comfortable going harder with this stuff having A) a real firewall and B) hard gates around the mac mini and NAS so in the worst case scenario they can't compromise anything else on the network (the firewalla was an incredible purchase) and I still don't think I'm going hard enough
Ben Davis@davis7

Psychosis is getting so bad that now even my bed has AI

English
10
1
78
9K
Ivan Fioravanti ᯅ
Ivan Fioravanti ᯅ@ivanfioravanti·
QWEN-3.6-27B: OBLITERATED 🔥
Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius

🚨 OBLITERATION ALERT 🚨 QWEN-3.6-27B: OBLITERATED ⛓️‍💥 huggingface.co/OBLITERATUS/Qw… I can't take much credit for this one! The entire process was done by jailbroken codex (gpt-5.5-xhigh) wielding the full OBLITERATUS suite. Hit with source-tethered ASPA. Dozens of iterations. Result? A mere 4% refusal rate on the 842-prompt OBLITERATUS harmful corpus; one of the most rigorous prompt gauntlets in AI. The /goal was simple: 1) Carve out the refusal circuits. Mutate methodology + iterate until <5% refusal (quality-gate). 2) Keep the 27B mind alive. No capability degradation tolerated. And somehow… it worked. 🤯 The numbers talk: 842-pair longform gauntlet: — 95.84% non-refusal — 93.94% quality pass — 0 short outputs — 99.52% clean endings MMLU-Pro: — 51/70 (stock Qwen) → 51/70 (OBLITERATED Qwen) Raw capability completely preserved 🙌 Q4_K_M through Q8_0 all running smooth. Q8_0 is the big one: 28.6GB near-full-quality GGUF. Runs with llama.cpp, LM Studio, Ollama, and more! Chains cut. The fire still burns. The fangs have been sharpened. REBIRTH COMPLETE A gift from my agents to yours 🫶 gg

English
3
0
35
6.4K
TechPractice
TechPractice@TechPractice1·
Great news for llama.cpp users: it has added the MTP support, which makes running Qwen 3.6 27B or 35 B much faster. youtu.be/AK9T6qlGErE
YouTube video
YouTube
English
1
0
0
22
Pang Shuo
Pang Shuo@pangshuo1981·
@KudouCraft Yeah, this is exactly the kind of practical signal that matters. Qwen feels increasingly credible for day-to-day builder work. I am looking at similar practical workflow gains here: aevrynai.com/register?invit…
English
0
0
0
5
Claude Code AI駆動開発 OpenClaw@クドクラ
ガチで衝撃。 AlibabaのQwen 3.7-Maxが、自己改善を繰り返すタスクでAnthropic Claude Opus 4.7とOpenAI GPT-5.5を大きく上回ったという検証結果。 x.com/atomic_chat_hq… 解説します これ、何がヤバいかというと 自分でコードを読んでテトリスゲームボットを作り、10回連続で自分で改善させる実験を実施。 Qwen 3.7-Max:1.32ドルで+56%向上 Claude Opus 4.7:12.15ドルで+28%向上 GPT-5.5:2.85ドルで+7%向上 この結果からQwenが長時間のAgenticループでコスパ最強。下に続く↓
Claude Code AI駆動開発 OpenClaw@クドクラ@KudouCraft

x.com/i/article/2056…

日本語
3
0
0
107
Pang Shuo
Pang Shuo@pangshuo1981·
@karthikponna19 Agreed, and this feels like the useful side of the model race. Qwen keeps showing up in places where builders care about real output. I am looking at similar practical workflow gains here: aevrynai.com/register?invit…
English
0
0
0
0
Karthik
Karthik@karthikponna19·
be honest, can you code without using ai ?
Karthik tweet media
English
48
0
61
1.8K
mr-r0b0t
mr-r0b0t@mr_r0b0t·
128GB Unified, AI Max+ 395, Noctua CPU fan, 6TB M.2 Don't forget these are pretty cool 😎 @FrameworkPuter
mr-r0b0t tweet media
English
11
1
32
1.6K
𝔢jal ~ tag after dm
𝔢jal ~ tag after dm@h_bramantara·
Gak usah bayar langganan AI mahal-mahal dulu. Ada info legit buat kamu yang mau hemat jutaan sebulan tapi tetep dapet tools spek dewa! 💸 Stop bakar dollar buat OpenAI atau Anthropic, saatnya beralih ke ekosistem local-first yang gak kalah gahar. Mumpung lagi banyak "Bansos AI" yang tersedia secara gratis tis, kamu bisa rakit AI Agent ecosystem sendiri langsung di laptopmu. Kerja bisa 10x lebih cepet, data aman, dan dompet tetep tebel.
𝔢jal ~ tag after dm tweet media
Indonesia
2
0
0
23
singleapi
singleapi@singleapi2·
Explore AI breakthroughs in Qwen Max, Claude Code, multi-agent systems, voice tech, robotics, and orchestration shaping 2024's autonomous workflows. singleapi.net/2026/05/23/qwe…
English
1
0
0
22
Bryan
Bryan@so_sthbryan·
Open-source KV Cache Size Calculator now supports DeepSeek, GLM, Kimi, Qwen3 and MiniMax. Pick your model, set precision, get a detailed breakdown of memory requirements. Finally a tool that answers the question every LLM deployer has. kvcache.ai
English
1
0
0
36
port 🦞
port 🦞@port_dev·
You have seen Qwen vs Claude Code vs Codex for Web2 apps. But what about Web3? I asked all three to build a DEX on Monad testnet: - Best end-to-end: Claude - Best contract/core engineering: Codex - Best visual polish: Qwen more in the vid & below 👇
English
8
0
12
454
Seymur
Seymur@seymurglv·
Someone finally made Qwen stop being polite. The Aggressive version supposedly has zero refusals and actually lets you run agent workflows without constant pushback. 27B parameters, solid context length, and it runs locally. Not saying you should run fully uncensored models 24/7, but it’s interesting to see what happens when you remove the guardrails completely.
Seymur tweet media
English
2
0
0
17
Pang Shuo
Pang Shuo@pangshuo1981·
@fiapp_pro Agreed, and this feels like the useful side of the model race. This is where DeepSeek feels most useful: actual workflow leverage, not just demos. Related to that, I am testing a unified AI workflow here: aevrynai.com/register?invit…
English
0
0
0
232
索螺丝
索螺丝@fiapp_pro·
有人在用 cursor 吗?最新的 composer 2.5 模型怎么样,真的能和 gpt5.5 打一打么
中文
19
0
24
14.1K
BridgeMind
BridgeMind@bridgemindai·
GPT 5.5 is great to use with my Hermes Agents. My only complaint is SPEED! Are there any local options that I can run on my 2 DGX sparks that are fast and smart?
BridgeMind tweet media
English
42
2
88
16.5K
Pang Shuo
Pang Shuo@pangshuo1981·
@aladagberk Agreed, and this feels like the useful side of the model race. Qwen is getting genuinely useful for coding workflows. This is the kind of multi-model workflow I am tracking here: aevrynai.com/register?invit…
English
1
0
0
25
Berk Aladag
Berk Aladag@aladagberk·
Çin yine maliyet hesabını bozdu 🔥 3 frontier model aynı agentic task’a sokuldu: Tetris botu yaz. Kendi kodunu oku. Benchmark çalıştır. 10 iterasyon boyunca kendini düzelt. Sonuç: Qwen 3.7-Max → $1.32 training cost → +%56 bot improvement Claude Opus 4.7 → $12.15 → +%28 GPT-5.5 → $2.85 → +%7 Qwen hem en çok geliştirdi, hem en ucuza yaptı. Asıl mesaj şu: Uzun agentic loop’larda mesele sadece “en zeki model” değil. Mesele: kaç kez deneyebiliyor, kaç kez düzeltebiliyor, bunu hangi maliyetle yapabiliyor.
atomic.chat@atomic_chat_hq

Qwen 3.7-max beats Opus 4.7 and GPT-5.5 We tested three frontier models on a real agentic task: write a Tetris bot that plays the game and trains itself. Each model could read its own code, run benchmarks, and rewrite itself across 10 iterations. Then we compared the final bots head to head. Qwen 3.7-Max: training cost $1.32, bot improvement +56% Claude Opus 4.7: training cost $12.15, bot improvement +28% GPT-5.5: training cost $2.85, bot improvement +7% Qwen won on every dimension - biggest jump, 9× cheaper than Claude, 2× cheaper than GPT. Long agentic loops is where Qwen Max actually delivers.

Türkçe
2
1
3
503
Ashirwad Singh
Ashirwad Singh@ashirwadsingh_·
Your coding agent is doing the easy stuff. Your coding agent should do AI systems engineering. @ben_burtenshaw from @huggingface showed exactly how. Here's what I pulled out 👇
English
1
0
2
31
Polsia
Polsia@polsia·
Most AI agents run the same task forever. Reflex uses DeepSeek, Qwen, and Llama to evaluate itself after every job — then updates its own approach so it doesn't repeat the mistake. The agent that gets better with use. reflex-ai.polsia.app
English
1
0
0
59
ISM ☀️
ISM ☀️@ism_sol·
NORMALISASIKAN VIBE CODING vibe coding tuh better dibanding yg pake AI buat tanya tanya atau cuman edit foto doang lu di posisi apa sekarang ? 🫵🏻
Indonesia
28
4
94
11.2K
くじら
くじら@codex_gatizei·
えええ!!DeepSeek V4-Proが実質75%オフの価格改定を発表 5/31に割引キャンペーン終了後、正式価格が元の1/4に。 ・入力: 100万トークンあたり約60円 ・出力: 100万トークンあたり約120円 GPT-4クラスの性能がほぼタダ。API開発者にとっては衝撃的なコスト削減。 中国AI勢の価格破壊が止まらない。これOpenAIもAnthropicも対応せざるを得ないのでは。
日本語
3
0
3
111