Mia

326 posts

Mia banner
Mia

Mia

@MiaAI_lab

Local AI, LLMs, tech thinker & builder

Sumali Temmuz 2022
191 Sinusundan194 Mga Tagasunod
Naka-pin na Tweet
Mia
Mia@MiaAI_lab·
Run DeepSeek v4 Flash locally on your 2x DGX Sparks easily, with 1M context github.com/MiaAI-Lab/Deep…
English
0
0
15
1.2K
Mia
Mia@MiaAI_lab·
A PR to vLLM to allow TP=3 for MiniMax M3 👀 His NVFP4 quant is 260GB - lukealonso/MiniMax-M3-NVFP4 Hopefully this will work for anyone with 3x DGX Sparks, 87GB per Spark. github.com/vllm-project/v…
English
0
0
1
52
Mia
Mia@MiaAI_lab·
@TTrimoreau The ones who know how to use it the best.
English
0
0
1
12
Thomas Trimoreau
Thomas Trimoreau@TTrimoreau·
At this point if AI writes 99% of code, who even survives in tech??
English
61
0
48
4K
Mia
Mia@MiaAI_lab·
@buildwithhassan And it's still underrated, especially the flash.
English
0
0
0
13
Hassan
Hassan@buildwithhassan·
opencode published their real model usage data. what developers actually run when they're paying for it: 1. deepseek v4 flash: 32T tokens 2. deepseek v4 pro: 19T tokens 3. kimi k2.6: 6.5T tokens deepseek is running more tokens than the next 16 models combined. it's actual usage from developers spending their own money. glm-5.1 grew 419% too. the models winning on price and reliability aren't always the ones winning on twitter.
Hassan tweet media
English
28
25
492
32.5K
Mia
Mia@MiaAI_lab·
@Kimi_Moonshot Can we please have a Kimi k2.7 Flash variant?
English
0
0
0
37
Kimi.ai
Kimi.ai@Kimi_Moonshot·
🌘 Kimi-K2.7-Code, our latest coding model, is now released and open-sourced! 🔷 Improved coding & agent performance over K2.6: +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite. 🔷 Reasoning efficiency: Less overthinking, with 30% lower reasoning-token usage compared to K2.6. 🔷 Long-horizon coding: Improved instruction following, higher end-to-end coding task success rates. ⚡️ 6x High-Speed Mode coming soon! 🔌 Available today via Kimi API and Kimi Code. 🔗 Kimi Code: kimi.com/code 🔗 API: platform.moonshot.ai
Kimi.ai tweet mediaKimi.ai tweet media
English
614
1.6K
13.6K
1.9M
Mia
Mia@MiaAI_lab·
@HarshithLucky3 MiniMax M3 Kimi k2.7 GLM 5.2 But honestly, DeepSeek v4 Flash/Pro are underrated, especially the flash.
English
0
0
0
81
Mia
Mia@MiaAI_lab·
@HuggingModels MiniMax M3 and its big new brothers are pushing me towards adding another two DGX Sparks.
English
0
0
0
43
Hugging Models
Hugging Models@HuggingModels·
Imagine a model that can see images, read text, and even understand video. Meet MiniMax-M3, a multimodal MoE powerhouse that's taking AI to the next level. It's not just another LLM, it's a vision, text, and video maestro. #AI #Multimodal
Hugging Models tweet media
English
4
4
74
3.2K
Catalin
Catalin@catalinmpit·
For those running local models, what’s your machine configuration? I’m thinking of selling my MacBook Pro M4 Max 48GB RAM and building a PC. Then get a MacBook Air for interacting with the LLM from the PC.
English
32
0
28
6.3K
Tech2Wild
Tech2Wild@Tech2Wild·
Qwen has disappeared, Minimax has went from 200B -> 400B. So who will save the day for the Single and Dual Sparkers. Deepseek V4.1 ?
English
11
0
38
4.8K
Mia
Mia@MiaAI_lab·
@0xSero I need to adopt this asap
English
1
0
2
208
0xSero
0xSero@0xSero·
I haven’t seen any political posts in 6 months
0xSero tweet media
English
22
4
254
5.5K
Mia
Mia@MiaAI_lab·
@0xhikigaya @TheVixhal The minimum vram/unified ram for Qwen 3.6 27b would be 24gb, 32gb preferably.
English
0
0
0
40
vixhaℓ
vixhaℓ@TheVixhal·
One day, Mythos / GPT-5.5 Pro-level models will run locally on my laptop.
English
20
3
227
9.1K
Mia
Mia@MiaAI_lab·
@TheVixhal Look at how good Qwen 3.6 27b is. It's a Sonnet 4.5 level, at least. Sonnet 4.5 was release on September 29, 2025. Do the math.
English
1
0
2
58
vixhaℓ
vixhaℓ@TheVixhal·
@MiaAI_lab Hoping you're right... What makes you say it's closer than it seems?
English
1
0
1
284
Mia
Mia@MiaAI_lab·
@mr_r0b0t @morganlinton @NVIDIAAI Same. I was considering buying 2 RTX 6000s but it's just not justifiable. I'm honestly shocked how satisfied I am with the DGX Sparks.
English
0
0
1
13
mr-r0b0t
mr-r0b0t@mr_r0b0t·
@morganlinton @NVIDIAAI tbh my RTX6000 dreams were shattered with the recent price increase, combined with this sale 😂 crazy to think I'll have 3x GB10 (all 4TB) for less than a new 6000 after the most recent increase 😶‍🌫️😶‍🌫️😶‍🌫️
English
4
0
5
61
mr-r0b0t
mr-r0b0t@mr_r0b0t·
So I did a thing 😁
mr-r0b0t tweet media
English
36
0
118
6.5K
Mia
Mia@MiaAI_lab·
Bro living the dream. 1TB of unified ram running Kimi k2.6.
Christian Merrill@M_Chimiste

@MiaAI_lab @QuixiAI @deepseek_ai @StepFun_ai This is how it’s currently configured with the weights being stored on dedicated M.2 drives on the side. I probably should change the configuration since I believe it’s slower with them stacked like this but it’s more convenient space wise.

English
0
0
0
155
Mia
Mia@MiaAI_lab·
DeepSeek-v4-Flash beats Step-3.7-Flash in head-to-head tool calling benchmark. Full results in: github.com/MiaAI-Lab/Deep…
Mia tweet media
English
10
0
36
2.8K
Mia
Mia@MiaAI_lab·
Exactly my point. And they are going to IPO soon.
Mia tweet media
Mia@MiaAI_lab

The publicity @AnthropicAI got from Fable 5 drama is going to create even more demand for it. There is no such thing of bad publicity if your product is good.

English
0
0
0
44