Gemma 4 26B-A4B is now ~2x faster at 375K context with TurboQuant (TBQ) on MLX-VLM v0.4.4 🚀 The model's official max context is 262K, but I pushed it to 375K anyway: roughly 5–6 full novels' worth of text (the entire LOTR trilogy plus The Hobbit). Up to ~20K tokens the two setups are neck and neck; past that, TBQ pulls ahead with ~1GB of memory savings. KV-cache savings are modest (4–17%) because only 5 of 30 layers get compressed, but those 5 layers dominate decode time at long contexts, which is where the big speed gains come from. Device: M3 Max 96GB
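
For a sanity check on those KV numbers, here's a rough sizing sketch in plain Python. This is not the actual TurboQuant or MLX-VLM code, and the post doesn't give the model's attention config, so the KV head count, head dim, and 4-bit group size below are assumptions chosen to land in the same ballpark as the reported ~1GB figure.

```python
# Back-of-the-envelope KV-cache sizing for mixed fp16 / 4-bit layers.
# All dims marked "assumption" are illustrative, not Gemma 4's real config.
N_LAYERS     = 30       # from the post: 5 of 30 layers get compressed
N_COMPRESSED = 5
N_KV_HEADS   = 2        # assumption (GQA-style, few KV heads)
HEAD_DIM     = 128      # assumption
CONTEXT      = 375_000  # tokens, from the post
GROUP_SIZE   = 64       # assumed 4-bit quantization group size

def kv_bytes_fp16(tokens: int, layers: int) -> int:
    # K and V caches, fp16 = 2 bytes per element
    return 2 * tokens * layers * N_KV_HEADS * HEAD_DIM * 2

def kv_bytes_int4(tokens: int, layers: int) -> int:
    # 4-bit packed values plus an fp16 scale + fp16 zero-point per group
    elems = 2 * tokens * layers * N_KV_HEADS * HEAD_DIM
    packed = elems // 2                  # two 4-bit values per byte
    meta = (elems // GROUP_SIZE) * 4     # 2 bytes scale + 2 bytes zero-point
    return packed + meta

full = kv_bytes_fp16(CONTEXT, N_LAYERS)
mixed = kv_bytes_fp16(CONTEXT, N_LAYERS - N_COMPRESSED) \
      + kv_bytes_int4(CONTEXT, N_COMPRESSED)

print(f"fp16 KV cache:  {full / 2**30:.1f} GiB")
print(f"mixed KV cache: {mixed / 2**30:.1f} GiB "
      f"(saves {(full - mixed) / 2**30:.1f} GiB, "
      f"{100 * (full - mixed) / full:.0f}%)")
```

With those assumed dims, compressing 5 of 30 layers to 4-bit saves about 12% of the KV cache (~1.3 GiB at 375K tokens), consistent with the post's modest 4–17% range even though the decode speedup at long context is much larger.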

We just released Gemma 4 — our most intelligent open models to date. Built from the same world-class research as Gemini 3, Gemma 4 brings breakthrough intelligence directly to your own hardware for advanced reasoning and agentic workflows. Released under a commercially permissive Apache 2.0 license so anyone can build powerful AI tools. 🧵↓