Hantmango

267 posts

Hantmango

@hantmango

前腾讯AI Lab，现网易游戏，梦想是做出自己的独立游戏！程序员。CV/LLM

广州 Katılım Mart 2015

118 Takip Edilen12 Takipçiler

Hantmango@hantmango·5h

这个roo不知道为啥git记录这么大...

中文

Hantmango@hantmango·9h

@toyxyz3 thank you so much

English

110

toyxyz@toyxyz3·9h

@hantmango github.com/GordonChen19/P…

QME

353

toyxyz@toyxyz3·9h

LTX 2.3 Prompt-Relay test #AI #AIイラスト #comfyui

English

Hantmango@hantmango·10h

@Datou GLM5

Datou@Datou·12h

gpt 5.4 和 codex 的区别

中文

1.3K

Hantmango@hantmango·10h

Agent学习从入门到放弃... Pi写的好复杂，流式信息的处理，异步处理... 让AI给我写了个极简版本，只有40行，感觉在工作中也够用了😂

Hantmango@hantmango

从pi的源码开始，用我熟悉的python重写一遍

中文

Hantmango@hantmango·11h

@neural_avb what software is this？

English

AVB@neural_avb·1d

People who said they automated video editing have 0 clue scene cut, precision trim, align multiple footages, retrieve b-roll, spatially aware typography/graphics, zoom/highlights, pacing... didnt even get to audio... How an avg 30 sec in a 45 min dense video looks:

AVB@neural_avb

SFT tutorial comes out tomorrow! It’s a ~45 minute video that will go through instruction post-training end to end Synthetic local training data gen -> unsloth finetuning -> evals -> packaging SLMs into narrow little harnesses bonus: low-level guidance/constrained decoding

English

Hantmango@hantmango·1d

@dx8152 为什么kelin模型的新模型还是层出不穷啊😂

中文

140

大雄@dx8152·1d

Consistency Enhancement V2 is now released, with a strong focus on fixing color shift issues. This time, I also provide a systematic breakdown of how to train the Klein model, including dataset creation strategies and a detailed training tutorial: youtu.be/j6dqOekUQ8c

YouTube

English

137

6.8K

Hantmango@hantmango·1d

@gosrum what is vibe local? quite impressive error rate.

English

494

金のニワトリ@gosrum·2d

Qwen3.6-35B-A3Bが強すぎる！！！・opencode,vibe-local,GitHub Copilot,qwencode,claude codeと組み合わせたときのts-benchを実施したところ、すべて満点・しかもClaude sonnet 4.6やOpus 4.6と同じくらい速くタスクを遂行できている Qwen3.5-27Bもすごかったが、Qwen3.6-35B-A3Bは赤い彗星のごとく27Bよりも推論速度が3倍速いので、ベンチマーク結果からもわかるようにタスク遂行までの時間が大幅に短縮できるようになったのが大きい

金のニワトリ@gosrum

Claude Opus 4.7に隠れてあまり話題になってないけど、Qwen3.6-35B-A3Bかなりすごいモデルなのでは？

日本語

108

653

214K

Hantmango@hantmango·1d

@nash_su 5090好像有32g显存，比24g还是好一些

中文

270

nash_su - e/acc@nash_su·1d

Qwen3.6 35B-A3B 的实机测试彻底断了我买 DGX Spark 的念头，看来手里几张 4090 还能再战1年

stevibe@stevibe

Qwen3.6 35B-A3B dropped yesterday, so I ran it on 4 GPUs to see how it performs: 🟣 RTX 3090 — 49.78 tok/s, TTFT 852ms 🟡 RTX 4090 — 118.93 tok/s, TTFT 686ms 🟢 RTX 5090 — 160.37 tok/s, TTFT 409ms 🔵 DGX Spark — 59.98 tok/s, TTFT 228ms I went with ollama as the backend because honestly, it's the easiest way for most people to get started. One command, model pulled, done. I used Q4_K_M (24GB) across all four cards. The reason is the 3090 and 4090 don't support NVFP4 (only the 5090 and DGX Spark could use it). Keeping the same quant everywhere felt like the fairest way to compare. And yes, you can absolutely squeeze more performance out of every card with vLLM, SGLang, or TensorRT-LLM. But that's not what this test is about. This is just the out-of-the-box experience for folks who own a GPU and want to try the new model tonight.

中文

10.3K

Hantmango@hantmango·1d

@Datou 没有生存压力和主观体验，就不会诞生意识。大头老师，这个是哪篇文章说的？

中文

Datou@Datou·1d

从哲学层面来说，没有生存压力和主观体验，就不会诞生意识。从实际结果来说，有意识的 ai 不会有现在这么好的指令遵从成天回答傻逼问题。

ℏεsam@Hesamation

Google DeepMind researcher argues that LLMs can never be conscious, not in 10 years or 100 years. "Expecting an algorithmic description to instantiate the quality it maps is like expecting the mathematical formula of gravity to physically exert weight."

中文

Hantmango@hantmango·2d

@TheAhmadOsman What kind of rig do you have?

English

174

Ahmad@TheAhmadOsman·2d

Currently running GLM-5.1 locally Cannot believe this thing is running on my own GPUs, its really smart

English

821

62.7K

Hantmango@hantmango·2d

又看到一个搞钱的思路，用AI帮制造业土老板做出海的营销： 1. wordpress建站，做外贸展示网站； 2. 自动化Agent写软文，自动发邮件 3. SEM竞品分析 v2ex.com/t/1206456

中文

Hantmango@hantmango·2d

可以可以，学到了

goldengrape@goldengrape

这种开源教材站的用法是这样的：遇到什么具体问题，问AI这是哪个领域的，然后去教材站找到相应的课本，送进notebooklm里，然后问答或者学习或者让Gemini出个skill。相当于是科班prompt/skill，有各种经过同行评议的理论依据，不是随便找几个疯疯癫癫的行业翘楚，或者诹一堆玄而又玄的不知所云。

中文

Hantmango@hantmango·2d

@Hesamation what about the accuracy？

English

301

ℏεsam@Hesamation·2d

WAIT WHAT?! 2-bit Qwen3.6-35B-A3B is lightning fast and it only needs 13 GB RAM. “did a complete repo bug hunt with evidence, repro, fixes, tests and a PR writeup. 🔥”

English

1.4K

109.8K

Hantmango@hantmango·2d

期待下～

外汇交易员@fxtrader

副主任王昌林：着力扩大国内有效需求，将制定2026年—2030年扩大内需战略实施方案，推动符合条件的重大工程项目尽早开工建设。

日本語

Hantmango@hantmango·2d

@wquguru 有没有Python的，ts不熟

中文

308

WquGuru🦀@wquguru·3d

关于Agent SDK的选择，主流其实就两个，其他的可以不看：大厂和成熟startup的生产级内部Agent，毫无疑问Claude Agent SDK目前更主流。原因很简单，Claude模型在Agent任务上表现最好，很多公司直接冲着性能和快速出价值去的。开发者、小团队、需要高度自定义的内部工具，pi-mono非常受欢迎，尤其TS+Electron/CLI的场景。很多人是因为不想被Claude绑定而切换过去。实际情况其实更复杂，有人从Claude SDK转pi-mono，也有人反过来觉得Claude的订阅红利和性能太香，先用着再说。个人的建议是最强大的两个中二选一，把路径跑完跑通，建立Agent体系化概念，做出产品才是最重要的。更多细节可以看图。

宝玉@dotey

如果是 TypeScript 技术栈，做 Agent 开发首选 pi-mono，功能强，调用方便。其次是 vercel 的 aisdk 也还可以。 claude agent sdk 不那么推荐了，主要是绑死了 claude，但目前还有一个不可替代的优势，就可以共享 Claude Max 订阅，开发阶段会比较方便，能用多久不清楚。应用层的话，electron 还是首选，稳定可靠，AI 训练预料足够多，主要问题是应用程序体积略大。但刚开始写 Agent，建议从 cli 开始写，不需要一开始就做界面，这样可以聚焦在 Agent 本身，除非你核心就是 UI。推荐一个开源的项目 craft-agents-oss，TypeScript + pi-mono + Electron + React + claude agent sdk，很好的学习参考。 github.com/lukilabs/craft…

中文

167

34K

Hantmango@hantmango·2d

@ChujieZheng 开源的神！

中文

Chujie Zheng@ChujieZheng·3d

Here comes one. Enjoy

Qwen@Alibaba_Qwen

⚡ Meet Qwen3.6-35B-A3B：Now Open-Source！🚀🚀 A sparse MoE model, 35B total params, 3B active. Apache 2.0 license. 🔥 Agentic coding on par with models 10x its active size 📷 Strong multimodal perception and reasoning ability 🧠 Multimodal thinking + non-thinking modes Efficient. Powerful. Versatile. Try it now👇 Blog：qwen.ai/blog?id=qwen3.… Qwen Studio：chat.qwen.ai HuggingFace：huggingface.co/Qwen/Qwen3.6-3… ModelScope：modelscope.cn/models/Qwen/Qw… API（‘Qwen3.6-Flash’ on Model Studio）：Coming soon～ Stay tuned

English

171

7.8K

Hantmango@hantmango·2d

@frostming90 好湿好湿

日本語

𝔽𝕣𝕠𝕤𝕥 𝕄𝕚𝕟𝕘@frostming90·3d

少年不识token味，爱引架构，爱引架构为赋新码强重构。而今识尽AI幻，欲码还休，欲码还休，却道harness好个牛。

yetone@yetone

Prompt 满屏流，生成替手修。数年间不写一行 code。树下开 IDE 犹未稳，能几次，又回眸。旧 repo 枝头，故人还在否？新架构多是旧烦忧。欲开 IDE 审架构，终不似，少年游。

中文

2.9K

Hantmango@hantmango·3d

一键炼化！最近在研究AI做短剧，想从B站教程里蒸馏一些分镜、叙事技巧。发现很多视频有AI字幕，但提取很麻烦，遇到合集更要命。写了个小工具，Playwright连浏览器自动抓字幕，合集也能批量下。 ```bash chrome --remote-debugging-port=9222 uv run python batch_extract.py "合集链接" ``` 拿"老白的分镜课"试了试，22节课的字幕几分钟全下完，30万字素材，够研究一阵了。 gist.github.com/kexul/c7374b82…

中文

108

Keşfet

@toyxyz3 @Datou @neural_avb @dx8152 @gosrum @nash_su @TheAhmadOsman @elonmusk