Hantmango

268 posts

Hantmango

@hantmango

前腾讯AI Lab，现网易游戏，梦想是做出自己的独立游戏！程序员。CV/LLM

广州 شامل ہوئے Mart 2015

118 فالونگ12 فالوورز

Hantmango@hantmango·12m

之前在项目里面做过一个换脸是用qwen edit做的，效果时好时坏。这个工作流看起来不错。

Photogenic Weekend@PhotogenicWeekE

1枚の画像からdataset作るworkflowはこれ。難しいことはしていない。左上のpromptを連続して当ててるだけ。

中文

Hantmango@hantmango·11h

这个roo不知道为啥git记录这么大...

中文

Hantmango@hantmango·15h

@toyxyz3 thank you so much

English

138

toyxyz@toyxyz3·15h

@hantmango github.com/GordonChen19/P…

QME

441

toyxyz@toyxyz3·16h

LTX 2.3 Prompt-Relay test #AI #AIイラスト #comfyui

English

10.7K

Hantmango@hantmango·16h

@Datou GLM5

Datou@Datou·18h

gpt 5.4 和 codex 的区别

中文

1.4K

Hantmango@hantmango·17h

Agent学习从入门到放弃... Pi写的好复杂，流式信息的处理，异步处理... 让AI给我写了个极简版本，只有40行，感觉在工作中也够用了😂

Hantmango@hantmango

从pi的源码开始，用我熟悉的python重写一遍

中文

Hantmango@hantmango·17h

@neural_avb what software is this？

English

AVB@neural_avb·1d

People who said they automated video editing have 0 clue scene cut, precision trim, align multiple footages, retrieve b-roll, spatially aware typography/graphics, zoom/highlights, pacing... didnt even get to audio... How an avg 30 sec in a 45 min dense video looks:

AVB@neural_avb

SFT tutorial comes out tomorrow! It’s a ~45 minute video that will go through instruction post-training end to end Synthetic local training data gen -> unsloth finetuning -> evals -> packaging SLMs into narrow little harnesses bonus: low-level guidance/constrained decoding

English

Hantmango@hantmango·1d

@dx8152 为什么kelin模型的新模型还是层出不穷啊😂

中文

152

大雄@dx8152·1d

Consistency Enhancement V2 is now released, with a strong focus on fixing color shift issues. This time, I also provide a systematic breakdown of how to train the Klein model, including dataset creation strategies and a detailed training tutorial: youtu.be/j6dqOekUQ8c

YouTube

English

143

7.1K

Hantmango@hantmango·1d

@gosrum what is vibe local? quite impressive error rate.

English

506

金のニワトリ@gosrum·3d

Qwen3.6-35B-A3Bが強すぎる！！！・opencode,vibe-local,GitHub Copilot,qwencode,claude codeと組み合わせたときのts-benchを実施したところ、すべて満点・しかもClaude sonnet 4.6やOpus 4.6と同じくらい速くタスクを遂行できている Qwen3.5-27Bもすごかったが、Qwen3.6-35B-A3Bは赤い彗星のごとく27Bよりも推論速度が3倍速いので、ベンチマーク結果からもわかるようにタスク遂行までの時間が大幅に短縮できるようになったのが大きい

金のニワトリ@gosrum

Claude Opus 4.7に隠れてあまり話題になってないけど、Qwen3.6-35B-A3Bかなりすごいモデルなのでは？

日本語

108

653

214.8K

Hantmango@hantmango·1d

@nash_su 5090好像有32g显存，比24g还是好一些

中文

273

nash_su - e/acc@nash_su·2d

Qwen3.6 35B-A3B 的实机测试彻底断了我买 DGX Spark 的念头，看来手里几张 4090 还能再战1年

stevibe@stevibe

Qwen3.6 35B-A3B dropped yesterday, so I ran it on 4 GPUs to see how it performs: 🟣 RTX 3090 — 49.78 tok/s, TTFT 852ms 🟡 RTX 4090 — 118.93 tok/s, TTFT 686ms 🟢 RTX 5090 — 160.37 tok/s, TTFT 409ms 🔵 DGX Spark — 59.98 tok/s, TTFT 228ms I went with ollama as the backend because honestly, it's the easiest way for most people to get started. One command, model pulled, done. I used Q4_K_M (24GB) across all four cards. The reason is the 3090 and 4090 don't support NVFP4 (only the 5090 and DGX Spark could use it). Keeping the same quant everywhere felt like the fairest way to compare. And yes, you can absolutely squeeze more performance out of every card with vLLM, SGLang, or TensorRT-LLM. But that's not what this test is about. This is just the out-of-the-box experience for folks who own a GPU and want to try the new model tonight.

中文

10.4K

Hantmango@hantmango·2d

@Datou 没有生存压力和主观体验，就不会诞生意识。大头老师，这个是哪篇文章说的？

中文

Datou@Datou·2d

从哲学层面来说，没有生存压力和主观体验，就不会诞生意识。从实际结果来说，有意识的 ai 不会有现在这么好的指令遵从成天回答傻逼问题。

ℏεsam@Hesamation

Google DeepMind researcher argues that LLMs can never be conscious, not in 10 years or 100 years. "Expecting an algorithmic description to instantiate the quality it maps is like expecting the mathematical formula of gravity to physically exert weight."

中文

1.1K

Hantmango@hantmango·2d

@TheAhmadOsman What kind of rig do you have?

English

174

Ahmad@TheAhmadOsman·2d

Currently running GLM-5.1 locally Cannot believe this thing is running on my own GPUs, its really smart

English

822

62.8K

Hantmango@hantmango·2d

又看到一个搞钱的思路，用AI帮制造业土老板做出海的营销： 1. wordpress建站，做外贸展示网站； 2. 自动化Agent写软文，自动发邮件 3. SEM竞品分析 v2ex.com/t/1206456

中文

Hantmango@hantmango·2d

可以可以，学到了

goldengrape@goldengrape

这种开源教材站的用法是这样的：遇到什么具体问题，问AI这是哪个领域的，然后去教材站找到相应的课本，送进notebooklm里，然后问答或者学习或者让Gemini出个skill。相当于是科班prompt/skill，有各种经过同行评议的理论依据，不是随便找几个疯疯癫癫的行业翘楚，或者诹一堆玄而又玄的不知所云。

中文

Hantmango@hantmango·2d

@Hesamation what about the accuracy？

English

302

ℏεsam@Hesamation·3d

WAIT WHAT?! 2-bit Qwen3.6-35B-A3B is lightning fast and it only needs 13 GB RAM. “did a complete repo bug hunt with evidence, repro, fixes, tests and a PR writeup. 🔥”

English

1.4K

110K

Hantmango@hantmango·2d

期待下～

外汇交易员@fxtrader

副主任王昌林：着力扩大国内有效需求，将制定2026年—2030年扩大内需战略实施方案，推动符合条件的重大工程项目尽早开工建设。

日本語

Hantmango@hantmango·3d

@wquguru 有没有Python的，ts不熟

中文

312

WquGuru🦀@wquguru·3d

关于Agent SDK的选择，主流其实就两个，其他的可以不看：大厂和成熟startup的生产级内部Agent，毫无疑问Claude Agent SDK目前更主流。原因很简单，Claude模型在Agent任务上表现最好，很多公司直接冲着性能和快速出价值去的。开发者、小团队、需要高度自定义的内部工具，pi-mono非常受欢迎，尤其TS+Electron/CLI的场景。很多人是因为不想被Claude绑定而切换过去。实际情况其实更复杂，有人从Claude SDK转pi-mono，也有人反过来觉得Claude的订阅红利和性能太香，先用着再说。个人的建议是最强大的两个中二选一，把路径跑完跑通，建立Agent体系化概念，做出产品才是最重要的。更多细节可以看图。