

琥珀青葉@KohakuLab

@KBlueleaf
Graduate student in Taiwan. Leader of KohakuLab. Researcher/Dev in ComfyOrg



🚨 BREAKING: Tencent has killed the “next-token” paradigm. Tencent and Tsinghua have released CALM (Continuous Autoregressive Language Models), and it completely disrupts the next-token paradigm.

LLMs currently waste massive amounts of compute predicting discrete, single tokens through a huge vocabulary softmax layer. It’s slow and scales poorly.

CALM bypasses the vocabulary entirely. It uses a high-fidelity autoencoder to compress chunks of text into a single continuous vector with 99.9% reconstruction accuracy. The model then predicts the “next vector” in a continuous space.

The numbers are actually insane:
- Each generative step now carries 4× the semantic bandwidth.
- Training compute is reduced by 44%.
- The softmax bottleneck is completely removed.

We’re literally watching language models evolve from typing discrete symbols to streaming continuous thoughts. This changes the entire trajectory of AI.

Implemented VQ and related algorithms in Triton, and got a 4~8x speedup with nearly zero extra VRAM usage (compared to a naive torch impl).
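For context, the core VQ step being accelerated is nearest-codebook assignment. Below is a hypothetical NumPy version of the naive baseline (the Triton kernel itself isn’t shown here); the naive approach materializes a full (N, C) distance matrix, which is exactly the memory cost a fused, tiled kernel avoids.

```python
# Naive reference for the vector-quantization (VQ) assignment step:
# snap each input vector to its nearest codebook entry.
# A hypothetical NumPy baseline for illustration, not the author's Triton code.
import numpy as np

def vq_assign(x, codebook):
    """x: (N, d) vectors, codebook: (C, d).

    Returns (indices of shape (N,), quantized vectors of shape (N, d)).
    Materializes the full (N, C) distance matrix -- the VRAM cost that a
    fused Triton kernel avoids by tiling over the codebook.
    """
    # Squared Euclidean distances via ||x||^2 - 2 x.c + ||c||^2.
    d2 = (
        (x ** 2).sum(axis=1, keepdims=True)   # (N, 1)
        - 2.0 * x @ codebook.T                # (N, C)
        + (codebook ** 2).sum(axis=1)         # (C,)
    )
    idx = d2.argmin(axis=1)
    return idx, codebook[idx]

rng = np.random.default_rng(0)
codebook = rng.normal(size=(16, 4))
# Inputs are slightly perturbed copies of codes 3, 7, 7.
x = codebook[[3, 7, 7]] + 1e-3 * rng.normal(size=(3, 4))
idx, q = vq_assign(x, codebook)
print(idx.tolist())
```

The fused-kernel win comes from never writing `d2` to global memory: each program instance streams codebook tiles through shared memory/registers and keeps only a running argmin per input row.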
