Zhiran

179 posts

Zhiran

@imbue_byte

China Katılım Ağustos 2025

46 Takip Edilen93 Takipçiler

Sabitlenmiş Tweet

Zhiran@imbue_byte·10 May

Hi There！这里是栀染。说起来，来到这里多少有些始料未及的意味。前阵子开源的因子挖掘项目 AlphaGPT，不知怎么就在社区里收到了好多意料之外的关注。有点受宠若惊，于是顺着这股热闹，迷迷糊糊地就跑来注册了账号。（为了我的 Startup 着想，我看大家都在运营自己的人设嘛🥺）阅读推文串⬇️

中文

623

Zhiran retweetledi

Phoenix Yin@Phoenixyin13·1d

这是我最重要的信息转发之一。这篇论文的第一作者是我极为钦佩的人，也是我的好朋友，来自@Tsinghua_Uni 姚班顶尖选手Guowei Xu，现在他在@Harvard 进行人工智能大模型的科研工作。 Guowei这篇论文精准击中了目前LLM搜索的两个致命瓶颈： ① 只有最后一步对错的sparse verification ② 所有候选答案都靠自回归生成，永远困在模型自己概率分布的entropy shell里由此，Guowei和他的团队提出BES这个全新的搜索框架，引入Forward Evolution，让大模型像生物演化一样思考，打破大模型原有的概率限制，逼它组合出平时根本写不出来的神仙脑洞。同时进行Backward Decomposition，把大任务拆成一堆一眼就能看出对错的子目标。这样大模型在往前走的时候，每走一步都有及时的Dense Feedback，走偏了立刻能纠正。 BES 在理论上成功证明了演化算子能帮大模型跳出思维定势，而倒推法可以指数级减少模型试错所需的样本量。当目前主流的Post-training提升算法都失效时，BES 依然能带得动并且让模型能力持续输出稳定提升，这无疑是打破了主流算法的天花板，值得许多人关注学习。我认为，Guowei这篇论文给Agent指明了新路。对于现在大火的 AI Agent 任务流、多智能体协同来说，这种一边基因重组思路，一边倒推拆解目标的方式，提供了一套更高效、更不容易跑偏的底层搜索算法。值得一提的是，@Kevin_GuoweiXu 同学不仅在清华姚班极其优秀，他曾经也是2022 年第 52 届国际物理奥林匹克竞赛（IPhO）的世界第一，金牌。他未来会在美国直博，大家可以多多关注follow！

Guowei Xu@Kevin_GuoweiXu

🚀 How should LLMs sample on hard reasoning problems during post-training and inference where direct rollouts rarely produce a correct answer? Best-of-N (e.g., GRPO) and tree search share two limitations: 🔻 Verification signals are sparse 🔻 Candidates stay within the model's own distribution We introduce BES: Bidirectional Evolutionary Search — a search framework that couples forward candidate evolution with backward goal decomposition. ✅ Works for both post-training and inference.

中文

193

964

126.1K

Zhiran retweetledi

T0nyS@imT0nyS·18 May

#超かぐや姫「 To Yachiyo ：」

Filipino

160

43.3K

650.7K

Zhiran@imbue_byte·17 May

@akenathonXVI @SnozakiSakura 已关

中文

roberto david@akenathonXVI·17 May

@imbue_byte @SnozakiSakura 已关来互关

中文

草莓泡芙🍀@SnozakiSakura·15 May

有蓝标的各位宝宝，可以互相关注一下，如果忘记回关也在下面说一声哦

中文

6.3K

Zhiran@imbue_byte·17 May

@SnozakiSakura 回了

日本語

草莓泡芙🍀@SnozakiSakura·17 May

@imbue_byte 已关注

中文

Zhiran@imbue_byte·17 May

@pantaloonz @SnozakiSakura 已回

日本語

jackjason（撸毛熊）@pantaloonz·17 May

@imbue_byte @SnozakiSakura 已关来互关

中文

Zhiran@imbue_byte·17 May

@xrwi238646 互关

日本語

装忧郁蹲在地上被狗认为是在出餐@xrwi238646·16 May

把推特当成社交软件就会收获很多萌萌互关。

中文

297

80.9K

Zhiran@imbue_byte·17 May

@tangeorange 互关👀

日本語

橙子@tangeorange·17 May

我刷推喜欢关注哪些人？ 1. 高质量蓝 v 2. 互 fo 蓝 v 3. 猫

中文

141

116

8.3K

Zhiran@imbue_byte·17 May

ZXX

247

Zhiran retweetledi

Harsh Bhatt@harshbhatt7585·16 May

A new way to pre-train language models that gives quite faster training. Normal causal LLMs predict the next token t+1 But this method predicts a bag of future tokens: (t+1, t+2, t+3, …) in a single step using token superposition. Instead of learning exact next-token prediction early on, the model first learns broad exposure to future tokens and data distribution by averaging token probabilities through approximation. The intuition is that early pretraining may not need exact token prediction, the model mainly needs exposure to language structure and data. Since this is only a weak approximation, it doesn’t work for the entire training process.  So later, training switches back to standard one-token prediction. This two-phase training surprisingly converges to similar loss with much fewer GPU hours.

English

151

11.8K

Zhiran retweetledi

Mathematica@mathemetica·16 May

The Hessian matrix H(f) of a function f: Rⁿ → R is the n×n matrix of all second partial derivatives. For f(x,y): [ fxx fxy ] [ fyx fyy ] Definition: [ ∂²f/∂x₁² ∂²f/∂x₁∂x₂ ... ∂²f/∂x₁∂xₙ ] [ ∂²f/∂x₂∂x₁ ∂²f/∂x₂² ... ∂²f/∂x₂∂xₙ ] H(f) = [ : : : ] [ ∂²f/∂xₙ∂x₁ ∂²f/∂xₙ∂x₂ ... ∂²f/∂xₙ² ] In compact form: [H(f)]ᵢⱼ = ∂²f / ∂xᵢ∂xⱼ Named after German mathematician Otto Hesse (1811–1874). Used to study curvature, convexity, and classify critical points in multivariable calculus & optimization.

English

150

931

21.3K

Zhiran@imbue_byte·16 May

@adkins_reb92893 yuka也是好上了，评论区里有万达广场了

中文

Zhiran@imbue_byte·16 May

⏰真应该发个法律解释，说过拟合和未来函数也纳入诈骗罪。

中文

456

Zhiran retweetledi

KiraMyao🐱@KiraMyao·15 May

会不会太隐晦了？🤔 就这样跪着求吗…是出了什么事故吗😨

中文

322

187.6K

Zhiran retweetledi

Zeyi(Andy) Liu@ZeyiAndyLiu·16 May

New paper: Spectral Lens Loss curves can hide how LLMs actually learn. We show that activation and gradient spectra reveal hidden representation geometry, predict token efficiency early, and distinguish learning gains from throughput gains. arxiv.org/abs/2605.05683

English

232

13.5K

Zhiran@imbue_byte·16 May

🌟 Open Source & Ready for the Edge. TidyLangChain is distributed under the Apache 2.0 license. If you are building the next generation of Edge AI, hardware agents, or smart IoT devices, check it out! Drop a ⭐ on GitHub and let us know what you build! ⚛️ 👇 github.com/imbue-bit/Tidy… #EdgeAI #IoT #EmbeddedSystems #CProgramming #LLM #LangChain #Microcontrollers

English

Zhiran@imbue_byte·16 May

📦 Zero External Dependencies. TidyLangChain is deeply portable. It relies solely on the standard C library and standard POSIX make. Simply compile it, statically link libtidylangchain.a, and deploy it directly to FreeRTOS, Zephyr, or bare-metal environments. ⚡

English

Zhiran@imbue_byte·16 May

🚀 Bringing Autonomous LLM Agents to the Edge! Meet TidyLangChain ⚛️: A deterministic, memory-bounded framework for LLM orchestration on Microcontrollers (MCUs) and resource-constrained IoT devices. Written strictly in ANSI C11. 👇 Let’s dive into how it works: 🧵1/N

English

166

Keşfet

@Tsinghua_Uni @Harvard @Kevin_GuoweiXu @akenathonXVI @SnozakiSakura @pantaloonz @xrwi238646 @tangeorange