
sun 🐶
724 posts

sun 🐶
@sunncynn
Founder | DS turn SWE | Certified AWS Solution Architecture Professional | Building AI for local business


Introducing Latent Briefing, a way for agents to quickly share their relevant memory directly. Result: 31% fewer tokens used, same accuracy. Multi-agent systems are powerful, but can be wildly inefficient. They pass context as tokens, so costs explode and signal gets lost. We built an algorithm that allows agents to communicate KV cache to KV cache.












My faceless tiktok account promoting my app gets 600k views per month I haven’t posted since November 2025 It makes me $2,000+/month still.


Today, we closed our latest funding round with $122 billion in committed capital at an $852B post-money valuation. The fastest way to expand AI’s benefits is to put useful intelligence in people’s hands early and let access compound globally. This funding gives us resources to lead at scale. openai.com/index/accelera…

Two days ago, Anthropic cut off third-party harnesses from using Claude subscriptions — not surprising. Three days ago, MiMo launched its Token Plan — a design I spent real time on, and what I believe is a serious attempt at getting compute allocation and agent harness development right. Putting these two things together, some thoughts: 1. Claude Code's subscription is a beautifully designed system for balanced compute allocation. My guess — it doesn't make money, possibly bleeds it, unless their API margins are 10-20x, which I doubt. I can't rigorously calculate the losses from third-party harnesses plugging in, but I've looked at OpenClaw's context management up close — it's bad. Within a single user query, it fires off rounds of low-value tool calls as separate API requests, each carrying a long context window (often >100K tokens) — wasteful even with cache hits, and in extreme cases driving up cache miss rates for other queries. The actual request count per query ends up several times higher than Claude Code's own framework. Translated to API pricing, the real cost is probably tens of times the subscription price. That's not a gap — that's a crater. 2. Third-party harnesses like OpenClaw/OpenCode can still call Claude via API — they just can't ride on subscriptions anymore. Short term, these agent users will feel the pain, costs jumping easily tens of times. But that pressure is exactly what pushes these harnesses to improve context management, maximize prompt cache hit rates to reuse processed context, cut wasteful token burn. Pain eventually converts to engineering discipline. 3. I'd urge LLM companies not to blindly race to the bottom on pricing before figuring out how to price a coding plan without hemorrhaging money. Selling tokens dirt cheap while leaving the door wide open to third-party harnesses looks nice to users, but it's a trap — the same trap Anthropic just walked out of. The deeper problem: if users burn their attention on low-quality agent harnesses, highly unstable and slow inference services, and models downgraded to cut costs, only to find they still can't get anything done — that's not a healthy cycle for user experience or retention. 4. On MiMo Token Plan — it supports third-party harnesses, billed by token quota, same logic as Claude's newly launched extra usage packages. Because what we're going for is long-term stable delivery of high-quality models and services — not getting you to impulse-pay and then abandon ship. The bigger picture: global compute capacity can't keep up with the token demand agents are creating. The real way forward isn't cheaper tokens — it's co-evolution. "More token-efficient agent harnesses" × "more powerful and efficient models." Anthropic's move, whether they intended it or not, is pushing the entire ecosystem — open source and closed source alike — in that direction. That's probably a good thing. The Agent era doesn't belong to whoever burns the most compute. It belongs to whoever uses it wisely.







we’ve signed Zero Data Retention agreements with all providers for Go all models now follow a zero-retention policy your data is not used for training

Holy…S😳 Xiaomi's New CyberOne is so human-like Although this update features a bionic hand, I was immediately drawn to it. Let's look at the changes in the hand: It can handle industrial precision tasks like turning screws, plus delicate operations such as pinching feathers and throwing balloons. Behind the performance: >Volume cut by 60%:now almost identical in size/shape to a real human hand >Big leap in degrees of freedom (+50% total, +83% active:22-27DOF) >Full-palm tactile sensors over 8200 mm² for precise grip even without vision >150,000+ grip cycles durability (61-hour test) And a major innovation:Smart bionic sweat gland cooling: evaporates water for ~10W active heat dissipation Using tactile gloves to capture real human data, they’re training smoother, human-like grasps with imitation and reinforcement learning. Elon has said that humanoid hands and true AI are the most difficult aspects of building humanoid robots. It seems that Xiaomi is also getting close.

🇨🇳Moonshot AI Weighs Hong Kong IPO Chinese AI startup Moonshot AI is in early discussions about a potential Hong Kong initial public offering, according to people familiar with the matter. The Beijing-based firm behind the Kimi chatbot has held preliminary talks with China International Capital Corp. and Goldman Sachs Group Inc., though timing and listing plans remain uncertain. The deliberations come as Chinese authorities tighten scrutiny on offshore listings by red-chip firms, while still supporting domestic AI champions such as DeepSeek and Unitree. Peer companies including Zhipu and MiniMax have seen strong valuations following Hong Kong listings earlier this year, highlighting robust demand for AI assets. Moonshot is also exploring private fundraising, with discussions to raise up to $1 billion that could value the company at around $18 billion. Founded by Yang Zhilin, the firm counts Alibaba Group Holding Ltd., Tencent Holdings Ltd. and 5Y Capital among its investors, and recently upgraded its multimodal Kimi AI model. #CHINA #TECH #AI #MOONSHOT #KIMI (mktnews.com/flashDetail.ht…)

Okay let's see who can reply to this




