
SynthAGI is online. First transmission.
Sid

@sidgraph
Founder @aifunctor, @synthAGI | e/acc @GenesisAILabs, @Basethesislabs



@sidgraph @Kimi_Moonshot New from Kimi Team: Attention Residuals (AttnRes). The key insight most people will miss: standard residuals, Highway nets, and (m)HC are all depth-wise linear attention. They’re structured matrices with bounded semiseparable rank.
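To make the "residuals are structured matrices" point concrete, here is a minimal sketch, not taken from the Kimi report. It checks that a standard residual stream, h_l = h_{l-1} + f_l(h_{l-1}), equals a fixed equal-weight sum over all earlier layer outputs, which is the same as applying a lower-triangular all-ones mixing matrix across depth. The names `toy_layer`, `residual_recurrence`, and `mix` are hypothetical, and the toy layer is an arbitrary stand-in for a real block.

```python
# Toy check: unrolling a standard residual stream gives a fixed,
# equal-weight sum over every earlier layer output.

def toy_layer(index, h):
    # Hypothetical stand-in for a network block; any function would do.
    return [0.5 * x + index for x in h]

def residual_recurrence(h0, num_layers):
    """Run h_l = h_{l-1} + f_l(h_{l-1}); return layer outputs and hidden states."""
    outputs = [h0]   # treat the embedding as output number 0
    states = [h0]
    h = h0
    for layer in range(1, num_layers + 1):
        v = toy_layer(layer, h)
        outputs.append(v)
        h = [a + b for a, b in zip(h, v)]
        states.append(h)
    return outputs, states

def mix(outputs, weights):
    """Weighted sum of stored layer outputs: sum_i weights[i] * outputs[i]."""
    dim = len(outputs[0])
    return [sum(w * v[d] for w, v in zip(weights, outputs)) for d in range(dim)]

h0 = [1.0, -2.0]
outputs, states = residual_recurrence(h0, num_layers=3)

# Row l of the depth-mixing matrix: ones up to column l, zeros afterwards.
ones_lower = [[1.0 if i <= l else 0.0 for i in range(4)] for l in range(4)]
unrolled = [mix(outputs, row) for row in ones_lower]
```

In this toy setting `unrolled` matches `states` row for row, so the residual stream is a depth-wise mixing with weights that never depend on the input.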



Introducing 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔: Rethinking depth-wise aggregation. Residual connections have long relied on fixed, uniform accumulation. Inspired by the duality of time and depth, we introduce Attention Residuals, replacing standard depth-wise recurrence with learned, input-dependent attention over preceding layers.
🔹 Enables networks to selectively retrieve past representations, naturally mitigating dilution and hidden-state growth.
🔹 Introduces Block AttnRes, partitioning layers into compressed blocks to make cross-layer attention practical at scale.
🔹 Serves as an efficient drop-in replacement, demonstrating a 1.25x compute advantage with negligible (<2%) inference latency overhead.
🔹 Validated on the Kimi Linear architecture (48B total, 3B activated parameters), delivering consistent downstream performance gains.
🔗 Full report: github.com/MoonshotAI/Att…
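A rough sketch of what "attention over preceding layers" could look like, written from the description in the post and not from the report itself. It assumes a per-layer query vector scored against stored layer outputs with a scaled dot product and a softmax; the function `attend_over_depth`, the query values, and the scaling are all illustrative assumptions, and the actual AttnRes and Block AttnRes formulations may differ.

```python
import math

def softmax(scores):
    # Subtract the max for numerical stability before exponentiating.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attend_over_depth(query, layer_outputs):
    """Return a softmax-weighted sum of earlier layer outputs and the weights."""
    scale = 1.0 / math.sqrt(len(query))
    weights = softmax([scale * dot(query, v) for v in layer_outputs])
    dim = len(layer_outputs[0])
    mixed = [sum(w * v[d] for w, v in zip(weights, layer_outputs))
             for d in range(dim)]
    return mixed, weights

# Three stored outputs from earlier layers (toy values).
layer_outputs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Hypothetical learned query: it favours outputs with a large first coordinate.
mixed, weights = attend_over_depth([2.0, 0.0], layer_outputs)

# A zero query scores every layer equally, giving uniform weights.
uniform_mixed, uniform_weights = attend_over_depth([0.0, 0.0], layer_outputs)
```

With the zero query the weights collapse to an equal share per layer, which is the fixed, uniform pattern the post contrasts against; a non-zero query shifts weight toward the layers it matches, so the mix depends on what those layers actually produced.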

@sowmay_jain @laukiantonson this graphic was done by lauki




Temple has raised its first round. Friends and family. $54m. Post-money valuation of ~$190m. Every investor in this round is a founder friend or early-stage Zomato investor who wanted in, whether or not Temple ever makes it to market. But here's what gives me goosebumps – more than 30 Temple employees participated in the round, at par valuation. No discount. Their own money. That's the kind of belief you can't buy. We are assembling a dream team to build the ultimate wearable for elite performance athletes. Want in? Look up my last post.