
Marktechpost AI
13.2K posts

Marktechpost AI
@Marktechpost
🐝 AI Dev News Platform (1 million+monthly traffic) | 150k+ AI subreddit | Contact: [email protected]









NVIDIA just dropped Gated DeltaNet-2. Here's what's actually interesting about it. Linear attention squeezes the unbounded KV cache into a fixed-size recurrent state. The hard part isn't what to forget. It's how to edit that compressed memory without scrambling the associations already in it. Prior delta-rule models like Gated DeltaNet and KDA use one scalar gate to do two different jobs at once: erasing old content on the key side, writing new content on the value side. Those two decisions act on different axes of the state, so tying them together is a real limitation. Gated DeltaNet-2 decouples them. 1. Channel-wise erase gate b_t→ Picks which key-side coordinates of the decayed state are read and removed 2. Channel-wise write gate w_t→ Picks which value-side coordinates of the new content are committed 3. Strict generalization→ Recovers KDA exactly when both gates collapse to one scalar → Recovers Gated DeltaNet when the decay collapses too 4. Still trains fast→ Chunkwise WY algorithm with channel-wise decay absorbed into asymmetric erase factors → Gate-aware backward fused in Triton Trained at 1.3B parameters on 100B FineWeb-Edu tokens, matched in recurrent state size against Mamba-2, Gated DeltaNet, KDA, and Mamba-3: → Best language modeling + commonsense average in both recurrent and hybrid settings → S-NIAH-3 at 2K (recurrent): KDA 63.2 → GDN-2 89.8 → MK-NIAH-1 at 4K (recurrent): KDA 28.0 → GDN-2 37.8 Full analysis: marktechpost.com/2026/05/24/nvi… Paper: github.com/NVlabs/GatedDe… Repo: github.com/NVlabs/GatedDe… @nvidia @NVIDIAAI








Perplexity just open-sourced an internal security tool they've been running in production. It's called 'Bumblebee'. Here's what's actually interesting: 1. It solves a specific blind spot SBOMs cover build artifacts. EDR covers running processes. Neither tells you what's installed on a developer's laptop right now. Bumblebee does exactly that — and nothing more. 2. The read-only design is the key decision npm packages can carry postinstall scripts that execute automatically on install. Most recent supply-chain worms spread that way. A scanner that invokes npm to check exposure has already triggered the attack. Bumblebee reads metadata directly — lockfiles, manifests, extension manifests — and never runs any code. 3. Four surfaces in one scan → Language package managers: npm, pnpm, Yarn, Bun, PyPI, Go modules, RubyGems, Composer → AI agent configs: MCP JSON host files including claude_desktop_config.json and cline_mcp_settings.json → Editor extensions: VS Code, Cursor, Windsurf, VSCodium → Browser extensions: Chrome, Edge, Brave, Arc, Comet, Firefox 4. The internal workflow is worth noting Perplexity Computer drafts a catalog entry when a threat signal lands → human reviews and merges the PR → Bumblebee runs on endpoints → findings go to the security team. Human in the loop before anything hits machines. 5. Technical details → Written in Go 1.25+, zero non-stdlib dependencies → Single static binary, three scan profiles: baseline, project, deep → Outputs NDJSON records with confidence levels (high / medium / low) → Apache 2.0, current release v0.1.1 Full analysis: marktechpost.com/2026/05/23/per… Repo: github.com/perplexityai/b… Technical details: perplexity.ai/hub/blog/perpl…. @perplexity_ai





Most agent frameworks today are stitching together reasoning models with external orchestration layers. Qwen3.7-Max takes a different position — train the agent capability into the model itself. Alibaba just introduced Qwen3.7-Max Here's what's actually interesting: → 1M-token context window — up from 256K on Qwen3.6 Max Preview → Extended-thinking mode with visible chain-of-thought reasoning trace → 1,000+ tool calls executed autonomously in an internal kernel optimization test → 35 hours of sustained autonomous execution on a single complex task → 56.6 on the Artificial Analysis Intelligence Index — #5 overall, ahead of Gemini 3.5 Flash → #13 in Text Arena (1,475 Elo), #7 in Math, #9 in Expert Prompts Full analysis: marktechpost.com/2026/05/21/qwe… Other technical details ⤵ @Alibaba_Qwen

