Manu_TechAndGames

15.9K posts

Manu_TechAndGames

Manu_TechAndGames

@AndroidBlogger

Developer, interested in many different tech fields. Working at Ubisoft Paris.

Katılım Şubat 2009
202 Takip Edilen192 Takipçiler
Manu_TechAndGames retweetledi
Sebastian Raschka
Gated DeltaNet has been one of my favorite "hybrid attention" newcomers in the good old transformer stack. Excited to see Gated DeltaNet-2. Adding it to my reading stack. In the meantime, I have a primer on Gated DeltaNet here: magazine.sebastianraschka.com/i/177848019/26…
Ali Hatamizadeh@ahatamiz1

Gated DeltaNet-2 is here. 🚀 🔥 New paper: Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention Gated DeltaNet-2 outperforms KDA and Mamba-3, the latest and best recurrent architectures, head to head at 1.3B. 🏆 💡 Here's the idea behind it: Linear attention squeezes an unbounded KV cache into a fixed-size recurrent state. The hard part isn't just what to forget, it's how to edit that memory without scrambling the associations already in it. Prior delta-rule models like Gated DeltaNet and KDA use one scalar gate to do two jobs at once: erasing old content and writing new content. But these two decisions act on different axes of the state, so tying them together is a real limitation. Gated DeltaNet-2 decouples them. ✂️ a channel-wise erase gate b_t picks which key-side coordinates to read and remove ✍️ a channel-wise write gate w_t picks which value-side coordinates to commit 🔁 recovers KDA when both gates collapse to a scalar, and Gated DeltaNet when the decay collapses too ⚡ still trains fast: chunkwise WY algorithm with gate-aware backward, fused in Triton 📊 Results: We train 1.3B models on 100B tokens of FineWeb-Edu, matched in recurrent state size, against Mamba-2, Gated DeltaNet, KDA, and Mamba-3. Best average on language modeling + commonsense reasoning, in both recurrent and hybrid settings Biggest gains on long-context RULER retrieval. S-NIAH-3 jumps from 63 to 90 over KDA, and multi-key needle retrieval climbs from 28 to 38 Joint work with @YejinChoinka and @jankautz. 📄 Paper: shorturl.at/AAlVb 💻 Code: github.com/NVlabs/GatedDe… #LinearAttention #StateSpaceModels #Mamba #LLM

English
11
10
65
11.4K
Manu_TechAndGames retweetledi
Shubham Sharma
Shubham Sharma@HappyyPablo·
open sourcing Marlin-2B 🐟 a tiny VLM to extract structured information from videos Marlin is finetuned for two questions devs want to ask in their videos: what is happening, and when? Best open model in its weight class, competitive with Gemini-2.5-flash at only 2B params 🧵
English
131
511
4.6K
286.2K
Manu_TechAndGames retweetledi
DailyPapers
DailyPapers@HuggingPapers·
Alibaba researchers present MIGA A train-free method for infinite-frame video generation with state-of-the-art temporal consistency across thousands of frames featuring a novel two-stage alignment mechanism and dual consistency enhancement.
English
6
13
41
5.5K
Manu_TechAndGames retweetledi
Casper Hansen
Casper Hansen@casper_hansen_·
composer 2.5 is quite good and it’s built on kimi k2.5 which is insane given the big difference in performance! the moat is simply data + harness + open-weight models. the rest of the work is just a skill issue. the biggest opportunity right now is creating good datasets.
English
10
6
121
6.8K
Manu_TechAndGames retweetledi
OpenAI
OpenAI@OpenAI·
Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
English
909
3.5K
24.6K
11.4M
Manu_TechAndGames retweetledi
Ethan Mollick
Ethan Mollick@emollick·
June 2024: The latest general-purpose LLMs could not count the r's in strawberry. July 2025: The latest general-purpose LLMs get gold in the International Math Olympiad. May 2026: The latest general-purpose LLM solve one of the "best-known questions in combinatorial geometry"
Ethan Mollick tweet mediaEthan Mollick tweet mediaEthan Mollick tweet media
English
52
208
1.6K
91.2K
Manu_TechAndGames retweetledi
DailyPapers
DailyPapers@HuggingPapers·
Stability AI just released SAME on Hugging Face A music autoencoder with 4096x compression—double the industry standard— maintaining pristine stereo reconstruction quality for generative audio workflows.
DailyPapers tweet media
English
2
14
62
8.1K
Manu_TechAndGames retweetledi
Android Developers
Android Developers@AndroidDev·
Native Android development is now fully supported in Google AI Studio! You can generate native Android apps from just a prompt, preview them via an embedded Android Emulator, and deploy them directly to test devices using ADB over USB. goo.gle/AndroidAIS_IO26 #GoogleIO
Android Developers tweet media
English
13
51
362
15K
Manu_TechAndGames retweetledi
Justine Moore
Justine Moore@venturetwins·
Cool new launch for Project Genie ✨ World generation is now grounded with Google Maps. Pick a real-world location, restyle it, and draft a character to explore the scene. I turned @a16z SF into a desert and wandered around as a camel.
Ben Poole@poolio

Real-world models are here! Stoked to share how we're bringing real-world locations to life by integrating Street View into Genie. Try it now at labs.google/fx/projectgenie and read the blog for more info: blog.google/innovation-and…

English
12
30
359
60.5K
Manu_TechAndGames retweetledi
Tesana - Make games with AI
Introducing Muranyi 3. The world’s most powerful AI model to make games. Ideas to games, faster than ever. Built with improved graphics and VFX, smoother animations, advanced game logic, smarter NPC behavior, faster loading times, faster generation, and more reliable builds. A new way to build worlds, dream up games, and bring them to life. Publish instantly. Earn from day one. You imagined it. Now build it.
English
59
220
1.3K
105.9K
Manu_TechAndGames retweetledi
Ethan Mollick
Ethan Mollick@emollick·
I had early Gemini Omni access: "sea otter in a pilot's uniform explains why Spirit Airlines went bankrupt to a river otter who is distracted by their laptop while they are in a hot air balloon over NYC. in the next balloon over, william shakespeare fights a robot made of pizza"
English
23
21
427
267.6K
Manu_TechAndGames retweetledi
Noam Brown
Noam Brown@polynoamial·
Andrej @karpathy is back in the game! I would have loved for him to rejoin @OpenAI, but I'm happy he's at any frontier lab pushing the field forward. It’s easy to frame this as zero-sum among the labs, but in truth we’re collectively advancing the most important tech of our era.
Andrej Karpathy@karpathy

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

English
62
127
3.7K
208.2K
Manu_TechAndGames retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
English
7.8K
11.1K
147.5K
26.4M
Manu_TechAndGames retweetledi
Chetaslua
Chetaslua@chetaslua·
Gemini Omni explains science with video ...... holy shit @demishassabis @OfficialLoganK thanks a lot for this , now every student will get there custom video for there topic of science and math I am so happy like while typing i want to see all of your reaction to this , this is new era
English
43
77
973
242.7K
Manu_TechAndGames retweetledi
huihui.ai
huihui.ai@support_huihui·
ByteDance just dropped an open-source model called Lance—and get this: it runs on just 3B active parameters! 🤯 Yet it can take in text, images, and videos, and simultaneously generate all three! Absolutely mind-blowing! huggingface.co/bytedance-rese…
English
21
94
764
41.3K
Manu_TechAndGames
Manu_TechAndGames@AndroidBlogger·
@DylanTFWang Awesome work. I'm still a little sad it's not real open source , as it can't be used in Europe and south Korea.
English
0
0
1
48
Tengfei Wang
Tengfei Wang@DylanTFWang·
⚡️After weeks of hard work, we're thrilled to fully open-source HY World 2.0 today -- full inference code and all models! Build, explore, and create your own interactive worlds with us.👇 github.com/Tencent-Hunyua…
Tengfei Wang@DylanTFWang

Genie3 generates videos. We generate 𝟯𝗗 𝘄𝗼𝗿𝗹𝗱𝘀 you can actually use. Launching tomorrow — Tencent #HYWorld 2.0, an engine-ready World Model🚀 This isn't a video. It's a real 3D scene, all generated & editable. One image in. A whole 3D world out. 🔥Open-source tomorrow

English
35
108
979
93.4K
Manu_TechAndGames retweetledi
Qwen
Qwen@Alibaba_Qwen·
🚀🚀Qwen3.7 Preview lands on Arena ! Here come Qwen3.7-Max-Preview & Qwen3.7-Plus-Preview. Alibaba now #6 lab in Text, #5 in Vision.⚡️⚡️ Can't wait to release Qwen3.7 series models!Stay tuned! @arena
Arena.ai@arena

Qwen3.7 Preview By @Alibaba_Qwen lands on Arena for Text and Vision. In Text Arena, Qwen3.7 Max Preview ranks #13 overall. Alibaba is now the #6 lab in this arena. - #7 Math - #9 Expert - #9 Software & IT - #10 Coding In Vision Arena: Qwen3.7 Plus Preview ranks #16 overall, making Alibaba the #5 lab. Congrats to the @Alibaba_Qwen team on the latest progress!

English
200
378
3.4K
605.8K