FW

123 posts

FW

FW

@thegenerality

https://t.co/uJtHMTZCrZ

Katılım Mayıs 2018
16 Takip Edilen178 Takipçiler
FW retweetledi
DailyPapers
DailyPapers@HuggingPapers·
Microsoft just released VibeVoice-ASR on Hugging Face A unified speech-to-text model that transcribes hour-long audio in one pass With built-in speaker diarization, timestamps, and customizable user context
DailyPapers tweet media
English
5
36
256
24.3K
FW retweetledi
Robert Youssef
Robert Youssef@rryssf_·
🚨 Microsoft Research just launched something that might define the next era of AI systems. They call it 'Agentic Organization' and it’s not just a new model. It’s a new way for intelligence itself to organize. Here’s what’s wild: Most large language models still “think” like a single brain. Step-by-step. Linear. Slow. Even “parallel thinking” just runs the same process twice and merges answers later. Agentic Organization changes the entire game. They built a new reasoning protocol called AsyncThink, where a model plays both roles an Organizer that breaks a complex problem into sub-queries, and Workers that solve those sub-parts at the same time. Think of it like this: Instead of one mind grinding through steps, AsyncThink forms a mini civilization of minds delegating, merging, adapting in real time. And it learns this behavior through reinforcement learning literally learning how to organize its own thoughts. The results are insane: → 28% lower inference latency than parallel thinking → Higher accuracy on math reasoning tasks → Zero-shot generalization to unseen problems like Sudoku → Learned organizational policies that evolve dynamically during reasoning It’s like scaling from “an intelligent agent” → to “an intelligent organization.” AsyncThink models don’t just reason faster they reason like teams do. Fork. Think. Join. Verify. Iterate. This is a glimpse of post-LLM intelligence systems that don’t just think, they coordinate thought. And if that holds, the future of AI might look less like a single brain… and more like a company of minds. Paper: The Era of Agentic Organization: Learning to Organize with Language Models
Robert Youssef tweet media
English
45
218
1.1K
127K
FW
FW@thegenerality·
The Era of Agentic Organization
Rohan Paul@rohanpaul_ai

New @Microsoft paper teaches LLMs to organize reasoning into concurrent subtasks for faster, more accurate answers. It shows 28% lower wait time than typical parallel thinking while also boosting math accuracy. The big deal is simple, it turns coordination into a skill the model learns, so it decides when to split work, when to wait, and when to merge. The usual single chain wastes time because each step blocks the next. Fixed parallel plans also waste time because they cannot adapt to each query. The fix is an organizer that writes simple Fork and Join tags to start and merge worker thoughts. Workers chase sub-queries in parallel while the organizer keeps thinking and only pauses to Join. All control lives in plain text, so the base model stays unchanged. Training happens in 2 stages, first supervised traces that teach the tag format. Then reinforcement learning rewards correct final answers, clean format, and real concurrency. Speed is measured by the critical path through the Fork-Join graph, which matches true waiting. Across countdown puzzles, math questions, and Sudoku, the learned policy runs faster and fails less. The big idea is to learn organization itself rather than hard-code a script. ---- Paper – arxiv. org/abs/2510.26658 Paper Title: "The Era of Agentic Organization: Learning to Organize with Language Models"

English
0
0
3
85
FW retweetledi
FW
FW@thegenerality·
#VibeVoice Vibd Podcasting
Axel Dittmann@DittmannAxel

#Microsoft's VibeVoice-1.5B just turned my rig into a podcast studio. 4 voices. Zero API costs. Running locally on a consumer GPU. Generated 5 test podcasts instantaneously - they sound surprisingly human. Setup took 30 minutes: clone repo, load model (most of the time - rural Germany), feed script, press play. The open-source podcast revolution is here, and it fits in your home rig. Who needs cloud subscriptions when innovation runs at localhost? 🎙️ #GenAI #LocalAI #Podcasting #VibeVoice

English
0
0
1
70