ulkubayer
3.6K posts

ulkubayer
@ulkubayer1
Software Start-up Founder, Entrepreneur, Industrial Designer, Interior Architect, Loving mom🥰#generativeart #dataviz #creativecoding



What Is computer-aided engineering? 🤔 Computer-aided engineering (CAE) is the use of software to simulate and analyze how products will perform in real-world conditions Read how this helps engineers test, optimize, and validate designs before building them. 👉 nvda.ws/4td8zbK




There's a common assumption in AI right now that if one language model can do a task reasonably well, having several of them collaborate — splitting up the work, checking each other's outputs, debating answers — should do it better. This paper puts that assumption under a controlled experiment across 180 configurations and finds that the reality is messier and more interesting: multi-agent setups improved performance by up to 81% on some tasks and made things worse by up to 70% on others, with the difference coming down to whether the task can be broken into parallel pieces or whether each step depends on the previous one. In a financial analysis, one agent can look at regulatory filings while another reads market news and a third examines earnings data — none of them need to wait for the others. In a Minecraft crafting puzzle, on the other hand, each action changes the inventory that the next action depends on, so the steps have to happen in order and splitting them across agents just adds overhead without any benefit. The paper fits an equation that predicts which architecture will work best for a new task 87% of the time. For anyone building or thinking about building systems where multiple AI models work together, this replaces a lot of hand-waving with something concrete. Read with an AI tutor: chapterpal.com/s/5c02af66/tow… Download the PDF: arxiv.org/pdf/2512.08296




Research from Imperial College London and Microsoft Research. According to this work, more compute FLOPs won't solve the AI agent inference problem. As AI agents move from chatbots to coding, web-use, and computer-use workflows, the memory capacity wall becomes the binding constraint. The industry needs to rethink inference infrastructure from the ground up. The default approach to scaling AI inference today remains GPU-centric: pack more compute onto bigger chips, stack more HBM, and hope the architecture fits every workload. The industry is building gigawatt-scale data centers around this assumption. But AI agents break this assumption entirely. This new research introduces a framework based on two metrics, Operational Intensity (OI) and Capacity Footprint (CF), that exposes why the classic roofline model fails to capture what actually bottlenecks agent inference. Here is what the researchers find: Different agentic workflows create wildly different hardware demands. A coding agent's context can snowball to 300K-1M tokens across 20-30 environment interactions. A computer-use agent consumes orders of magnitude more prefill tokens than a chatbot. Same base model, completely different system requirements. At batch size 1 with 1M context, a single DeepSeek-R1 request needs roughly 900GB of memory, far exceeding any single accelerator. Even a LLaMA-70B coding agent can exceed a B200's capacity ceiling. Meanwhile, the KV cache loading during decode drives OI so low that hardware spends most of its time moving data, not computing. The paper argues this means disaggregated serving should be the default. Prefill and decode phases show drastically different operational intensity and need specialized accelerators. NVIDIA's upcoming Rubin CPX already targets prefill-only workloads, but the authors predict more than two accelerator types will be needed within a single inference system. Heterogeneous architectures with disaggregated compute and memory, connected by optical interconnects, will let data centers rebalance hardware within a single generation rather than waiting for new chips. Homogeneous GPU clusters will amplify the mismatched scaling between compute, memory, and networking. Paper: arxiv.org/abs/2601.22001 Learn to build effective AI agents in our academy: academy.dair.ai

Genie 3 is pretty wild. People just dropped some new insane 3D world videos 100% AI 1. "You are a fish, you must escape the kitchen"




