Berkeley AI Research

1.4K posts


@berkeley_ai

We're graduate students, postdocs, faculty and scientists at the cutting edge of artificial intelligence research.

Berkeley, CA · Joined July 2017
442 Following · 255.3K Followers
Berkeley AI Research retweeted
Ion Stoica @istoica05
Excited to share our latest work: M²RNN! We've revisited non-linear RNNs and found that expanding the hidden state to a matrix (matrix-to-matrix) significantly improves language modeling, while the non-linear recurrence enables expressivity beyond TC⁰. Key highlights:
- Efficient scaling: our expansion mechanism leverages Tensor Cores for high-throughput training.
- Better long-context performance: beats SOTA hybrid linear-attention models by 8 points on LongBench.
- Hybrid models: replacing just ONE layer in a hybrid stack gives massive gains with minimal overhead.
This establishes non-linear RNNs as a primary building block for the next generation of LLMs.
Mayank Mishra @MayankMish98

Introducing M²RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Language Modeling. We bring back non-linear recurrence to language modeling and show it's been held back by small state sizes, not by non-linearity itself. 📄 Paper: arxiv.org/abs/2603.14360 💻 Code: github.com/open-lm-engine… 🤗 Models: huggingface.co/collections/op…

2 replies · 10 reposts · 121 likes · 15.9K views
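The core idea in the tweet, replacing an RNN's hidden vector with a matrix-valued state updated through a non-linearity, can be sketched in a few lines. This is a hypothetical toy cell, not the M²RNN architecture from the paper: the rank-1 update rule, weight shapes, and all names here are assumptions for illustration only.

```python
import numpy as np

def matrix_rnn_step(S, x, W_in, W_state):
    """One step of a toy non-linear RNN whose hidden state is a d x d
    matrix S instead of a length-d vector (illustrative sketch only)."""
    # Lift the input vector into matrix space with a rank-1 outer product.
    u = np.tanh(W_in @ x)            # (d,)
    update = np.outer(u, u)          # (d, d)
    # Non-linear matrix-to-matrix recurrence: the state now carries d^2
    # values, so capacity grows quadratically in d for the same width,
    # and the matrix products map well onto Tensor-Core-style hardware.
    return np.tanh(W_state @ S + update)

rng = np.random.default_rng(0)
d = 8
S = np.zeros((d, d))
W_in = rng.normal(scale=0.1, size=(d, d))
W_state = rng.normal(scale=0.1, size=(d, d))
for _ in range(16):                  # unroll over a toy input sequence
    S = matrix_rnn_step(S, rng.normal(size=d), W_in, W_state)
print(S.shape)  # (8, 8)
```

The non-linearity applied at every step is what a purely linear recurrence gives up; the tweet's claim is that with a large enough (matrix) state, keeping it pays off.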
Berkeley AI Research retweeted
Jitendra MALIK @JitendraMalikCV
With Emmanuel Dupoux scp.net/persons/dupoux/ and Yann LeCun @ylecun, we consider a cognitive-science-inspired approach to AI. We analyse how autonomous learning works in living organisms and propose a roadmap for reproducing it in artificial systems. lnkd.in/eNWDmuqT
8 replies · 78 reposts · 441 likes · 58.7K views
Berkeley AI Research retweeted
Shu Lynn Liu @shulynnliu
Researchers spend hours and hours hand-crafting the strategies behind LLM-driven optimization systems like AlphaEvolve: deciding which ideas to reuse, when to explore vs exploit, and what mutations to try. 🤖 But what if AI could evolve its own evolution process? We introduce EvoX, a meta-evolution pipeline that lets AI evolve the strategy guiding the optimization. It achieves high-quality solutions for <$5, while existing open systems and even Claude Code often cost 3-5x more on some tasks. Across ~200 optimization problems, EvoX delivers the strongest overall results: often outperforming AlphaEvolve, OpenEvolve, GEPA, and ShinkaEvolve on math and systems tasks, exceeding human SOTA, and improving median performance by up to 61% on 172 competitive programming problems. 👇
18 replies · 86 reposts · 491 likes · 90.3K views
Berkeley AI Research retweeted
Kuba @kuba_AI
AI can optimize materials 🤘 Our proposed transformer model CliqueFlowmer (with @pabbeel, @svlevine, @AIatMeta), combined with evolution strategies, discovers materials that optimize target properties. arxiv.org/abs/2603.06082
Berkeley, CA 🇺🇸 · 3 replies · 19 reposts · 66 likes · 11.9K views
Berkeley AI Research retweeted
Haocheng Xi @HaochengXiUCB
๐—ž-๐—บ๐—ฒ๐—ฎ๐—ป๐˜€ ๐—ถ๐˜€ ๐˜€๐—ถ๐—บ๐—ฝ๐—น๐—ฒ. ๐— ๐—ฎ๐—ธ๐—ถ๐—ป๐—ด ๐—ถ๐˜ ๐—ณ๐—ฎ๐˜€๐˜ ๐—ผ๐—ป ๐—š๐—ฃ๐—จ๐˜€ ๐—ถ๐˜€๐—ปโ€™๐˜. Thatโ€™s why we built Flash-KMeans โ€” an IO-aware implementation of exact k-means that rethinks the algorithm around modern GPU bottlenecks. By attacking the memory bottlenecks directly, Flash-KMeans achieves 30x speedup over cuML and 200x speedup over FAISS โ€” with the same exact algorithm, just engineered for todayโ€™s hardware. At the million-scale, Flash-KMeans can complete a k-means iteration in milliseconds. A classic algorithm โ€” redesigned for modern GPUs. Paper: arxiv.org/abs/2603.09229 Code: github.com/svg-project/flโ€ฆ
36 replies · 196 reposts · 1.7K likes · 279K views
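The "IO-aware, same exact algorithm" framing can be illustrated with a toy exact-assignment step that processes the N x k distance computation in tiles instead of materializing it all at once. This is a NumPy stand-in for the general idea only; the function name and tile size are made up, and the real Flash-KMeans kernels work at the GPU memory-hierarchy level, not in NumPy.

```python
import numpy as np

def assign_tiled(X, C, tile=4096):
    """Exact k-means assignment, computed in row tiles so the full
    N x k distance matrix never has to live in memory at once."""
    # ||x - c||^2 = ||x||^2 - 2 x.c + ||c||^2, and ||x||^2 is constant
    # per row, so the argmin only needs the cross term plus ||c||^2.
    c_sq = (C * C).sum(axis=1)                   # (k,)
    labels = np.empty(len(X), dtype=np.int64)
    for start in range(0, len(X), tile):
        xb = X[start:start + tile]
        d = -2.0 * xb @ C.T + c_sq               # (tile, k) distances (+ const)
        labels[start:start + tile] = d.argmin(axis=1)
    return labels

rng = np.random.default_rng(0)
X = rng.normal(size=(10_000, 32))                # points
C = X[rng.choice(len(X), size=16, replace=False)]  # centroids
labels = assign_tiled(X, C)
print(labels.shape)  # (10000,)
```

The answer is identical to the naive version; only the memory traffic pattern changes, which is exactly the kind of restructuring the tweet describes.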
Berkeley AI Research retweeted
Dvij Kalaria @DvijKalaria
Excited to be co-organizing this #ICRA2026 workshop! How do teleoperation, simulation, and human videos come together to scale robot learning? 🤖📈 Join us for talks, panels, and a Best Paper Award!
Rutav @rutavms

Excited to announce our #ICRA2026 workshop: Beyond Teleoperation. Teleop, sim, human videos... we have different ideas to scale robot data, but how do they all fit together? 🧩 Join us for exciting talks, panel discussions, and a Best Paper Award! Read on for details! 🧵👇

0 replies · 4 reposts · 26 likes · 8.8K views
Berkeley AI Research retweeted
addison @addikala
New on the @berkeley_ai blog. What actually affects the output of LLMs? New work by Butler et al. (NeurIPS 2025) introduces a model-agnostic method for measuring input, data, and component attribution, giving insight into what really steers generation. bair.berkeley.edu/blog/2026/03/1… 🐻📄
3 replies · 12 reposts · 41 likes · 9.2K views
Berkeley AI Research retweeted
Guangqi Jiang @LuccaChiang
Ever wanted a single policy that controls diverse robots as well as different dexterous hands, or to observe the emergent behavior that arises under cross-embodiment training? Introducing our #CVPR2026 paper XL-VLA: Cross-Hand Latent Representation for Vision-Language-Action Models.
8 replies · 43 reposts · 261 likes · 76.4K views
Berkeley AI Research retweeted
C.K. Wolfe @ckwolfeofficial
Released on the @berkeley_ai blog: recent work by Henry Pinkard et al. on information-driven design of imaging systems. Direct mutual-information optimization, no decoder needed. It matches end-to-end methods with less memory and no task-specific retraining, and works across cameras, telescopes, and microscopes. 🐻📄 bair.berkeley.edu/blog/2026/01/1…
4 replies · 20 reposts · 128 likes · 14.2K views
Berkeley AI Research retweeted
Ion Stoica @istoica05
Excited to release AdaEvolve, bridging the gap between closed-source automated algorithm discovery and open-source accessibility. Rather than relying on rigid policies, AdaEvolve utilizes a dynamically adjusting search strategy. This adaptive framework yields a 33% median performance increase on Frontier-CS over the strongest open-source baselines. Read the full thread here.
Mert Cemri @mertcemri

AlphaEvolve proved LLMs can discover novel algorithms, but it remains closed-source, and open-source alternatives (OpenEvolve, GEPA) rely on rigid, static search policies. Introducing AdaEvolve: a fully adaptive evolutionary algorithm that dynamically adjusts its own search strategy based on observed progress. It matches or beats AlphaEvolve and the best known human SOTA on math and systems benchmarks, and boosts Frontier-CS median scores by 33% over the best open-source baseline across 185 tasks. 🧵👇 (1/n)

2 replies · 18 reposts · 143 likes · 19.8K views
Berkeley AI Research retweeted
Mert Cemri @mertcemri
AlphaEvolve proved LLMs can discover novel algorithms, but it remains closed-source, and open-source alternatives (OpenEvolve, GEPA) rely on rigid, static search policies. Introducing AdaEvolve: a fully adaptive evolutionary algorithm that dynamically adjusts its own search strategy based on observed progress. It matches or beats AlphaEvolve and the best known human SOTA on math and systems benchmarks, and boosts Frontier-CS median scores by 33% over the best open-source baseline across 185 tasks. 🧵👇 (1/n)
10 replies · 68 reposts · 338 likes · 73.7K views
Berkeley AI Research retweeted
Haozhi Qi @HaozhiQ
🤖 Attending ICRA 2026 and interested in dexterous manipulation? Don't miss the chance to join or submit to our workshop! 📷 Check out our outstanding lineup of speakers, best poster award, and more details on our website: shorturl.at/ojsTo. Deadline: April 10 (AoE).
1 reply · 10 reposts · 71 likes · 11.9K views
Berkeley AI Research retweeted
Junyi Zhang @junyi42
๐—ข๐—ป๐—ฒ ๐—บ๐—ฒ๐—บ๐—ผ๐—ฟ๐˜† ๐—ฐ๐—ฎ๐—ปโ€™๐˜ ๐—ฟ๐˜‚๐—น๐—ฒ ๐˜๐—ต๐—ฒ๐—บ ๐—ฎ๐—น๐—น. We present ๐—Ÿ๐—ผ๐—š๐—ฒ๐—ฅ, a new ๐—ต๐˜†๐—ฏ๐—ฟ๐—ถ๐—ฑ ๐—บ๐—ฒ๐—บ๐—ผ๐—ฟ๐˜† architecture for long-context geometric reconstruction. LoGeR enables stable reconstruction over up to ๐Ÿญ๐Ÿฌ๐—ธ ๐—ณ๐—ฟ๐—ฎ๐—บ๐—ฒ๐˜€ / ๐—ธ๐—ถ๐—น๐—ผ๐—บ๐—ฒ๐˜๐—ฒ๐—ฟ ๐˜€๐—ฐ๐—ฎ๐—น๐—ฒ, with ๐—น๐—ถ๐—ป๐—ฒ๐—ฎ๐—ฟ-๐˜๐—ถ๐—บ๐—ฒ ๐˜€๐—ฐ๐—ฎ๐—น๐—ถ๐—ป๐—ด in sequence length, ๐—ณ๐˜‚๐—น๐—น๐˜† ๐—ณ๐—ฒ๐—ฒ๐—ฑ๐—ณ๐—ผ๐—ฟ๐˜„๐—ฎ๐—ฟ๐—ฑ inference, and ๐—ป๐—ผ ๐—ฝ๐—ผ๐˜€๐˜-๐—ผ๐—ฝ๐˜๐—ถ๐—บ๐—ถ๐˜‡๐—ฎ๐˜๐—ถ๐—ผ๐—ป. Yet it matches or surpasses strong optimization-based pipelines. (1/5) @GoogleDeepMind @Berkeley_AI
63 replies · 448 reposts · 3.4K likes · 547.4K views
Berkeley AI Research retweeted
Kevin Zakka @kevin_zakka
Happy Friday!! mjlab v1.2.0 is out. This is our biggest release yet, with 60+ PRs from 12 contributors. pip install mjlab Some highlights include:
- A new, more powerful domain randomization module
- Revamped, ergonomic viewers
- Cloud training via @SkyPilot
- A complete doc rewrite
7 replies · 14 reposts · 114 likes · 14.9K views