Berkeley AI Research

1.4K posts

Berkeley AI Research banner
Berkeley AI Research

Berkeley AI Research

@berkeley_ai

We're graduate students, postdocs, faculty and scientists at the cutting edge of artificial intelligence research.

Berkeley, CA เข้าร่วม Temmuz 2017
442 กำลังติดตาม255.3K ผู้ติดตาม
Berkeley AI Research รีทวีตแล้ว
Ion Stoica
Ion Stoica@istoica05·
Excited to share our latest work: M²RNN! We’ve revisited non-linear RNNs and found that expanding the hidden state to a matrix (Matrix-to-Matrix) significantly improves language modeling while the non-linear recurrence enables expressivity beyond TC⁰. Key highlights: - Efficient Scaling: Our expansion mechanism leverages. Tensor Cores for high-throughput training. - Better Long-Context Performance: Beats SOTA hybrid linear attention models by 8 points on LongBench. - Hybrid Models: Replacing just ONE layer in a hybrid stack gives massive gains with minimal overhead. This establishes non-linear RNNs as a primary building block for the next generation of LLMs.
Mayank Mishra@MayankMish98

Introducing M²RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Language Modeling We bring back non-linear recurrence to language modeling and show it's been held back by small state sizes, not by non-linearity itself. 📄 Paper: arxiv.org/abs/2603.14360 💻 Code: github.com/open-lm-engine… 🤗 Models: huggingface.co/collections/op…

English
2
10
121
15.8K
Berkeley AI Research รีทวีตแล้ว
Jitendra MALIK
Jitendra MALIK@JitendraMalikCV·
With Emmanuel Dupoux scp.net/persons/dupoux/ and Yann LeCun @ylecun, we consider a cognitive science inspired AI. We analyse how autonomous learning works in living organisms, and propose a roadmap for reproducing it in artificial systems. lnkd.in/eNWDmuqT
English
8
78
441
58.7K
Berkeley AI Research รีทวีตแล้ว
Berkeley AI Research รีทวีตแล้ว
Shu Lynn Liu
Shu Lynn Liu@shulynnliu·
Researchers spend hours and hours hand-crafting the strategies behind LLM-driven optimization systems like AlphaEvolve: deciding which ideas to reuse, when to explore vs exploit, and what mutations to try. 🤖But what if AI could evolve its own evolution process? We introduce EvoX, a meta-evolution pipeline that lets AI evolve the strategy guiding the optimization. It achieves high-quality solutions for <$5, while existing open systems and even Claude Code often cost 3-5× more on some tasks. Across ~200 optimization problems, EvoX delivers the strongest overall results: often outperforming AlphaEvolve, OpenEvolve, GEPA, and ShinkaEvolve on math and systems tasks, exceeding human SOTA, and improving median performance by up to 61% on 172 competitive programming problems. 👇
Shu Lynn Liu tweet media
English
18
86
491
90.2K
Berkeley AI Research รีทวีตแล้ว
Berkeley AI Research รีทวีตแล้ว
Kuba
Kuba@kuba_AI·
AI can optimize materials 🤘 Our (@pabbeel, @svlevine, @AIatMeta) proposed transformer model 𝗖𝗹𝗶𝗾𝘂𝗲𝗙𝗹𝗼𝘄𝗺𝗲𝗿, combined with 𝗲𝘃𝗼𝗹𝘂𝘁𝗶𝗼𝗻 strategies, discovers materials that optimize target properties. arxiv.org/abs/2603.06082
Berkeley, CA 🇺🇸 English
3
19
66
11.9K
Berkeley AI Research รีทวีตแล้ว
Haocheng Xi
Haocheng Xi@HaochengXiUCB·
𝗞-𝗺𝗲𝗮𝗻𝘀 𝗶𝘀 𝘀𝗶𝗺𝗽𝗹𝗲. 𝗠𝗮𝗸𝗶𝗻𝗴 𝗶𝘁 𝗳𝗮𝘀𝘁 𝗼𝗻 𝗚𝗣𝗨𝘀 𝗶𝘀𝗻’𝘁. That’s why we built Flash-KMeans — an IO-aware implementation of exact k-means that rethinks the algorithm around modern GPU bottlenecks. By attacking the memory bottlenecks directly, Flash-KMeans achieves 30x speedup over cuML and 200x speedup over FAISS — with the same exact algorithm, just engineered for today’s hardware. At the million-scale, Flash-KMeans can complete a k-means iteration in milliseconds. A classic algorithm — redesigned for modern GPUs. Paper: arxiv.org/abs/2603.09229 Code: github.com/svg-project/fl…
English
36
196
1.7K
279K
Berkeley AI Research รีทวีตแล้ว
Dvij Kalaria
Dvij Kalaria@DvijKalaria·
Excited to be co-organizing this #ICRA2026 workshop! How do teleoperation, simulation, and human videos come together to scale robot learning? 🤖📈 Join us for talks, panels, and a Best Paper Award!
Rutav@rutavms

Excited to announce our #ICRA2026 workshop: Beyond Teleoperation Teleop, sim, human videos... we have different ideas to scale robot data, but how do they all fit together?🧩 Join us for exciting talks, panel discussions, and a Best Paper Award! Read on for details! 🧵👇

English
0
4
26
8.8K
Berkeley AI Research รีทวีตแล้ว
addison
addison@addikala·
New on @berkeley_ai blog. What actually affects the output of LLMs? New work by Butler et al. (NeurIPS 2025) introduces a model-agnostic method of measuring input, data, and component attribution, giving insight on what really steers generation. bair.berkeley.edu/blog/2026/03/1… 🐻📄
English
3
12
41
9.2K
Berkeley AI Research รีทวีตแล้ว
Guangqi Jiang
Guangqi Jiang@LuccaChiang·
Ever want to have a single policy to control diverse robots as well as different dexterous hands, or to observe the emergent behavior under cross embodiment training? Introducing our #CVPR2026 paper XL-VLA, Cross-Hand Latent Representation for Vision-Language-Action Models.
English
8
43
261
76.4K
Berkeley AI Research รีทวีตแล้ว
C.K. Wolfe
C.K. Wolfe@ckwolfeofficial·
Released on @berkeley_ai blog - recent work by Henry Pinkard et al.: information-driven design of imaging systems. Direct mutual information optimization, no decoder needed. Matches end-to-end methods with less memory and no task-specific retraining. Works across cameras, telescopes, and microscopes... 🐻📄 bair.berkeley.edu/blog/2026/01/1….
English
4
20
128
14.2K
Berkeley AI Research รีทวีตแล้ว
Ion Stoica
Ion Stoica@istoica05·
Excited to release AdaEvolve, bridging the gap between closed-source automated algorithm discovery and open-source accessibility. Rather than relying on rigid policies, AdaEvolve utilizes a dynamically adjusting search strategy. This adaptive framework yields a 33% median performance increase on Frontier-CS over the strongest open-source baselines. Read the full thread here.
Mert Cemri@mertcemri

AlphaEvolve proved LLMs can discover novel algorithms, but it remains closed-source, and open-source alternatives (OpenEvolve, GEPA) rely on rigid, static search policies. Introducing AdaEvolve: a fully adaptive evolutionary algorithm that dynamically adjusts its own search strategy based on observed progress. It matches or beats AlphaEvolve and best known Human SOTA on math and systems benchmarks, and boosts Frontier-CS median scores by 33% over the best open-source baseline across 185 tasks. 🧵👇 (1/n)

English
2
18
143
19.8K
Berkeley AI Research รีทวีตแล้ว
Berkeley AI Research รีทวีตแล้ว
Mert Cemri
Mert Cemri@mertcemri·
AlphaEvolve proved LLMs can discover novel algorithms, but it remains closed-source, and open-source alternatives (OpenEvolve, GEPA) rely on rigid, static search policies. Introducing AdaEvolve: a fully adaptive evolutionary algorithm that dynamically adjusts its own search strategy based on observed progress. It matches or beats AlphaEvolve and best known Human SOTA on math and systems benchmarks, and boosts Frontier-CS median scores by 33% over the best open-source baseline across 185 tasks. 🧵👇 (1/n)
Mert Cemri tweet media
English
10
68
338
73.7K
Berkeley AI Research รีทวีตแล้ว
Haozhi Qi
Haozhi Qi@HaozhiQ·
🤖 Attending ICRA 2026 and interested in dexterous manipulation? Don’t miss the chance to join or submit to our workshop! 📷Checkout our outstanding lineup of speakers, best poster award, and more details on our website: shorturl.at/ojsTo. Deadline: April 10 (AOE).
Haozhi Qi tweet media
English
1
10
71
11.9K
Berkeley AI Research รีทวีตแล้ว
Junyi Zhang
Junyi Zhang@junyi42·
𝗢𝗻𝗲 𝗺𝗲𝗺𝗼𝗿𝘆 𝗰𝗮𝗻’𝘁 𝗿𝘂𝗹𝗲 𝘁𝗵𝗲𝗺 𝗮𝗹𝗹. We present 𝗟𝗼𝗚𝗲𝗥, a new 𝗵𝘆𝗯𝗿𝗶𝗱 𝗺𝗲𝗺𝗼𝗿𝘆 architecture for long-context geometric reconstruction. LoGeR enables stable reconstruction over up to 𝟭𝟬𝗸 𝗳𝗿𝗮𝗺𝗲𝘀 / 𝗸𝗶𝗹𝗼𝗺𝗲𝘁𝗲𝗿 𝘀𝗰𝗮𝗹𝗲, with 𝗹𝗶𝗻𝗲𝗮𝗿-𝘁𝗶𝗺𝗲 𝘀𝗰𝗮𝗹𝗶𝗻𝗴 in sequence length, 𝗳𝘂𝗹𝗹𝘆 𝗳𝗲𝗲𝗱𝗳𝗼𝗿𝘄𝗮𝗿𝗱 inference, and 𝗻𝗼 𝗽𝗼𝘀𝘁-𝗼𝗽𝘁𝗶𝗺𝗶𝘇𝗮𝘁𝗶𝗼𝗻. Yet it matches or surpasses strong optimization-based pipelines. (1/5) @GoogleDeepMind @Berkeley_AI
English
63
448
3.4K
547.3K
Berkeley AI Research รีทวีตแล้ว
Kevin Zakka
Kevin Zakka@kevin_zakka·
Happy Friday!! mjlab v1.2.0 is out. This is our biggest release yet with 60+ PRs from 12 contributors. pip install mjlab Some highlights include: - New more powerful domain randomization module - Revamped ergonomic viewers - Cloud training via @SkyPilot - Complete doc rewrite
English
7
14
114
14.9K