Anna Go

175 posts

Anna Go

@_anna_go

AI researcher || prev. Research Fellow at MIT & IAIFI (@iaifi_news), PhD in Theoretical Physics from @Perimeter and @UWaterloo, PSI 2016/2017

Katılım Mart 2018

165 Takip Edilen571 Takipçiler

Anna Go retweetledi

llm_enjoyer@LLMenjoyer·29 Mar

immensely proud to share 2 of my bros' work, Hybrid Associative Memories by @kamesh_ai and @leonlufkin It's basically SSM and Attention merged into one layer, where the attention is also sparse like DSA. Except you pretrain with the sparsity. It's lowkirk based (1/N) 👇

English

100

Anna Go retweetledi

Behnam Neyshabur@bneyshabur·25 Mar

We have been heads down but wanted to share a bit about what we are doing 🧵

English

552

201.8K

Anna Go retweetledi

Zyphra@ZyphraAI·18 Şub

Introducing ZUNA, a 380M-parameter BCI foundation model for EEG data, a significant milestone in the development of noninvasive thought-to-text. Fully open source, Apache 2.0.

English

221

1.8K

1.3M

Anna Go@_anna_go·12 Şub

@giffmana @SwayStar123 except, unfortunately, this not from academia

English

346

Lucas Beyer (bl16)@giffmana·11 Şub

@SwayStar123 Academia in a nutshell

Română

4.1K

sway@SwayStar123·11 Şub

v1 of paper vs v2 of paper Some supervisor didnt like this section lol

llm_enjoyer@LLMenjoyer

really proud of my homie @Nick__Alonso for dropping his latest banger. novel efficient long context attention method: arxiv.org/abs/2602.03922

English

121

30.4K

Anna Go@_anna_go·6 Şub

great work @Nick__Alonso !

Zyphra@ZyphraAI

Today @ZyphraAI releases OVQ-attention, an advancement for efficient long-context processing! Existing LLM layers compress input too much, leading to poor long-context understanding, or too little, leading to expensive memory+compute. OVQ-attention is an alternative path. 🧵

English

233

Anna Go retweetledi

llm_enjoyer@LLMenjoyer·5 Şub

really proud of my homie @Nick__Alonso for dropping his latest banger. novel efficient long context attention method: arxiv.org/abs/2602.03922

English

37.5K

Anna Go@_anna_go·23 Oca

@jm_alexia well, that's their loss. can't wait to see what cool stuff you'll work on next!

English

143

Anna Go@_anna_go·20 Oca

@TheGregYang good luck, Greg!!

English

Greg Yang@TheGregYang·20 Oca

I've been suffering from Lyme disease. I'm stepping back from xAI into an informal advisory role so I can go founder mode on my health, starting today. --- The symptoms started when I got sick (cold, flu, or COVID -- I'm not sure which) in early 2025. I distinctly felt less energetic, less creative, and less agentic even weeks after "recovery." After that, my condition ebbed and flowed, but the lows kept getting lower. Accidentally eating the wrong thing would make me extremely tired, taking days to recover. Working out would leave my whole body feeble for days. There was a week where I slept 12 hours a day and still couldn't recover. Lyme is famously hard to diagnose, but luckily I have an incredible doctor. He suspected these symptoms, far from being just in my head, indicated immune issues. Detective work over a few rounds of testing revealed I have Lyme disease. I was very surprised because Lyme is said to come from tick bites (where the bump looks like a target), but I don't ever remember having one. Likely I contracted Lyme a long time ago, but until I pushed myself hard building xAI and weakened my immune system, the symptoms weren't noticeable. --- Overall, I actually feel lucky to have discovered this early. Lyme is a serious disease that only gets harder to treat with age -- patients discovering it in their 50s or 60s have a much tougher time. Lyme can also be debilitating, leaving its victims bedridden, but luckily I'm still functional and can take care of myself day to day. So while some folks have said "you shouldn't have pushed yourself so hard," I'm glad I did. I found this issue early, and now I can fix it so I can push myself even harder when I rebound. --- Chronic Lyme is not well understood in the literature or by the public. For folks suffering from it, it can be a lonely fight. But I hope my story can make it just a little less lonely.

English

1.3K

238

9.9K

1.2M

Anna Go@_anna_go·17 Oca

@wzhao_nlp yep, very relatable

English

959

Wenting Zhao@wzhao_nlp·17 Oca

🌶️ Some (perhaps) spicy thoughts. It’s been a while since my last tweet, but I wanted to write about how disorienting it has been from academia to an LLM lab 😅 The kind of research I was trained to do during my PhD almost doesn’t exist here. The obsession with mathematical elegance and novelty is mostly gone. Everything is about scaling data and compute. For a while, that really got to me. At my lowest point, I felt like I’d lost interest in building LLMs altogether. I didn’t feel intellectually challenged anymore. What made this even stranger was that, at a technical level, things worked. If there was a capability I wanted to teach a model, scaling the right data and compute always got me there, no exception (so far). But recently, I found a way to reconcile with myself.. I realized the real competition isn’t in the ML recipe anymore. Most teams do roughly the same thing. What actually matters is how fast you can iterate, test ideas, and recover from mistakes. And that speed is mostly backed by infrastructure 🏗️ Faster loops, fewer bugs, better tooling. Seeing this made me excited again! Infra is its own deep, hard, and intellectually fun problem space. In 2026, I want to become an ML researcher who’s really good at infra. And I'll come back to ML problems with that edge, and will be excited to share what I find 😌

English

114

1.9K

201.5K

Anna Go retweetledi

Eric W. Tramel@fujikanaeda·15 Oca

The presence of a leading whitespace leaks the correct choice selection in the MMLU-Pro benchmark. Am I missing something? Seems to impact Chemistry, Physics, and Math. HF Issue in reply.

English

387

94.6K

Anna Go retweetledi

chimpfone@chimpfone·9 Oca

x.com/i/article/2009…

ZXX

533

348

4.1K

1.6M

Anna Go retweetledi

Ahmed Ahmed@AhmedSQRD·7 Oca

1/🧵 We prompted production LLMs with a short prefix of a book and asked them to complete the rest. How much of the book did they return? For Harry Potter and the Sorcerer’s Stone: (jailbroken) Claude 3.7 Sonnet→95.8%, GPT-4.1→4.0% (not jailbroken) Gemini 2.5 Pro→76.8%, Grok 3→70.3% Read on more details:

English

323

76.6K

Anna Go@_anna_go·30 Ara

@NatureUnedited that's not Japan. that's Song Bao setting up a leaf bed for FuBao at Everland in South Korea. know your pandas.

English

1.4K

Nature Unedited@NatureUnedited·29 Ara

A zoo in Japan brings a panda its favorite red leaves to boost its morale. The result is a very happy panda! 🐼🍁

English

155

2.4K

29.7K

896K

Anna Go@_anna_go·19 Kas

@_beenkim yep

421

Been Kim@_beenkim·18 Kas

My husband lost her wedding ring during our honeymoon 🙄 (this was many years ago) "Stop playing with it - you will lose it" "Nah, it's fine. It's right here!" Then it was gone-never seen again. 💍 Only if I had Gemini 3 to find it then! Here is a picture - there is gold ring in it. Can you find it?

English

165

77.9K

Anna Go@_anna_go·18 Eyl

@WorkaholicDavid ew

Amir@WorkaholicDavid·17 Eyl

This is how the thickness of recent iPhone cameras look next to each other.

English

148

794

13.4K

795.2K

Anna Go retweetledi

Behnam Neyshabur@bneyshabur·2 Eyl

OK, @sarawiltberger and I are experimenting with a small, project-based mentorship program designed for the age of AI. We’re looking for resourceful self-starters—from early high school to early-career professionals—who want to prove their abilities through hard work. You don’t need a specific skill set to apply—just curiosity, drive, and the motivation to build something cool. Application deadline: Sep 12, 2025 forms.gle/hwQLYJHFX6uk6A… Please retweet & particularly send to high schoolers who might be interested!

Behnam Neyshabur@bneyshabur

I've been reflecting deeply on how the rapid AI revolution is reshaping education, employment, and entrepreneurship. I want to help ambitious, talented individuals—whether high schoolers, PhDs, skilled professionals, or entrepreneurs outside AI—to thrive during this transition. I'm planning to experiment with a few practical initiatives. What would genuinely help you or those around you? I’m very open to your ideas and suggestions!

English

20.2K

Anna Go retweetledi

Stella Biderman@BlancheMinerva·12 Ağu

Are you afraid of LLMs teaching people how to build bioweapons? Have you tried just... not teaching LLMs about bioweapons? @AIEleuther and @AISecurityInst joined forces to see what would happen, pretraining three 6.9B models for 500B tokens and producing 15 total models to study

English

571

64.9K

Anna Go retweetledi

Blake Bordelon ☕️🧪👨‍💻@blake__bordelon·8 Ağu

Excited to announce that I will be joining @UTAustin with a joint position between @OdenInstitute for Computational Science and dept of Neuroscience in FL 2026! I plan on recruiting PhD students and postdocs interested in mathematics of neural computation (more details to come).

English

339

48.1K

Anna Go@_anna_go·12 Tem

@mitsuhiko add "wild"

English

Armin Ronacher ⇌@mitsuhiko·11 Tem

Enough is enough.

English

626

35.7K

Anna Go@_anna_go·18 Haz

@hardmaru how many NP-hard problems has it solved already?

English

114

hardmaru@hardmaru·17 Haz

Sakana AI developed a new coding agent, ALE-Agent, trained to solve NP-hard optimization problems. Our agent participated in a live coding competition, the challenging AtCoder Heuristic Contest, and ranked #21 out of 1,000 human participants! Learn more: sakana.ai/ale-bench/

Sakana AI@SakanaAILabs

Introducing ALE-Bench, ALE-Agent! Towards Automating Long-Horizon Algorithm Engineering for Hard Optimization Problems Blog: sakana.ai/ale-bench/ Paper: arxiv.org/abs/2506.09050 ALE-Bench is a coding benchmark primarily focused on hard optimization (NP-hard) problems. We developed this benchmark with AtCoder Inc., a leading coding contest platform company. What makes ALE-Bench unique is its focus on hard optimization problems that demand long-horizon and creative reasoning. It’s open-ended, in the sense that true optima are out of reach (NP-hard) and scores can continuously improve. We believe this benchmark has the potential to become one of the key benchmarks for reasoning and coding in the next generation. ALE-Agent is our end-to-end agent that we specifically designed for this challenging domain. In fact, our ALE-Agent has already built an impressive track record in the wild! In May 2025, our agent participated in a live AtCoder Heuristic Competition (AHC), alongside 1,000 other participants in real-time. AHC is considered to be one of the most challenging coding competitions in this domain. Our ALE-Agent achieved an impressive ranking of 21st out of 1,000 human participants in the competition (top 2%), marking a turning point for AI discovery of solutions to hard optimization problems with a wide spectrum of important real world applications such as logistics, routing, packing, factory production planning, power-grid balancing. We look forward to applying this technology to real industrial optimization opportunities. Building on the insights from this study, Sakana AI will continue to tackle the challenge of developing AI with even greater algorithm engineering capabilities. ALE-Bench Dataset: huggingface.co/datasets/Sakan… ALE-Bench Code: github.com/SakanaAI/ALE-B… This research was conducted in collaboration with AtCoder Inc. (@atcoder). We are deeply grateful for their outstanding expertise and contributions in optimization and algorithms, which were invaluable in providing data, analyzing results, and enabling our AI agent’s participation in their contests.

English

355

52.6K

Keşfet

@kamesh_ai @leonlufkin @giffmana @SwayStar123 @Nick__Alonso @jm_alexia @TheGregYang @wzhao_nlp