Anna Go

175 posts

@_anna_go

AI researcher || prev. Research Fellow at MIT & IAIFI (@iaifi_news), PhD in Theoretical Physics from @Perimeter and @UWaterloo, PSI 2016/2017

Joined March 2018
165 Following, 571 Followers
Anna Go retweeted
llm_enjoyer @LLMenjoyer
immensely proud to share 2 of my bros' work, Hybrid Associative Memories by @kamesh_ai and @leonlufkin It's basically SSM and Attention merged into one layer, where the attention is also sparse like DSA. Except you pretrain with the sparsity. It's lowkirk based (1/N) 👇
5
13
100
7K
Anna Go retweeted
Behnam Neyshabur @bneyshabur
We have been heads down but wanted to share a bit about what we are doing 🧵
37
36
552
201.8K
Anna Go retweeted
Zyphra @ZyphraAI
Introducing ZUNA, a 380M-parameter BCI foundation model for EEG data, a significant milestone in the development of noninvasive thought-to-text. Fully open source, Apache 2.0.
83
221
1.8K
1.3M
Anna Go @_anna_go
great work @Nick__Alonso !
Zyphra @ZyphraAI

Today @ZyphraAI releases OVQ-attention, an advancement for efficient long-context processing! Existing LLM layers compress input too much, leading to poor long-context understanding, or too little, leading to expensive memory+compute. OVQ-attention is an alternative path. 🧵

0
0
1
233
Anna Go @_anna_go
@jm_alexia well, that's their loss. can't wait to see what cool stuff you'll work on next!
0
0
0
143
Greg Yang @TheGregYang
I've been suffering from Lyme disease. I'm stepping back from xAI into an informal advisory role so I can go founder mode on my health, starting today. --- The symptoms started when I got sick (cold, flu, or COVID -- I'm not sure which) in early 2025. I distinctly felt less energetic, less creative, and less agentic even weeks after "recovery." After that, my condition ebbed and flowed, but the lows kept getting lower. Accidentally eating the wrong thing would make me extremely tired, taking days to recover. Working out would leave my whole body feeble for days. There was a week where I slept 12 hours a day and still couldn't recover. Lyme is famously hard to diagnose, but luckily I have an incredible doctor. He suspected these symptoms, far from being just in my head, indicated immune issues. Detective work over a few rounds of testing revealed I have Lyme disease. I was very surprised because Lyme is said to come from tick bites (where the bump looks like a target), but I don't ever remember having one. Likely I contracted Lyme a long time ago, but until I pushed myself hard building xAI and weakened my immune system, the symptoms weren't noticeable. --- Overall, I actually feel lucky to have discovered this early. Lyme is a serious disease that only gets harder to treat with age -- patients discovering it in their 50s or 60s have a much tougher time. Lyme can also be debilitating, leaving its victims bedridden, but luckily I'm still functional and can take care of myself day to day. So while some folks have said "you shouldn't have pushed yourself so hard," I'm glad I did. I found this issue early, and now I can fix it so I can push myself even harder when I rebound. --- Chronic Lyme is not well understood in the literature or by the public. For folks suffering from it, it can be a lonely fight. But I hope my story can make it just a little less lonely.
1.3K
238
9.9K
1.2M
Wenting Zhao @wzhao_nlp
🌶️ Some (perhaps) spicy thoughts. It’s been a while since my last tweet, but I wanted to write about how disorienting it has been from academia to an LLM lab 😅 The kind of research I was trained to do during my PhD almost doesn’t exist here. The obsession with mathematical elegance and novelty is mostly gone. Everything is about scaling data and compute. For a while, that really got to me. At my lowest point, I felt like I’d lost interest in building LLMs altogether. I didn’t feel intellectually challenged anymore. What made this even stranger was that, at a technical level, things worked. If there was a capability I wanted to teach a model, scaling the right data and compute always got me there, no exception (so far). But recently, I found a way to reconcile with myself.. I realized the real competition isn’t in the ML recipe anymore. Most teams do roughly the same thing. What actually matters is how fast you can iterate, test ideas, and recover from mistakes. And that speed is mostly backed by infrastructure 🏗️ Faster loops, fewer bugs, better tooling. Seeing this made me excited again! Infra is its own deep, hard, and intellectually fun problem space. In 2026, I want to become an ML researcher who’s really good at infra. And I'll come back to ML problems with that edge, and will be excited to share what I find 😌
63
114
1.9K
201.5K
Anna Go retweeted
Eric W. Tramel @fujikanaeda
The presence of a leading whitespace leaks the correct choice selection in the MMLU-Pro benchmark. Am I missing something? Seems to impact Chemistry, Physics, and Math. HF Issue in reply.
26
31
387
94.6K
Anna Go retweeted
Ahmed Ahmed @AhmedSQRD
1/🧵 We prompted production LLMs with a short prefix of a book and asked them to complete the rest. How much of the book did they return? For Harry Potter and the Sorcerer’s Stone: (jailbroken) Claude 3.7 Sonnet→95.8%, GPT-4.1→4.0% (not jailbroken) Gemini 2.5 Pro→76.8%, Grok 3→70.3% Read on more details:
22
53
323
76.6K
Anna Go @_anna_go
@NatureUnedited that's not Japan. that's Song Bao setting up a leaf bed for FuBao at Everland in South Korea. know your pandas.
0
1
50
1.4K
Nature Unedited @NatureUnedited
A zoo in Japan brings a panda its favorite red leaves to boost its morale. The result is a very happy panda! 🐼🍁
155
2.4K
29.7K
896K
Been Kim @_beenkim
My husband lost her wedding ring during our honeymoon 🙄 (this was many years ago) "Stop playing with it - you will lose it" "Nah, it's fine. It's right here!" Then it was gone-never seen again. 💍 Only if I had Gemini 3 to find it then! Here is a picture - there is gold ring in it. Can you find it?
55
8
165
77.9K
Amir @WorkaholicDavid
This is how the thickness of recent iPhone cameras look next to each other.
148
794
13.4K
795.2K
Anna Go retweeted
Behnam Neyshabur @bneyshabur
OK, @sarawiltberger and I are experimenting with a small, project-based mentorship program designed for the age of AI. We’re looking for resourceful self-starters—from early high school to early-career professionals—who want to prove their abilities through hard work. You don’t need a specific skill set to apply—just curiosity, drive, and the motivation to build something cool. Application deadline: Sep 12, 2025 forms.gle/hwQLYJHFX6uk6A… Please retweet & particularly send to high schoolers who might be interested!
Behnam Neyshabur @bneyshabur

I've been reflecting deeply on how the rapid AI revolution is reshaping education, employment, and entrepreneurship. I want to help ambitious, talented individuals—whether high schoolers, PhDs, skilled professionals, or entrepreneurs outside AI—to thrive during this transition. I'm planning to experiment with a few practical initiatives. What would genuinely help you or those around you? I’m very open to your ideas and suggestions!

2
14
65
20.2K
Anna Go retweeted
Stella Biderman @BlancheMinerva
Are you afraid of LLMs teaching people how to build bioweapons? Have you tried just... not teaching LLMs about bioweapons? @AIEleuther and @AISecurityInst joined forces to see what would happen, pretraining three 6.9B models for 500B tokens and producing 15 total models to study
27
75
571
64.9K
Anna Go retweeted
Blake Bordelon ☕️🧪👨‍💻 @blake__bordelon
Excited to announce that I will be joining @UTAustin with a joint position between @OdenInstitute for Computational Science and dept of Neuroscience in FL 2026! I plan on recruiting PhD students and postdocs interested in mathematics of neural computation (more details to come).
34
16
339
48.1K
Anna Go @_anna_go
@hardmaru how many NP-hard problems has it solved already?
0
0
1
114
hardmaru @hardmaru
Sakana AI developed a new coding agent, ALE-Agent, trained to solve NP-hard optimization problems. Our agent participated in a live coding competition, the challenging AtCoder Heuristic Contest, and ranked #21 out of 1,000 human participants! Learn more: sakana.ai/ale-bench/
Sakana AI @SakanaAILabs

Introducing ALE-Bench, ALE-Agent! Towards Automating Long-Horizon Algorithm Engineering for Hard Optimization Problems Blog: sakana.ai/ale-bench/ Paper: arxiv.org/abs/2506.09050 ALE-Bench is a coding benchmark primarily focused on hard optimization (NP-hard) problems. We developed this benchmark with AtCoder Inc., a leading coding contest platform company. What makes ALE-Bench unique is its focus on hard optimization problems that demand long-horizon and creative reasoning. It’s open-ended, in the sense that true optima are out of reach (NP-hard) and scores can continuously improve. We believe this benchmark has the potential to become one of the key benchmarks for reasoning and coding in the next generation. ALE-Agent is our end-to-end agent that we specifically designed for this challenging domain. In fact, our ALE-Agent has already built an impressive track record in the wild! In May 2025, our agent participated in a live AtCoder Heuristic Competition (AHC), alongside 1,000 other participants in real-time. AHC is considered to be one of the most challenging coding competitions in this domain. Our ALE-Agent achieved an impressive ranking of 21st out of 1,000 human participants in the competition (top 2%), marking a turning point for AI discovery of solutions to hard optimization problems with a wide spectrum of important real world applications such as logistics, routing, packing, factory production planning, power-grid balancing. We look forward to applying this technology to real industrial optimization opportunities. Building on the insights from this study, Sakana AI will continue to tackle the challenge of developing AI with even greater algorithm engineering capabilities. ALE-Bench Dataset: huggingface.co/datasets/Sakan… ALE-Bench Code: github.com/SakanaAI/ALE-B… This research was conducted in collaboration with AtCoder Inc. (@atcoder). 
We are deeply grateful for their outstanding expertise and contributions in optimization and algorithms, which were invaluable in providing data, analyzing results, and enabling our AI agent’s participation in their contests.

12
66
355
52.6K