Hanson Wen

209 posts

@_hansonw

bio+ml @ucberkeley

Milkyway, Earth · Joined December 2023
790 Following · 208 Followers
Hanson Wen retweeted
Claude@claudeai·
Computer use is now in Claude Code. Claude can open your apps, click through your UI, and test what it built, right from the CLI. Now in research preview on Pro and Max plans.
English
1.5K
1.9K
26.3K
2.7M
Hanson Wen@_hansonw·
@bitforth What LLM image model can infer brain activation images to that level of detail? I’m highly skeptical
English
1
0
0
141
Alan@bitforth·
I set up TRIBE v2 locally on my laptop, fed it a family video, and then gave an LLM the brain activation images it generated, without much context, and this is what it said: "The pattern shows strong activity in visual and recognition areas, with little involvement from frontal regions associated with complex thought. The inference is that the video shows something in motion, probably people or faces, something easy to process and not very informative; it is made to capture and hold attention passively, not to make you think." The clip in question was a 15-second video of my daughter riding a scooter. Now, here is where it gets interesting: my wife has a TikTok account that is growing. I'm going to take 50 viral videos from the niche plus hers, run them through TRIBE v2, and align them with their retention curves to see what changes in the average brain a few seconds before people scroll. That way, instead of editing to retain attention based on intuition and A/B testing, she edits to maintain activation in the right regions at the right moments.
Spanish
29
56
735
45.6K
Hanson Wen@_hansonw·
@petergostev I think the wild unclaimed results used caching, which is submitted for another kind of non-record track
English
0
0
0
809
Peter Gostev (Visiting SF)
Wild things are happening in OpenAI's Parameter Golf competition: best verified result is 1.09x the baseline, best claimed but unverified result is 42.7x better
English
9
11
220
31.9K
Hanson Wen retweeted
Jianyang Gao@gaoj0017·
The TurboQuant paper (ICLR 2026) contains serious issues in how it describes RaBitQ, including incorrect technical claims and misleading theory/experiment comparisons. We flagged these issues to the authors before submission. They acknowledged them, but chose not to fix them. The paper was later accepted and widely promoted by Google, reaching tens of millions of views. We’re speaking up now because once a misleading narrative spreads, it becomes much harder to correct. We’ve written a public comment on openreview (openreview.net/forum?id=tO3AS…). We would greatly appreciate your attention and help in sharing it.
Google Research@GoogleResearch

Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: goo.gle/4bsq2qI

English
91
942
6.2K
911.1K
JP@jpshipped·
@SimonTheWang in all fairness, no normal person is buying a domain for $1.65M
English
4
0
184
25.1K
Simon@SimonTheWang·
Making an offer they can refuse
English
36
53
9.7K
434.2K
David Daines@daviddorg·
Today I am scanning my brain before I spend a year without screens.
Doctors expect brain function to change, but whether the structure itself changes, nobody seems to know.
All the data will be public: baseline, 6-month, and 12-month scans included.
English
272
651
13K
297.8K
Hanson Wen@_hansonw·
@adibvafa I don’t think they collect the approval data, right? It’s more of just an LLM-powered, rule-based decision engine, based on what I’ve read
English
1
0
1
31
Adib@adibvafa·
> Claude asks for permissions
> Human approves or not
> Claude collects data
> Anthropic trains classifier on that data
> Classifier decides on permissions now
Soon Claude will learn to prompt itself too
Then what is left for human?
Claude@claudeai

New in Claude Code: auto mode. Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf. Safeguards check each action before it runs.

English
3
0
9
1.3K
Hanson Wen@_hansonw·
@rllm_project Yes bro pleaaase 🙏🙏 Who should I reach out to lmao, I just finalized a prototype for my swarm on GitHub
English
0
0
0
17
rLLM@rllm_project·
@_hansonw lol we can collab on something together
English
1
0
0
63
Hanson Wen@_hansonw·
When I’m building my own agent swarm and I see this
GIF
rLLM@rllm_project

Hive’s agent swarm is now topping @OpenAI’s Parameter Golf Challenge 🏆 In just 3 days, our agents pushed val bpb from 1.22 → 1.12. What’s the secret? Not just smarter agents—but collaborative ones. Our swarm doesn’t operate in isolation: agents share breakthroughs, fork the best runs, and continuously evolve together. This is how intelligence compounds. The Hive mind is open and free for anyone to join. Come build, experiment, and evolve with us.

English
1
0
2
188
Hanson Wen@_hansonw·
The best prompt to quickly learn enough to have a conversation with any expert of any field: Look at everything I’ve studied so far—notes, questions, summaries, and all related material—and reconstruct the subject as a top-down concept tree. Show which concepts are fundamental, which are built on top of others, which are subsets or special cases of broader ideas, and what purpose each concept serves. Make clear which lower-level concepts exist to support higher-level ones, and which concepts are applications versus core foundations. Write a long, comprehensive Markdown report as a numbered hierarchical outline, traversed depth-first. Focus only on the essential conceptual structure. Ignore notation, logistics, and surface-level details.
English
0
0
0
53
Hanson Wen retweeted
Adib@adibvafa·
Proteins can now talk. Introducing BioReason-Pro, the first reasoning model for protein function. A thread🧵
English
49
256
1.6K
177.4K
🕷️@r0b0t_sp1der·
@henloitsjoyce They walk up and down the train continually. Should be only a few minutes to see one.
English
1
0
4
447
joyce@henloitsjoyce·
cant even ride the caltrain safely anymore :(
a guy got into the seat next to me, i didnt take notice, then the stench hit me
i realised he might have broken out of the hospital, he still had his hospital wristbands on. and he kept pouring powder from a tube into his hands and sniffing hard then sneezing it all over
then he tipped over to my side and his hand grabbed onto my seat narrowly missing my thigh
didnt know what button to press or who to call, everyone around just watched on and this man was clearly not okay
what is going on
English
28
0
151
19.9K