brandon wang

342 posts

brandon wang

@fluorane

... | prev undergrad @miteecs and @mitbiology, @cartesia @janestreetgroup

california Katılım Nisan 2021

297 Takip Edilen1.1K Takipçiler

Sabitlenmiş Tweet

brandon wang@fluorane·11 Tem

happy to announce that we've gotten rid of tokenizers! especially excited with what we've replaced them with: end-to-end trainable modules that not only learn to group characters into (sub)words, but can iterate to group words into phrases and further higher-order concepts see @sukjun_hwang's thread for more details 👇

Sukjun (June) Hwang@sukjun_hwang

Tokenization has been the final barrier to truly end-to-end language models. We developed the H-Net: a hierarchical network that replaces tokenization with a dynamic chunking process directly inside the model, automatically discovering and operating over meaningful units of data

English

757

74.7K

brandon wang@fluorane·15h

@0xpiapark @pierrecomputer congrats pia! very hype, good luck :D

English

124

pia@0xpiapark·1d

update: excited to share that im joining @pierrecomputer to build code[dot]storage yay

English

232

15.5K

brandon wang@fluorane·18h

@serenaa_ge very cool benchmark, congrats! failure modes analysis is really interesting

English

268

Serena Ge (Datacurve)@serenaa_ge·21h

Today we’re releasing DeepSWE, a new standard for agentic coding benchmarks. On public leaderboards, top models often look relatively close in capability. DeepSWE shows where they actually diverge, reflecting the realistic experience of developers in their day-to-day work.

English

378

552

4.5K

1.1M

brandon wang@fluorane·18h

@frank_liquid congrats! very smooth experience

English

129

Franklyn Wang@frank_liquid·23h

It’s clear that the future of finance will involve more and more AI. Co-Invest is our first major step in exploring what that future might look like, and we’re incredibly proud of it. More soon. But using Co-Invest has genuinely felt like seeing the future — and I hope it gives you the same feeling.

Liquid@liquidtrading

Introducing Co-Invest. (@coinvestai) The first way to trade directly through ChatGPT and Claude.

English

122

20K

brandon wang@fluorane·1d

@Alan_Ma_ huawei duv is cooking indeed

English

Alan Ma@Alan_Ma_·1d

@fluorane chef's special

English

1.3K

brandon wang@fluorane·1d

the dim sum restaurant im at is showing a semiconductor lithography report from a hong kong tv channel, amazing

English

1.3K

30.4K

brandon wang@fluorane·1d

@JingyiRen @astar_gis congrats!! super exciting

English

315

Rena Ren@JingyiRen·1d

It's official! I will be joining @astar_gis as new PI this year. Looking forward to embarking my independent journey in Singapore! My lab will be at the intersection of genomics and RNA technology. We will be hiring at all levels. Email if you are interested! Check below⬇️ #newPI

English

227

20.5K

brandon wang@fluorane·2d

@johnloeber 7 has always been very interesting to me ever since i read about it here, pretty concerning what the implications for california are x.com/WillManidis/st…

Will Manidis@WillManidis

x.com/i/article/2008…

English

476

John Loeber 🎢@johnloeber·2d

Too many thoughts queued up. Here's 28 of them 1. Having now tweeted in this format many times, the most important thing about it is for the first item to have a great, provocative hook 2. From Zero to One is a distinctly religious saying: from nothing there came to be something. I wonder if Peter Thiel intended this. Feels like it. 3. Sometimes entrepreneurship is just about staying in the market long enough to wait for your competitors to give up. You don't even have to be better than them. 4. Everything that people love about cute European towns would be illegal to build today, both in Europe and in the US. Every cobblestone alley violates a gazillion regulations. Curious. 5. Virtually nothing of value on Twitter is posted in under 140 characters anymore. Feels crazy to look back on that product: so constrained, so much worse. 6. I've looked through a couple hundred of my ChatGPT/Claude queries: almost none of these are monetizable. It's all obscure questions about programming and the ancient greeks. I'm sure ads will do well in this medium for some demographics, but I really wonder about the details. 7. There's an underreported story about the slow decline of Massachusetts. Once upon a time, they were the big contender to Silicon Valley. Best educational attainment, highest incomes, life expectancy, and so forth. Not anymore. 8. It's interesting that the calls to "tax the rich" are just as strong in poor places as in rich ones. Redistributive norms are even strong in the poorest places. It seems exceedingly rare for people to say "actually we have no money, we need to pool and compound the little we have." 9. You can tell when a growth-stage company slowly loses steam. The all-in-person C-suite goes remote (never works), they open offices of B-players, the social media posts become more corporate and get less engagement... you can feel everyone's motivation slipping in real time. 10. Everyone loves to hate on the TSA, but it is noteworthy that pre-9/11, there were dozens of plane hijackings a year, and now basically zero. 11. There's a Bay Area town called "Vacaville" which I suppose means "Cow Town" to speakers of Spanish 101, but it was actually named after a Mr. Vaca. 12. Geoguessr, but for Mars. 13. I've been renovating my house, which has shown me a broad category of goods that are simply not available on Amazon, despite having big market sizes. Specific types of cables, interior tubes, drywall, pipes, etc.? Need to go to specialist websites that look like they're from 1995. I sense a market gap. 14. It is weird that basically all keyboards have the same key sizes. There are none with small keys for children, for example. Or big keys for adults with large hands. I want to buy a keypad with larger keys that i can hit more easily while I'm working out, but no such thing exists. 15. Culture is now evolving with incredible speed. It used to be that a particularly good joke on Seinfeld or SNL would have a decade-plus staying power. You could reference it with a wink and everyone would know what you mean, for years. Now these memes have ~zero half-life, they hit and then they're immediately gone. 16. Apple Notes gives you folders, but you can't search or filter by folder. Instead you can search by hashtag? It is all so tiresome. 17. The level of fraud and villainy during this AI cycle may now be greater than what we witnessed in crypto in 2021. 18. I think that there's a tremendous danger that people do good work for a few decades to accumulate authority in a field, and then become feeble-minded and out of touch, and use theirs still-overhanging authority to (perhaps unwittingly) do enormous harm. I am thinking of Nicholas Kristof of the NYT. 19. It's underrated how amazing media is in great supply now. Back in the 00s, you had to pore through specialized coffee table books to find amazing landscape photography. Or you'd go online and maybe every now and again you'd see a funny photo and you'd save it. Millions of people would tune in to watch "America's funniest home videos" on television. All of these are totally abundant now. You can find the prettiest photos, the funniest memes, etc. trivially now. The supply is endless when once it used to be scarce. Nobody talks about this. 20. It's ironic how Japanese furniture and fashion design is so minimalist and then the websites are so maximalist 21. I saw a billboard in Chicago for a construction company that said "built by humans who use AI". Interesting contrast to SF. 22. The main challenge of travel is never the length of the first flight. Twelve hours direct? Not bad at all. It's the layover and the second flight where people drop off. 23. Many parts of life expose small games that you can play. There's a game to ordering at restaurants. There's a game to negotiating with your contractors. Some people avoid these games. But often they're fun to play, like a little reward unto themselves. 24. “Experts” spend a long time acquiring status. Once that status has been acquired, they can use it to launder their opinions or preferences as legitimate or substantiated by expertise, which may not be the case at all. 25. I feel a great dearth of objects. When our lives were more in the physical world and less in the digital world, there used to be so many things that you could buy. Now it feels like everyone’s trying to sell me the same few things: phone charging cases, thin wallets, robot vacuum cleaners, etc. This is all very boring. I am surprised by how few things there are that I want to own. 26. An interesting thing about stress is that you can control your psychological response by lowering your physiological response. Getting annoyed? Just think about bringing down your pulse and breathing. 27. My timeline is littered with wannabe erudites trying to ape Will Manidis’ writing style. They’re writing LinkedIn Slop that starts with an AI generated anecdote about Mary Magdalene and I am not here for it 28. People should have to take "modernity appreciation class" in which you spend a week washing your clothes by hand, tilling a field with a mule, or even just eating tinned food typical of the 70s. Many people romanticize the past (frequently conjoined with some kind of anti-tech, pro-communist outlook) from the comfort of their homes and would be well-served by a short course of “what it’s actually like”

English

4.7K

brandon wang@fluorane·2d

@_jamorton

GIF

QME

Jon Morton@_jamorton·2d

@fluorane yes

brandon wang@fluorane·4d

good model

Artificial Analysis@ArtificialAnlys

Cartesia’s Sonic-3.5 takes the #1 spot on the Artificial Analysis Speech Arena Leaderboard, surpassing Inworld Realtime TTS 1.5 Max and Google’s Gemini 3.1 Flash TTS Sonic-3.5 is the latest TTS model from @cartesia . It supports 42 languages, including 9 Indian languages, with 500+ voices available out of the box. The model has been highly preferred among voters in the TTS Arena, with its demonstrated naturalness and accurate transcript following. Key takeaways: ➤ Quality: Sonic-3.5 has an Elo score of 1,218 (+16/-16) based on 1,144 arena appearances, placing it ahead of Inworld Realtime TTS 1.5 Max at 1,194 and Gemini 3.1 Flash TTS at 1,209 ➤ Pricing: Sonic-3.5 is priced at $39/1M characters, a premium compared to Gemini 3.1 Flash TTS at $18.3/1M characters, and Inworld Realtime TTS 1.5 Max at $35/1M characters ➤ Speed: 105.5 characters per second, compared to 205 characters per second for Inworld Realtime TTS 1.5 Max and 26.3 characters per second for Gemini 3.1 Flash TTS See more details and listen to samples below 🧵

English

1.6K

brandon wang retweetledi

Arnav Shah@arnavshah0·3d

Oh actually my bad it's an oral

Arnav Shah@arnavshah0

Excited to announce that dnaHNet has been accepted as an ICML 2026 Spotlight paper! Very grateful to my coauthors @victor_ljz and team, plus our remarkable supervisors @_albertgu and @genophoria.

English

5.2K

brandon wang@fluorane·3d

@_jamorton can you make it better plz

English

Jon Morton@_jamorton·3d

@fluorane it’s ok

English

brandon wang retweetledi

Raj Mathai@rajmathai·5d

Bravo, Elim! @itselimchan named Music Director of @SFSymphony. 39 year-old Hong Kong born conductor becomes among very few woman and Asian Americans to lead a major symphony. Came to #USA to become a doctor…and then fell in love w/ music @UMich. @nbcbayarea

English

7.5K

brandon wang@fluorane·5d

@kothasuhas it seems plausible that you should relax filters as you get more compute, but it can't be categorically true that you should never filter? though this paper probably implies some lower bound about the distribution of quality of CC

English

348

Suhas Kotha@kothasuhas·5d

really really cool work. TLDR: it probably does not make sense to filter _any_ data in the infinite compute regime x.com/tatsu_hashimot…

Tatsunori Hashimoto@tatsu_hashimoto

Some new results I found surprising that I’m tweeting for Chris (who isnt on here). With enough compute, the best data filter for LMs (on DCLM) might be no filter. Why? Large models can tolerate a surprising amount of nominally 'low quality' data, and can sometimes even benefit.

English

110

23.3K

brandon wang@fluorane·6d

@hypersoren i generally agree but ai labs are frankly nowhere near as competitive as finance, which will affect the medium term outcome

English

237

Soren Larson@hypersoren·6d

genuinely id like the sf data for ai class to study the history of the alt data market of finance and suggest why market structure or long term outcome will be any different

English

3.4K

brandon wang@fluorane·19 May

genuinely how do you serve this model at such high speeds on literally every search request

Google@Google

Available today, Gemini 3.5 Flash is the new default model in AI Mode in Search for everyone globally. #GoogleIO

English

378

brandon wang@fluorane·17 May

@sheriyuo don't the proxies also sell traces to the chinese labs?

English

451

Xiuyu Li@sheriyuo·16 May

These proxy sellers and black-market routers are super prone to suddenly collapsing or straight-up running away with the money. That said, Xianyu/Taobao actually give you solid buyer protection — you can even do a 7-day refund-only return. As a student who doesn’t work at a company handing out free tokens and has zero reimbursement path, if someone offered you a router giving Claude quota at just 0.6 CNY per $1 claude worth… would you actually refuse?

aditya@adxtyahq

Chinese students are buying GPT-5.4/5.5 and Claude API access from Xianyu/Taobao proxy sellers for almost 96-97% cheaper People are apparently burning 100M+ tokens a day for like $1 and vibecoding nonstop.

English

7.7K

brandon wang@fluorane·15 May

@bilaltwovec tbh i have no idea what the design space for this would look like, but it seems like it could be really large and also really targetted

English

216

bilal@bilaltwovec·15 May

@fluorane ill probably be proven wrong tho when the next PYTORCH_POWERPLANT_BLOWUP=1 in the vr200 microcode very subtly reduces the bitwidth of your fp4 accumulator if it detects that you're in singapore

English

brandon wang@fluorane·15 May

the malware specialized to target exclusively large llm training runs is gonna be insane holy shit

hanlon’s mortola razr@rhizomaticthot

the fast16 malware was almost certainly targeting spherical implosion simulations. left: unmodified LS-DYNA 970 right: LS-DYNA 970 modified with the relevant portions of fast16.sys both running a spherical implosion deck

English

118

3.6K

158.2K

brandon wang@fluorane·13 May

@finn_hulse sir please

English

1.1K

Finn Hulse@finn_hulse·13 May

if u have an academic-style or otherwise minimalist white-background personal website but u have <4 published papers, you’re larping at an elite level and i hope a larp swat team obliterates you at this point it is the equivalent of a blue-purple gradient vibecoded saas homepage

English

206

17.1K

brandon wang@fluorane·13 May

@finn_hulse oh nice i also missed a1 that year, good times

English

208

Finn Hulse@finn_hulse·13 May

look at these fuckin scores though dawg im regarded 😭😹 missed top 500

English

3.5K

Finn Hulse@finn_hulse·13 May

the only putnam problem i've ever sniped was this A4 on an otherwise dismal exam it was the culmination of everything i've ever loved doing

English

195

26.8K

Keşfet

@0xpiapark @pierrecomputer @serenaa_ge @frank_liquid @Alan_Ma_ @JingyiRen @astar_gis @johnloeber