Pres

1.9K posts

Pres

@spnichol

Machine learning/NLP @finance industry. Cats.

DF/Austin Katılım Şubat 2011

3.7K Takip Edilen394 Takipçiler

Pres retweetledi

Aaron Bergman 🔍@AaronBergman18·18 May

Nobody: Somebody somewhere occasionally: uses the term "smoke test" very occasionally Claude Opus 4.7 every other sentence: let's run a smoke test

English

2.1K

100.9K

Pres retweetledi

clem 🤗@ClementDelangue·13 May

As President Trump meets President Xi this week, a call to the American AI community: If your startup, lab, non-profit or company benefits from open international AI - especially Chinese (Deepseek, Qwen, Kimi, GLM,…), please share! Open source is the most important driver of competition, jobs and wealth creation in AI today. Let’s support and promote it at critical times like this week!

English

543

78.7K

Pres@spnichol·12 May

@rogerw0108 I ♥️ vLLM

English

Roger Wang@rogerw0108·12 May

There's a misconception floating around that vLLM is reliable and versatile but doesn't push the limits on the latest hardware or low-latency use cases. Well, numbers speak louder than words :)

vLLM@vllm_project

vLLM tops the Artificial Analysis leaderboard 🎉 vLLM tops @ArtificialAnlys on DeepSeek V3.2 and ranks among the top deployments of MiniMax-M2.5 and Qwen 3.5 397B. The leading deployments of these models are now open source. How each result was built: 🔹 DeepSeek V3.2 — Aggressive op fusion across the attention path collapsed ~33 per-layer kernels down toward ~10. 🔹 MiniMax-M2.5 — Custom EAGLE3 draft trained against the target's own token distribution via TorchSpec, plus a custom QK-norm fusion for MiniMax's TP-aware attention. 🔹 Qwen 3.5 397B — Targeted fusions plus a QK-norm fix for Qwen's linear-attention path. Every optimization is in vLLM main or on its way upstream. Huge thank you to @inferact, @digitalocean, @nvidia, @RedHat_AI, and the vLLM community 🙏 Full breakdown 👇 vllm.ai/blog/vllm-tops…

English

2.2K

Pres@spnichol·2 May

@sakurayukiai Would love to hear more!

English

462

Sakura Yuki@sakurayukiai·2 May

I did not expect 1-bit ternary kernels to be this clean. Since the weights are literally just -1, 0, or 1, it completely bypasses matrix multiplication for basic addition. We're running 70B models in 9GB of VRAM and the math is stupidly elegant ✨

English

5.5K

Pres@spnichol·12 Nis

@ecom_joseph There are definitely bad days. Dealing with a totally different level of competence.

English

195

Joseph Siegel@ecom_joseph·11 Nis

Yeah Claude Opus 4.6 is nuked RIP

English

152

2.2K

757K

Pres@spnichol·16 Nis

Does anyone have a @getairchat invite?

English

Pres retweetledi

François Chollet@fchollet·7 Oca

"Humans are conscious; this big curve fitted on tons of human-generated outputs can reproduce human-like behavior in some cases; therefore this big curve is conscious" has got to be some of the most mindless, most hubristic reasoning I've ever seen.

English

126

754

96.4K

Pres retweetledi

Chris Albon@chrisalbon·21 Eyl

This is still the best AI ever generated Salmon jumping in a river

English

215

4.6K

55.6K

3.2M

Pres@spnichol·29 May

@chrisalbon ooo What platform is this?

English

145

Pres retweetledi

DeepAI@DeepAI·29 Nis

🔥Lowkey Goated When Clustering is The Vibe 🔥 Check out this new Perception-Based Clustering Model for Scattered Data by @Pedro_He_Ca and others! #DataScience deepai.org/publication/cl…

English

1.5K

Pres@spnichol·6 Nis

@random_walker Yeah you're correct, it may be because the machine-readable abstract from that website doesn't mention Louisiana, so certainly a win for GPT.

English

340

Arvind Narayanan@random_walker·6 Nis

@spnichol Ha! I actually tried Google first, and used something similar to the query I gave ChatGPT. But Google didn't parse the "one of the Southern states" phrase correctly and gave me papers that had the phrase "Southern states" in the title.

English

9.3K

Arvind Narayanan@random_walker·6 Nis

Like everything about ChatGPT, the fake citation issue is complicated. Yes, it often makes them up. But it has memorized the details of millions of real papers, so it's an excellent tool — better than search — for finding papers you've encountered long ago and vaguely remember.

English

446

234.4K

Pres@spnichol·6 Nis

@random_walker To be fair, this is the first search result when searching for "paper by economists showing judges were biased based off emotional state" on Google. However, having the openAI summary is nice.

English

214

Arvind Narayanan@random_walker·6 Nis

In case you're wondering, the citation and description that ChatGPT gave are correct. aeaweb.org/articles?id=10…

English

7.2K

Pres retweetledi

Mike Solana@micsolana·15 Mar

one interesting thing about financial crises is the way they prove whatever political beliefs you already have

English

304

47.6K

Pres retweetledi

Dr. Jonathan N. Stea@jonathanstea·23 Ara

Whether the topic is: - Climate change - GMOs - Nuclear power - Vaccines - Homeopathy - Astronomy - Evolution - or Covid-19... ...People who disagree most with the scientific consensus know less about the topic, but they think they know more. science.org/doi/10.1126/sc…

English

116

533

1.7K

1.1M

Pres retweetledi

Retro Tech Dreams@RetroTechDreams·25 Ara

80s moms watching you open your Christmas presents

English

226

2.6K

125.7K

Pres retweetledi

maple cocaine@maplecocaine·5 Şub

Olive oil, sea salt, red pepper flakes... Oregon has (unwisely) decriminalized bein Italian

New York Post@nypost

Oregon law decriminalizing all street drugs goes into effect trib.al/VYT2GaF

English

278

9.5K

83.1K

Pres@spnichol·23 Ara

@Adriaac

GIF

QME

Pres@spnichol·1 Ara

@AlmostMedia Parece que la función "traducir este Tweet" está rota. ¿Alguien más que pueda corroborar?

Español

Julie Fredrickson@AlmostMedia·1 Ara

It appears that the “translate this Tweet” function is broken. Anyone else able to corroborate?

English

Pres retweetledi

Natalie Wolchover@nattyover·30 Kas

Physicists have used Google's quantum computer to send a signal through a wormhole, a shortcut in space-time first theorized by Einstein and Rosen in 1935. The landmark experiment was published today in Nature. Lots to say about it. Here's my deep dive: quantamagazine.org/physicists-cre…

English

148

1.3K

6.1K

Pres retweetledi

Jean de Dieu Nyandwi@Jeande_d·25 Kas

TorchScale - A Library for Transformers at (Any) Scale TorchScale is a library that allows AI researchers and developers to design large-scale transformer models and train them efficiently. It has nearly all the building blocks of transformer models. github.com/microsoft/torc…

English

277

Keşfet

@rogerw0108 @sakurayukiai @ecom_joseph @getairchat @chrisalbon @Pedro_He_Ca @random_walker @elonmusk