Nick Levine

686 posts

Nick Levine banner
Nick Levine

Nick Levine

@status_effects

training vintage language models

Katılım Kasım 2024
1.2K Takip Edilen2.7K Takipçiler
Sabitlenmiş Tweet
Nick Levine
Nick Levine@status_effects·
New work with @AlecRad and @DavidDuvenaud: Have you ever dreamed of talking to someone from the past? Introducing talkie, a 13B model trained only on pre-1931 text. Vintage models should help us to understand how LMs generalize (e.g., can we teach talkie to code?). Thread:
English
171
367
2.9K
1M
Nick Levine
Nick Levine@status_effects·
on a research cluster I use you can clearly see utilization spike ahead of conference paper submission deadlines. If a futures market takes off get ready for conference cycle seasonality quant systems (although probably a subtle pattern / drop in the bucket in the grand scheme of things)
English
0
1
6
6.8K
Joe Weisenthal
Joe Weisenthal@TheStalwart·
What is the best argument for the "compute" market will evolve in such a way that there will be an actively traded market for capacity, rather than just long-term bilateral contracts between inference providers and GPU owners? (Assuming those remain distinct categories)
English
36
2
68
16.4K
evan conrad
evan conrad@evanjconrad·
oh yeah, we removed them because people kept getting confused our order book supports pricing all of these differently: - 100 nodes colocated for 3months starting in 1 week - 100 nodes non-colocated for 3 months starting in 1 week - 100 nodes starting whenever, for 3 months min - 100 nodes starting RIGHT NOW, for 3 months - 100 nodes starting RIGHT NOW, for 1 year sometimes it was flat in early SFC because a seller isn't being non-optimal and the book wasn't liquid enough for someone to take advantage of the arb
English
1
0
3
450
evan conrad
evan conrad@evanjconrad·
we build gpu clusters sometimes we take over other people's clusters like a property manager (just like other clouds do) we let you buy large, long-term contracts that let you sublease to let you sublease, we built an order book we built our cloud around the order book we are not a brokerage, we are not a gpu reseller
evan conrad@evanjconrad

we've done such an objectively terrible job at explaining ourselves at sfcompute, this is the year we fix that

English
11
3
156
18.9K
Nick Levine
Nick Levine@status_effects·
turbopuffer tagline should be 'puff around and find out' @turbopuffer
English
3
0
13
1.6K
Cheng Lou
Cheng Lou@_chenglou·
We’ll look back at Talkie as one of the newer pieces of AI-native art, alongside HP Belanciaga, golden gate Claude, spiral, pit and the rest
English
2
0
22
3.3K
GwynTel™
GwynTel™@gwyntel·
@xlr8harder If we can make a talkie user group, I'd love to join. I have about 15k rows of synthetic data generated from TalkieLM.
English
2
0
6
35
Nick Levine retweetledi
xlr8harder
xlr8harder@xlr8harder·
I wanted to play with the Talkie 1930 models, but they weren't packaged in a convenient transformers format, so I had codex convert them. They can also now be used with vllm transformers backend. Here they are, in case it's useful to anyone else: huggingface.co/collections/xl…
English
3
6
45
3.1K
Joe Weisenthal
Joe Weisenthal@TheStalwart·
Joe Weisenthal tweet media
The Book Brain@TheBookBrain_

@TheStalwart I went to a concert where songs played from speakers prior to the main act coming on stage. They played What's Up by 4 Non Blondes and the entire place sang along at the top of their lungs. It was the best moment of the concert better than the main act.

ZXX
4
4
95
26.5K
👶🏻🙌🏻
👶🏻🙌🏻@arthurpostingg·
@status_effects @MattZeitlin words typed by a man who has never had the displeasure of taking an avanti west coast service (39.9% percent “on time” between 2024-2025)
English
1
0
1
89
Matthew Zeitlin
Matthew Zeitlin@MattZeitlin·
Was spending some time on Google Maps and all the non London cities you’ve heard of in England are really close to each other by US standards
English
21
0
487
56.6K
Nick Levine
Nick Levine@status_effects·
talkie is a goblin skeptic.
Nick Levine tweet media
English
4
0
51
2.3K
Nick Levine
Nick Levine@status_effects·
This one had me on the edge of my seat
Nick Levine tweet mediaNick Levine tweet media
English
0
0
29
3.1K
Nick Levine
Nick Levine@status_effects·
@AndyMasley biang biang noodles at xian impression. falafel sandwich at mr falafel if in shepherds bush for some reason
English
0
0
2
308
Andy Masley
Andy Masley@AndyMasley·
Would appreciate recs for things to do in London, especially where the good vegan food is
English
49
1
59
19.3K
Egg Syntax
Egg Syntax@eggsyntax·
@status_effects @AlecRad @DavidDuvenaud (I say thanks especially because I can imagine just how much tedious work must have been involved in this particular project, and you conscientiously slogging through that for months is a real gift to the research community 🙏)
English
1
0
1
67
Nick Levine
Nick Levine@status_effects·
New work with @AlecRad and @DavidDuvenaud: Have you ever dreamed of talking to someone from the past? Introducing talkie, a 13B model trained only on pre-1931 text. Vintage models should help us to understand how LMs generalize (e.g., can we teach talkie to code?). Thread:
English
171
367
2.9K
1M
Tibo
Tibo@thsottiaux·
@gao_zibo All of this and more is coming
English
214
83
3.3K
492K
Zibo Gao
Zibo Gao@gao_zibo·
codex mac app is winning SO HARD. just need: - native editor - iOS app - full browser - openclaw then it might be the home default app
English
44
23
1.6K
126.8K
BO
BO@bo_austin_·
I love the idea of framing ‘the public learning about historical events not previously discussed’ as ‘effectively invented, in the public imagination’. it’s honestly so fascinating. it’s sort of a new type of being stupid.
AnechoicMedia@AnechoicMedia_

The "Tulsa Race Massacre" was effectively invented, in the public imagination, in October 2019, when the show Watchmen premiered with a fictionalized and exaggerated depiction of innocent blacks being bombed by whites. The term was practically nonexistent before then.

English
7
106
1.3K
22.2K