Deric Dinu Daniel

225 posts

Deric Dinu Daniel banner
Deric Dinu Daniel

Deric Dinu Daniel

@dericdinudaniel

electronic musician + silicon swe @apple. @umich ‘25

Michigan Katılım Mart 2018
385 Takip Edilen145 Takipçiler
Deric Dinu Daniel
Deric Dinu Daniel@dericdinudaniel·
@mihir__arya dropping a sem to work at @Bose was easily one of the best decisions i ever made in college. glad to see more ppl follow through with it
English
1
0
1
11
Deric Dinu Daniel retweetledi
Zedd
Zedd@Zedd·
@b3kholing I apologize for having so many hits
English
166
696
14.8K
428.9K
Deric Dinu Daniel
Deric Dinu Daniel@dericdinudaniel·
there are about 2000 unique versions of instagram because meta decides to molest every user with feature flags from hell. all 3 of my insta accounts have uniquely different UIs. half the features don't work. fuck you @Meta
English
0
0
2
79
Riley Brown
Riley Brown@rileybrown·
Naval was right on all three counts.
Riley Brown tweet mediaRiley Brown tweet mediaRiley Brown tweet media
English
32
37
621
96.2K
Ayush S
Ayush S@ayushswrites·
We just launched the Warp mobile app Our most requested feature ever. Thank you to everyone who's been texting us about this.
English
33
12
280
1.2M
Deric Dinu Daniel
Deric Dinu Daniel@dericdinudaniel·
Had to use a Wind*ws 🤢 computer today and already got blessed with peak MicroSlop upon logging in. @Microsoft genuinely f u, pls fire every pm in the windows org
Deric Dinu Daniel tweet media
English
0
0
2
140
Alexander Whedon
Alexander Whedon@alex_whedon·
@JasOberoiTweets The motivation to compare against Flash Attention was actually to set the bar higher. Sparse attention reduces FLOPs, but the actual speedup is typically lower and sometimes negative. We wanted to show that the speedup was not only theoretical (FLOPs) but implemented.
English
2
0
7
3K
Alexander Whedon
Alexander Whedon@alex_whedon·
Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), And the first frontier model with a 12 million token context window which is: - 52x faster than FlashAttention at 1MM tokens - Less than 5% the cost of Opus Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.
English
1.5K
2.9K
23K
12.8M
Alexander Whedon
Alexander Whedon@alex_whedon·
@daniel_mac8 Great question. We have quite a bit more in the pipeline that I would love to talk about when the time is right. In the short-term we expect to fit in with the others. We have a vision for the longer term that is quite divergent from the space as it is today.
English
1
0
8
959
Dan McAteer
Dan McAteer@daniel_mac8·
SubQ is either the biggest breakthrough since the Transformer... > 52x faster than FlashAttention at 1mm tok context > 20x cheaper than Opus ...or it's AI Theranos. Requested early access so hopefully can investigate soon.
Dan McAteer tweet media
Alexander Whedon@alex_whedon

Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), And the first frontier model with a 12 million token context window which is: - 52x faster than FlashAttention at 1MM tokens - Less than 5% the cost of Opus Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.

English
53
50
941
140.1K
Deric Dinu Daniel
Deric Dinu Daniel@dericdinudaniel·
just hit $10+ in my brokerage account at 22 years old
Deric Dinu Daniel tweet media
English
1
0
4
192
Deric Dinu Daniel retweetledi
Jonathan Cloud
Jonathan Cloud@JohnCloudThe1st·
Fuck you if you use Suno.
English
21
116
891
23.3K
Deric Dinu Daniel retweetledi
Dev Kunjadia 🇺🇸
Dev Kunjadia 🇺🇸@dev_kunjadia·
Introducing LoshiAttention — a major breakthrough in LLM intelligence. We built the first model powered by O(1) transformer attention.* Traditional transformers waste enormous amounts of compute evaluating every possible token relationship. In practice, almost none of those interactions matter. LoshiAttention ignores the noise. Instead of processing everything, the model just picks whichever token looks the coolest and locks in. The result: Infinite theoretical scalability > 12 million token context window > 52x faster than FlashAttention at 1M tokens > Costs basically nothing to run > Runs on a toaster >Vibes-based inference pipeline This is the future of AI architecture.
Dev Kunjadia 🇺🇸 tweet media
English
8
1
17
470