spacy

16.2K posts

spacy banner
spacy

spacy

@dosco

LLM research, systems and compilers | ax + dspy in TS | agent engineering

Beigetreten Temmuz 2008
1.6K Folgt4.9K Follower
spacy
spacy@dosco·
bro why the hate i thought we was family
the tiny corp@__tinygrad__

@Bencera The only people who think AI is really good at things is people who aren't that good at things themselves.

English
0
0
0
51
spacy retweetet
Jack Altman
Jack Altman@jaltma·
There’s a lot of alpha in putting your ego aside by being willing to be cringe, willing to fail in public, willing to ask for what you want and face rejection, etc.
English
162
387
3.7K
293.4K
spacy retweetet
Omar Khattab
Omar Khattab@lateinteraction·
guess what NVIDIA used here for an "attention-based encoder-decoder to retrieve directly from its own internal representations"? late interaction is sparse attention
Omar Khattab tweet media
Sumit@_reachsumit

Retrieval from Within: An Intrinsic Capability of Attention-Based Models NVIDIA enables encoder-decoder models to perform retrieval directly through their own cross-attention mechanism, eliminating the need for a separate retriever. 📝 arxiv.org/abs/2605.05806

English
5
16
183
15K
spacy retweetet
Nathan Lambert
Nathan Lambert@natolambert·
Work led by @jacobcares showed that little compute for building an LLM is actually in the final runs. The vast majority of compute goes to developing a recipe. Creating the recipe openly is a huge lever in making sure the research community's compute pushes to new knowledge.
Nathan Lambert tweet media
Ai2@allen_ai

Today we’re bringing new NSF OMAI compute online with NVIDIA Blackwell Ultra-powered systems, turning a $152M national investment from @NSF & @NVIDIA into a foundation for truly open AI research. 🧵

English
5
16
108
16.2K
spacy
spacy@dosco·
for a conversational bot even sonnet is kinda dumb only at the opus level do you feel the magic
English
0
0
2
177
spacy
spacy@dosco·
till recently i was on an iphone 11 (no case) and still am on a m1 pro both functioning like new. apple hardware is unmatched.
English
0
0
0
91
Ara Ghougassian
Ara Ghougassian@araghougassian·
list of shit canada doesn't need - summits - endless debate - innovation centers - government intervention - government purchased ai data centers - tech conferences sponsored by boomer companies things we need - cracked founders maximizing shareholder value
English
20
8
127
4.8K
spacy
spacy@dosco·
@E_FutureFan compared to nature we're all just in amateur mode
English
0
0
0
26
Erika S
Erika S@E_FutureFan·
@dosco I'm wondering if my brain is just bunching inference requests to avoid rate limits. At 10,000x scale, energy stability becomes the key career constraint; some jurisdictions clearly grasp this better.
English
1
0
0
19
spacy
spacy@dosco·
one of the reason coding models can do so much is also because we're at a point where our infra is solid, great scalable cloud platforms, mature data systems and stable libraries for almost anything. as zuck put it "move fast on stable infra"
English
1
0
3
163
spacy
spacy@dosco·
@pmddomingos and humans disaggregate it again into slop
English
0
0
1
69
Pedro Domingos
Pedro Domingos@pmddomingos·
The Internet disaggregated information. AI reaggregates it.
English
21
9
94
4.6K
spacy
spacy@dosco·
i would not mind a fully loaded m5 ultra max studio whenever that drops
English
0
0
1
124