spacy

16.2K posts

spacy banner
spacy

spacy

@dosco

LLM research, systems and compilers | ax + dspy in TS | agent engineering

เข้าร่วม Temmuz 2008
1.6K กำลังติดตาม4.9K ผู้ติดตาม
spacy รีทวีตแล้ว
Omar Khattab
Omar Khattab@lateinteraction·
guess what NVIDIA used here for an "attention-based encoder-decoder to retrieve directly from its own internal representations"? late interaction is sparse attention
Omar Khattab tweet media
Sumit@_reachsumit

Retrieval from Within: An Intrinsic Capability of Attention-Based Models NVIDIA enables encoder-decoder models to perform retrieval directly through their own cross-attention mechanism, eliminating the need for a separate retriever. 📝 arxiv.org/abs/2605.05806

English
3
13
121
8.4K
spacy รีทวีตแล้ว
Nathan Lambert
Nathan Lambert@natolambert·
Work led by @jacobcares showed that little compute for building an LLM is actually in the final runs. The vast majority of compute goes to developing a recipe. Creating the recipe openly is a huge lever in making sure the research community's compute pushes to new knowledge.
Nathan Lambert tweet media
Ai2@allen_ai

Today we’re bringing new NSF OMAI compute online with NVIDIA Blackwell Ultra-powered systems, turning a $152M national investment from @NSF & @NVIDIA into a foundation for truly open AI research. 🧵

English
4
12
91
12.9K
spacy
spacy@dosco·
for a conversational bot even sonnet is kinda dumb only at the opus level do you feel the magic
English
0
0
2
154
spacy
spacy@dosco·
till recently i was on an iphone 11 (no case) and still am on a m1 pro both functioning like new. apple hardware is unmatched.
English
0
0
0
84
Ara Ghougassian
Ara Ghougassian@araghougassian·
list of shit canada doesn't need - summits - endless debate - innovation centers - government intervention - government purchased ai data centers - tech conferences sponsored by boomer companies things we need - cracked founders maximizing shareholder value
English
19
7
115
4.3K
spacy
spacy@dosco·
@E_FutureFan compared to nature we're all just in amateur mode
English
0
0
0
21
Erika S
Erika S@E_FutureFan·
@dosco I'm wondering if my brain is just bunching inference requests to avoid rate limits. At 10,000x scale, energy stability becomes the key career constraint; some jurisdictions clearly grasp this better.
English
1
0
0
16
spacy
spacy@dosco·
one of the reason coding models can do so much is also because we're at a point where our infra is solid, great scalable cloud platforms, mature data systems and stable libraries for almost anything. as zuck put it "move fast on stable infra"
English
1
0
3
158
spacy
spacy@dosco·
@pmddomingos and humans disaggregate it again into slop
English
0
0
0
60
Pedro Domingos
Pedro Domingos@pmddomingos·
The Internet disaggregated information. AI reaggregates it.
English
20
9
93
4.6K
spacy
spacy@dosco·
i would not mind a fully loaded m5 ultra max studio whenever that drops
English
0
0
1
123
spacy
spacy@dosco·
tachyon was for a research paper that LLMs can vibe code complex infra and be correct and fast it. a side effect of this was that it was faster than nginx and pingora (c and rust) while being written in golang. also its 100% self contained (zero dependencies) and supports http 1 and 2. benchmarks are complex but heres some of the ones we did. github.com/dosco/tachyon/…
spacy tweet media
English
0
0
4
867
Caddy Web Server
Caddy Web Server@caddyserver·
Web servers written in Go are immune to this entire class of vulnerabilities.
Cyber Security News@The_Cyber_News

⚠️ Critical Apache HTTP Server Flaw Exposes Millions of Servers to RCE Attacks Source: cybersecuritynews.com/apache-http-se… The Apache Software Foundation has released a critical security update for Apache HTTP Server, patching five vulnerabilities, including a dangerous double-free flaw capable of enabling Remote Code Execution (RCE) in version 2.4.67, released on May 4, 2026. All users running version 2.4.66 or earlier are strongly urged to upgrade immediately. The most severe of the five vulnerabilities is CVE-2026-23918, rated High with a CVSS base score of 8.8. The flaw is a double-free memory corruption bug triggered within Apache's HTTP/2 protocol implementation during an "early stream reset" sequence. #cybersecuritynews #vulnerability

English
7
13
99
22.1K