danfiru

2K posts

danfiru banner
danfiru

danfiru

@danfiru

Cofounder & Product @ Quadric. More with More.

Burlingame, CA Katılım Mart 2009
306 Takip Edilen327 Takipçiler
Sabitlenmiş Tweet
danfiru
danfiru@danfiru·
an engineer ported DOOM over the weekend. 36fps on a 50MHz fpga emulation. your NPU runs neural networks. ours runs programs. full article in the comments
English
2
1
53
29K
danfiru
danfiru@danfiru·
@_TommyMason i'll take the leaf and a rack of npus for tokens at home.
English
0
0
0
988
Tommy
Tommy@_TommyMason·
Left, Ferrari Luce $645k Right, Nissan Leaf $35k
Tommy tweet mediaTommy tweet media
English
964
2.8K
29.7K
3.7M
danfiru
danfiru@danfiru·
@brfootball Dean Huijsen is the only snub, tbh. Madrid have always been a more international club than Barcelona.
English
0
0
0
263
B/R Football
B/R Football@brfootball·
Not a single Real Madrid player has made Spain’s World Cup squad 🇪🇸
B/R Football tweet media
English
1.7K
10.4K
98.7K
4.8M
danfiru retweetledi
Fahd Mirza
Fahd Mirza@fahdmirza·
🦙 llama.cpp now has a BUILT-IN model router ♠ and it completely replaces Ollama + Open WebUI for model switching 🔹 One server, one config file, any model on disk 🔹 Switch models instantly without restarting anything 🔹 Zero duplicate model storage across backends 🔹 Full per-model control via a simple INI file 🔹 Native llama.cpp performance, no abstraction layer 🔥 Watch the full video below 👇 youtu.be/V2t_YRsyqeI
YouTube video
YouTube
English
21
66
547
46.4K
danfiru retweetledi
Jake Fitzgerald
Jake Fitzgerald@earthtojake·
new release for text-to-cad, an open source CAD harness and skills for codex / claude: - mechanism validation (go from text prompt to functional mechanical design) - parameters + animations for step files - extended sdf, srdf, urdf support 3k stars, 10k downloads, we cooking
English
51
251
2.7K
244.5K
danfiru
danfiru@danfiru·
Congrats to Arsenal.
English
0
0
0
135
danfiru
danfiru@danfiru·
Anthropic targeting third party harnesses is annoying.
English
0
0
0
20
Elon Musk
Elon Musk@elonmusk·
@beffjezos Our recently completed Grok V9 1.5T run is looking great and that is before Cursor data is added in supplemental training
English
377
285
4.4K
598.3K
Beff (e/acc)
Beff (e/acc)@beffjezos·
Impressions so far: Grok Build interface is actually really nice. Now as soon as xAI has a SOTA model, it could very well become competitive with Codex / Claude Code overnight
English
69
69
1.6K
229K
danfiru retweetledi
Bindu Reddy
Bindu Reddy@bindureddy·
Gemini 3.2 Flash - Capitalizing on DeepMind's clever distillation techniques... Rumors are that benchmarks show it's hitting 92% of GPT 5.5's performance on coding and reasoning tasks while being 15-20x cheaper on inference costs. The latency improvements are insane - sub-200ms for most queries. Google's distillation + sparsity techniques are paying off massively. They've essentially compressed a frontier model into a flash variant without the usual quality cliff.
English
159
186
3.7K
919.7K
danfiru retweetledi
David Hendrickson
David Hendrickson@TeksEdge·
💥 What is this beast? Skymizer HTX301 is LLM 🛸👽👇 One PCIe card w/ 📦 384 GB memory ⚡ 240W TDP 🧠 Runs 700B LLMs locally Vs NVIDIA RTX 6000 Ada: 48 GB • 300W • ~$7,500 Vs RTX PRO 6000 Blackwell: 96 GB • 600W • ~$8,500 HTX301 delivers 8× memory at less than half the power and specialized LPU inference beast for on-prem AI. 🔥 No clusters. No NVLink. Just plug & infer. Pricing TBA • Early access open now
David Hendrickson tweet media
English
34
33
322
52.8K
danfiru retweetledi
Alexander Whedon
Alexander Whedon@alex_whedon·
Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), And the first frontier model with a 12 million token context window which is: - 52x faster than FlashAttention at 1MM tokens - Less than 5% the cost of Opus Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.
English
1.5K
2.9K
23K
12.8M
danfiru retweetledi
Sudo su
Sudo su@sudoingX·
i get this question a lot so here is the answer everyone running hermes agent or any local agent should hear: tmux is the separation layer. cheapest, simplest, most reliable way to keep agent contexts from bleeding into each other. i run a lot of hermes sessions in parallel. one per project, one per active model, sometimes both. each session has its own working directory, its own memory context, its own conversation thread. the work session, the personal session, and the client session never see each other. a typical day on my main box has 6 to 10 hermes sessions running at any given time. coding project here, research session there, content drafting in another, telegram gateway routing requests in a fourth, model benchmarks in a fifth. zero overhead to switch, zero risk of context bleed. you do not need docker, a second machine, or elaborate workflow tooling for this. tmux plus a clear naming convention plus one hermes per session is the whole setup. the tools have been there the whole time, most people just have not connected them.
Sudo su tweet media
Nemanja@Nemanjadotcom

@sudoingX How do you organize projects and separation? Like would you use the same instance for managing work and personal things?

English
29
33
448
30.3K
danfiru retweetledi
Camus
Camus@newstart_2024·
This MRI study on young kids just exposed something terrifying: They scanned the brains of 60 children aged 3–5 — including 5-year-old Rose — and found interactive screen time is causing measurable loss of white matter in their developing brains. Even just 2 hours a day is linked to impaired neural connectivity, language, and literacy development. Professor Mike Nagel (neuroscientist and father) said his first reaction was simply: “Wow… I was not anticipating seeing anything like that.” We’re physically changing children’s brains before they even start school — and the damage is visible on scans. This one actually unsettled me. I’ve always suspected too much screen time was bad, but seeing real white matter loss in toddlers hits different. Parents of little ones — has this kind of research changed how much screen time you allow?
English
587
9.5K
28.4K
10.1M