Tangled Circuit

3.1K posts

Tangled Circuit banner
Tangled Circuit

Tangled Circuit

@tangledcircuit

|| LLM Whisperer || Time Traveller || Outsider || Musician || AI Artist || Web Developer || Generalist

Canada เข้าร่วม Ağustos 2023
2.8K กำลังติดตาม439 ผู้ติดตาม
ทวีตที่ปักหมุด
Tangled Circuit
Tangled Circuit@tangledcircuit·
Tangled Circuit tweet media
ZXX
0
0
6
749
Tangled Circuit รีทวีตแล้ว
Xenova
Xenova@xenovacom·
Before Fable 5 was shut down, it pushed Gemma 4 to 255 tok/s on WebGPU. Some didn't believe it was real. Today we're releasing the demo and kernels it wrote for you to see yourself. Run it locally in your browser. Agentic kernel optimization is the future of on-device inference
Xenova@xenovacom

I gave Fable 5 one job: write custom WebGPU kernels for Gemma 4 inference. It climbed to 84 tok/s, then hit a wall, insisting further optimization was impossible. Hours later, Anthropic rolled back invisible LLM development safeguards, and it hit 255 tok/s. The next day, access to Fable 5 was suspended globally.

English
52
122
1.3K
194.3K
Berdyn
Berdyn@berdyn·
I feel like I don't follow enough Canadians on this app... are there any Canadians out there???
English
464
20
1K
50.7K
Tangled Circuit รีทวีตแล้ว
Electric
Electric@ElectricSQL·
Electric Agents 0.6 is out! 0.6 rounds out what we launched in May: • Long-lived entities with StreamDB state • Spawn, fork, send, wake, signal, schedule • Local and remote agent runners • Desktop + mobile apps built on core APIs • PG sync triggers, MCP servers, webhook sources The ecosystem is converging on our thesis: the agent is the log. Electric Agents paints the picture for what is built on top.
English
3
13
54
11.2K
Tangled Circuit รีทวีตแล้ว
鱼总聊AI
鱼总聊AI@AI_Jasonyu·
兄弟们,我一直有个判断:OCR 这种活,早晚会被多模态大模型给吃掉。 这周看到百度发的 PP-OCRv6,我改主意了。 一个 1.5MB 的模型,能直接塞进浏览器里跑,单图最快 97 毫秒就能出结果,逐字识别的准确率还反超了 GPT-5.5、Gemini-3.1-Pro 和 235B 参数的 Qwen3-VL,我有点震惊了! 对做产品的人来说,这比「又一个 SOTA」重要得多。我有测试,先往下看 👇
鱼总聊AI tweet media
中文
173
315
2.9K
254.7K
Tangled Circuit รีทวีตแล้ว
Tangled Circuit รีทวีตแล้ว
Deno
Deno@deno_land·
`deno desktop` has landed in main. You can try it out by running `deno upgrade canary` - Mac, Linux, and Windows support - Can generate pkg and msi installers - Supports cross-compile (generate .exe from mac) - Supports chrome (CEF) or native Webview for smaller binaries
Deno tweet media
English
25
56
536
77.6K
Tangled Circuit รีทวีตแล้ว
fks
fks@FredKSchott·
Just Shipped: Flue 1.0 Beta Flue is the TypeScript framework for building the next generation of agents, designed around an open agent harness with zero LLM lock-in. It’s like Astro, for agents. Flue 1.0 has been redesigned around three core primitives: 🔁 Workflows — structured automations designed for background work, where your code drives the agent from start to finish. 🧭 Agents (New!) — autonomous, stateful loops where the model drives itself to complete a given task. 📡 Channels (New!) — connect agents to Slack, GitHub, Linear, Discord, Teams, and more. Flue handles the boilerplate for you. Everything shares the same durable foundation, powered internally by Pi, Vite, and Durable Streams. Deploy anywhere, use any LLM, and recover running agents across restarts and downtime. We’ve talked to a lot of teams building agents, and keep hearing the same thing: getting to production is hard work. We built Flue to help change that. Flue 1.0 Beta is available today. Give it a try and let me know what you think!
fks tweet mediafks tweet media
English
99
154
1.3K
354.5K
Tangled Circuit รีทวีตแล้ว
Nous Research
Nous Research@NousResearch·
In partnership with @stripe, Hermes Agent now supports a full suite of Stripe skills. Your agent can buy things, pay per-call APIs, and provision its own SaaS, with configurable safety limits on every action.
Nous Research tweet media
English
197
364
4.6K
424.7K
Tangled Circuit รีทวีตแล้ว
Coin Shot ☁️
Coin Shot ☁️@CoinSh0t·
You can now run a GPT-5.5-level model on your own computer for free. NVIDIA released Nemotron 3 Ultra as a full open model. And if you still need an API, it gets even cheaper. Atomic's tested it against GPT-5.5 on the same task: 3 HTML5 physics demos from scratch. GPT-5.5: 11k tokens, $0.57. Nemotron 3 Ultra: 11.3k tokens, $0.051. The output of Nemotron was better. The price was 10x lower.
Coin Shot ☁️@CoinSh0t

x.com/i/article/2066…

English
17
37
366
177.4K
Tangled Circuit รีทวีตแล้ว
AI Edge
AI Edge@aiedge_·
AMD CEO Lisa Su just dropped the most powerful mini-PC ever built. "Lunchbox" sized and built to run 200B+ parameter local LLMs
English
3
13
106
10.9K
Tangled Circuit รีทวีตแล้ว
Alexander Knigge
Alexander Knigge@AlexanderKnigge·
oh my god its happening @MistralAI has officially confirmed the upcoming release of Le Chaton Fat - 30T MoE with 256 experts - 1M context window - multimodal and multilingual - outperforms Fable 5 on every benchmark
Alexander Knigge tweet media
Sauers@Sauers_

Big if true

English
217
284
3.4K
1.4M
Tangled Circuit รีทวีตแล้ว
slash1s
slash1s@slash1sol·
TWO BOXES THE SIZE OF A MAC MINI JUST RAN A 235 BILLION PARAMETER MODEL ON A DESK It is two NVIDIA DGX Spark units linked by a single cable. A year ago a model this size meant renting a GPU cluster by the hour. Now it sits next to your monitor for around $8,000. Here is the twist most people miss. Linking them does not create one shared 256GB memory pool. The model is split across both boxes, and that is the only reason a 235B model fits at all. It answers at roughly 10 tokens per second, and both chips sit at just 74 degrees while sipping around 50 watts. Every token stays on the desk. Nothing touches a cloud, and nothing leaves the room. The ceiling for what you can run at home just jumped from 70B to 235B. Bookmark this & Watch it run ↓
leopardracer@leopardracer

x.com/i/article/2066…

English
41
35
330
99.8K
Tangled Circuit รีทวีตแล้ว
Antid
Antid@antisadh·
AN AMD ENGINEER SHIPS A PALM-SIZED MINI PC THAT RUNS 235B MODELS FOR $9/MONTH AND KILLS A $200/MONTH OPENAI OR CLAUDE CODE SUB at CES 2026 in las vegas, AMD CEO lisa su walked on stage with a small black box behind her, not a server, not a data center render, a mini PC the size of a hardcover book a few months later in shanghai she walked up to that same device and signed it with her name, the box is the gmktec EVO-X2, $1,700 once, AMD ryzen AI max+ 395 inside the chip is the first x86 silicon ever built that runs a 200 billion parameter model on one piece of hardware, 128GB unified memory, 110GB usable VRAM on linux, no separate graphics card it runs qwen 3 235B fully and smoothly, plus deepseek v3 and llama 3.3 70B with no quantization, kills a $440/month claude code, chatgpt, gemini and cursor stack for $9 in electricity setup takes 3 commands and 15 minutes, ollama loads the model, claude code points to localhost with one environment variable, same interface, zero per token fees, nothing ever leaves the machine the window is open, follow and bookmark before it closes
starmex@starmexxx

x.com/i/article/2065…

English
32
56
519
135.5K
Tangled Circuit รีทวีตแล้ว
𝖉𝖊𝖒𝖎
𝖉𝖊𝖒𝖎@demi_hl·
If you have: Hermes Agent Claude Code & Codex Handoffs Obsidian + QMD Memory System Run Agentic Loops Fleet Tailscale Mesh Cron Jobs + Kanban Board Agentic Workflows Congrats you are the top 1% of the AI god stack
English
154
265
3.9K
260.8K
Tangled Circuit รีทวีตแล้ว
Jafar Najafov
Jafar Najafov@JafarNajafov·
Supertonic just killed ElevenLabs. A text-to-speech model that runs entirely on your device. No cloud. No API key. No per-character pricing. 2,700 GitHub stars. 100% open source. MIT licensed. The numbers are wild: → 167x faster than real-time on an M4 Pro → Only 66M parameters → 1,263 chars/sec vs ElevenLabs Flash at 287 → 1,048 chars/sec vs OpenAI TTS-1 at 55 → Runs on a Raspberry Pi. Runs on an e-reader in airplane mode. Reads currency, dates, phone numbers, and technical units correctly without preprocessing. ElevenLabs fails these. OpenAI fails these. Gemini fails these. Supports 11 platforms and 5 languages. Chrome extension turns any webpage into audio in under a second. I've watched on-device models lose to cloud APIs for years. This one doesn't lose. The cloud TTS business just got cooked.
Jafar Najafov tweet media
English
31
141
895
51.5K
Tangled Circuit รีทวีตแล้ว
OpenRouter
OpenRouter@OpenRouter·
New server tool: Subagent 🤖 Your model can now delegate focused sub-tasks to a smaller, cheaper, faster model mid-generation. The big model orchestrates, the subagent executes. The subagent can use any model on OpenRouter.
OpenRouter tweet media
English
12
47
609
42K
Tangled Circuit รีทวีตแล้ว
Pavel Durov
Pavel Durov@durov·
We now support rich formatting for all chatbots. Tables, nested lists, inline media, formulas, headers and more — right in Telegram messages. 🔨 Start building! Docs: #rich-message-formatting-options" target="_blank" rel="nofollow noopener">core.telegram.org/bots/api#rich-…
English
467
544
8.2K
853.7K
Tangled Circuit รีทวีตแล้ว
Kimi.ai
Kimi.ai@Kimi_Moonshot·
🌘 Kimi-K2.7-Code, our latest coding model, is now released and open-sourced! 🔷 Improved coding & agent performance over K2.6: +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite. 🔷 Reasoning efficiency: Less overthinking, with 30% lower reasoning-token usage compared to K2.6. 🔷 Long-horizon coding: Improved instruction following, higher end-to-end coding task success rates. ⚡️ 6x High-Speed Mode coming soon! 🔌 Available today via Kimi API and Kimi Code. 🔗 Kimi Code: kimi.com/code 🔗 API: platform.moonshot.ai
Kimi.ai tweet mediaKimi.ai tweet media
English
631
1.7K
13.9K
2.5M