Tangled Circuit

3.1K posts

@tangledcircuit

|| LLM Whisperer || Time Traveller || Outsider || Musician || AI Artist || Web Developer || Generalist

Canada · Joined August 2023
2.8K Following · 439 Followers
Pinned Tweet
Tangled Circuit @tangledcircuit
[media only, no text]
0
0
6
749
Tangled Circuit retweeted
Simon Späti 🏔️
Quack is a new protocol that enables client-server architecture for DuckDB over TCP/IP and HTTP(S), announced at AI Council 2026. It allows one DuckDB instance to act as a server and another as a client, with remote database attachment, query pushdown (computing on the server vs. the client), and token-based authentication. It opens up whole new possibilities, as database clients are usually quite dumb: you can send a query and retrieve data. But with Quack, you could build a whole ETL job, aggregation, or any business logic inside it. Check out more at: ssp.sh/brain/quack-pr…
English
2
9
63
4.2K
Tangled Circuit retweeted
Xenova @xenovacom
Before Fable 5 was shut down, it pushed Gemma 4 to 255 tok/s on WebGPU. Some didn't believe it was real. Today we're releasing the demo and kernels it wrote for you to see yourself. Run it locally in your browser. Agentic kernel optimization is the future of on-device inference
Xenova@xenovacom

I gave Fable 5 one job: write custom WebGPU kernels for Gemma 4 inference. It climbed to 84 tok/s, then hit a wall, insisting further optimization was impossible. Hours later, Anthropic rolled back invisible LLM development safeguards, and it hit 255 tok/s. The next day, access to Fable 5 was suspended globally.

English
67
145
1.6K
244.8K
Berdyn @berdyn
I feel like I don't follow enough Canadians on this app... are there any Canadians out there???
English
465
20
1K
51.4K
Tangled Circuit retweeted
Electric @ElectricSQL
Electric Agents 0.6 is out! 0.6 rounds out what we launched in May:
• Long-lived entities with StreamDB state
• Spawn, fork, send, wake, signal, schedule
• Local and remote agent runners
• Desktop + mobile apps built on core APIs
• PG sync triggers, MCP servers, webhook sources
The ecosystem is converging on our thesis: the agent is the log. Electric Agents paints the picture for what is built on top.
English
3
13
55
11.3K
Tangled Circuit retweeted
鱼总聊AI @AI_Jasonyu
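Folks, I have long held one view: sooner or later, OCR work will be swallowed up by multimodal large models. After seeing Baidu's PP-OCRv6 release this week, I changed my mind. A 1.5MB model that fits directly into the browser, returns a result in as little as 97 milliseconds per image, and beats GPT-5.5, Gemini-3.1-Pro, and the 235B-parameter Qwen3-VL on character-level recognition accuracy. I was a bit shocked! For people building products, this matters far more than "yet another SOTA." I ran tests; read on 👇
Chinese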
175
321
3K
259.1K
Tangled Circuit retweeted
Deno @deno_land
`deno desktop` has landed in main. You can try it out by running `deno upgrade canary`
- Mac, Linux, and Windows support
- Can generate pkg and msi installers
- Supports cross-compile (generate .exe from mac)
- Supports chrome (CEF) or native Webview for smaller binaries
English
25
59
541
79.6K
Tangled Circuit retweeted
fks @FredKSchott
Just Shipped: Flue 1.0 Beta
Flue is the TypeScript framework for building the next generation of agents, designed around an open agent harness with zero LLM lock-in. It’s like Astro, for agents.
Flue 1.0 has been redesigned around three core primitives:
🔁 Workflows: structured automations designed for background work, where your code drives the agent from start to finish.
🧭 Agents (New!): autonomous, stateful loops where the model drives itself to complete a given task.
📡 Channels (New!): connect agents to Slack, GitHub, Linear, Discord, Teams, and more. Flue handles the boilerplate for you.
Everything shares the same durable foundation, powered internally by Pi, Vite, and Durable Streams. Deploy anywhere, use any LLM, and recover running agents across restarts and downtime.
We’ve talked to a lot of teams building agents, and keep hearing the same thing: getting to production is hard work. We built Flue to help change that.
Flue 1.0 Beta is available today. Give it a try and let me know what you think!
English
103
160
1.4K
378K
Tangled Circuit retweeted
Nous Research @NousResearch
In partnership with @stripe, Hermes Agent now supports a full suite of Stripe skills. Your agent can buy things, pay per-call APIs, and provision its own SaaS, with configurable safety limits on every action.
English
197
364
4.7K
426.8K
Tangled Circuit retweeted
Coin Shot ☁️ @CoinSh0t
You can now run a GPT-5.5-level model on your own computer for free. NVIDIA released Nemotron 3 Ultra as a full open model. And if you still need an API, it gets even cheaper. Atomic's tested it against GPT-5.5 on the same task: 3 HTML5 physics demos from scratch. GPT-5.5: 11k tokens, $0.57. Nemotron 3 Ultra: 11.3k tokens, $0.051. The output of Nemotron was better. The price was 10x lower.
Coin Shot ☁️@CoinSh0t

x.com/i/article/2066…

English
17
37
366
177.7K
Tangled Circuit retweeted
AI Edge @aiedge_
AMD CEO Lisa Su just dropped the most powerful mini-PC ever built. "Lunchbox" sized and built to run 200B+ parameter local LLMs
English
3
13
106
10.9K
Tangled Circuit retweeted
Alexander Knigge @AlexanderKnigge
oh my god it's happening
@MistralAI has officially confirmed the upcoming release of Le Chaton Fat
- 30T MoE with 256 experts
- 1M context window
- multimodal and multilingual
- outperforms Fable 5 on every benchmark
Sauers@Sauers_

Big if true

English
217
284
3.4K
1.4M
Tangled Circuit retweeted
slash1s @slash1sol
TWO BOXES THE SIZE OF A MAC MINI JUST RAN A 235 BILLION PARAMETER MODEL ON A DESK It is two NVIDIA DGX Spark units linked by a single cable. A year ago a model this size meant renting a GPU cluster by the hour. Now it sits next to your monitor for around $8,000. Here is the twist most people miss. Linking them does not create one shared 256GB memory pool. The model is split across both boxes, and that is the only reason a 235B model fits at all. It answers at roughly 10 tokens per second, and both chips sit at just 74 degrees while sipping around 50 watts. Every token stays on the desk. Nothing touches a cloud, and nothing leaves the room. The ceiling for what you can run at home just jumped from 70B to 235B. Bookmark this & Watch it run ↓
leopardracer@leopardracer

x.com/i/article/2066…

English
41
35
330
99.8K
Tangled Circuit retweeted
Antid @antisadh
AN AMD ENGINEER SHIPS A PALM-SIZED MINI PC THAT RUNS 235B MODELS FOR $9/MONTH AND KILLS A $200/MONTH OPENAI OR CLAUDE CODE SUB at CES 2026 in las vegas, AMD CEO lisa su walked on stage with a small black box behind her, not a server, not a data center render, a mini PC the size of a hardcover book a few months later in shanghai she walked up to that same device and signed it with her name, the box is the gmktec EVO-X2, $1,700 once, AMD ryzen AI max+ 395 inside the chip is the first x86 silicon ever built that runs a 200 billion parameter model on one piece of hardware, 128GB unified memory, 110GB usable VRAM on linux, no separate graphics card it runs qwen 3 235B fully and smoothly, plus deepseek v3 and llama 3.3 70B with no quantization, kills a $440/month claude code, chatgpt, gemini and cursor stack for $9 in electricity setup takes 3 commands and 15 minutes, ollama loads the model, claude code points to localhost with one environment variable, same interface, zero per token fees, nothing ever leaves the machine the window is open, follow and bookmark before it closes
starmex@starmexxx

x.com/i/article/2065…

English
32
56
519
135.7K
Tangled Circuit retweeted
𝖉𝖊𝖒𝖎 @demi_hl
If you have: Hermes Agent Claude Code & Codex Handoffs Obsidian + QMD Memory System Run Agentic Loops Fleet Tailscale Mesh Cron Jobs + Kanban Board Agentic Workflows Congrats you are the top 1% of the AI god stack
English
154
265
3.9K
261.1K
Tangled Circuit retweeted
Jafar Najafov @JafarNajafov
Supertonic just killed ElevenLabs.
A text-to-speech model that runs entirely on your device. No cloud. No API key. No per-character pricing.
2,700 GitHub stars. 100% open source. MIT licensed.
The numbers are wild:
→ 167x faster than real-time on an M4 Pro
→ Only 66M parameters
→ 1,263 chars/sec vs ElevenLabs Flash at 287
→ 1,048 chars/sec vs OpenAI TTS-1 at 55
→ Runs on a Raspberry Pi. Runs on an e-reader in airplane mode.
Reads currency, dates, phone numbers, and technical units correctly without preprocessing. ElevenLabs fails these. OpenAI fails these. Gemini fails these.
Supports 11 platforms and 5 languages. Chrome extension turns any webpage into audio in under a second.
I've watched on-device models lose to cloud APIs for years. This one doesn't lose. The cloud TTS business just got cooked.
English
31
142
895
51.5K
Tangled Circuit retweeted
OpenRouter @OpenRouter
New server tool: Subagent 🤖 Your model can now delegate focused sub-tasks to a smaller, cheaper, faster model mid-generation. The big model orchestrates, the subagent executes. The subagent can use any model on OpenRouter.
English
13
47
609
42.1K