
Tom Jansen
393 posts

Tom Jansen
@thomasjansn
forward deployed human. agentic ai. product. strategy and general deep thinking.


Quick update on that: I am running the first DRACO benchmark, GPT 5.5 xhigh with pi.dev as harness and GPT 5.5 xhigh as judge and synthesizer via OpenAI API directly (same benchmark as the one used by OpenRouter) as a baseline, then going into testing different combinations of local models, with either paid and/or local models for judge and synthesizing. Benchmarking takes way longer than I expected .... DRACO has 100 tasks and this first run is already running more than 24 hours, will take another 3 hours for sure. Also doing this only with API pricing would land at about 600$, but via the 200$ sub it's completely fine. Will keep you updated once it's finished. research.perplexity.ai/articles/evalu…



Based on the @OpenRouter Fusion approach, I made an extenstion for pi.dev by @earendilworks @badlogicgames that lets you define your own set of fusion models, thinking-effort per model etc. so you can easily play around with it too. First tests give good results, but I didn't have any time for benchmarking yet, which I still intend to do -> github.com/tjansn/pi-fusi…






Some thoughts on ownership I shared with the team this morning




Fable 5 is dead. We just resurrected it — cheaper, open and you hold the keys. OpenRouter dropped Fusion 48h ago and broke the internet. We tested it hard. The synthesizer is insane for deep research… but absolute dogshit for coding. So we fixed it. Meet OrcaRouter.ai DSL — the version you actually own. One prompt → fans out to any panel you want → judge + synthesizer → one god-tier answer. But unlike black-box slugs, you control the entire graph in YAML. Fable 5 level intelligence… without waiting for Anthropic to turn it back on 🧵👇

We're excited to join forces with @SpaceX to advance the frontier of useful AI. Expect significant improvements to Cursor soon.

Do more experimentations with local models people! vickiboykis.com/2026/06/15/run…

Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇

😻








