Olivier Chafik
@ochafik
Work @ Anthropic on MCP (views expressed = my own), ex-Google, past contrib. to OpenSCAD & llama.cpp; he/him 🏳️🌈 @ochafik.bsky.social @[email protected]

Your work tools are now interactive in Claude. Draft Slack messages, visualize ideas as Figma diagrams, or build and see Asana timelines.

You really can just do things! Use *any* Hugging Face Space as an MCP server along with your local models! 🔥 Here we use Qwen 3 30B A3B with @ggml_org llama.cpp and @huggingface tiny-agents to create images via FLUX powered by ZeroGPU ⚡ It's pretty wild to see local models be capable of so much, able to understand and infer from tool descriptions alone! There's a lot of potential here in automating video generation workflows, content curation, and a lot more. Bonus: you can plug in any other Inference Provider if you don't want to run locally! `npx @huggingface/tiny-agents run [TASK]` Oh, and we provide both TypeScript and Python clients! 🐐
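For context, tiny-agents is driven by a small agent config file. A minimal sketch of what such a config can look like, assuming a llama.cpp server running locally on port 8080 and a remote MCP server URL as placeholders (field names follow my recollection of the tiny-agents format; check the official docs for the exact schema):

```json
{
  "model": "Qwen/Qwen3-30B-A3B",
  "endpointUrl": "http://localhost:8080/v1",
  "servers": [
    {
      "type": "sse",
      "config": {
        "url": "https://example-space.hf.space/gradio_api/mcp/sse"
      }
    }
  ]
}
```

You would then point the CLI at the directory containing this config; the agent reads the MCP servers' tool descriptions at startup and lets the model call them.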

biggest takeaway of all is Qwen 3 30B A3B is slept on and you should be playing around with MCP if you haven't already!

Wanna disable thinking in llama.cpp? Try the new `--reasoning-budget 0` flag github.com/ggml-org/llama… Should work w/ Qwen3, QwQ, DeepSeek R1 distills, Command R7B; please report any issues! (Upcoming per-request behaviour discussed on github.com/ggml-org/llama… @ngxson) #llamacpp
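As a quick sketch of how that flag fits into a server launch (the model filename is illustrative, and I'm assuming `--jinja` is enabled for chat-template handling, as is typical for reasoning-aware setups):

```shell
# Serve a Qwen3 GGUF with thinking disabled entirely
# (--reasoning-budget 0 suppresses the model's reasoning phase)
llama-server -m qwen3-30b-a3b-q4_k_m.gguf --jinja --reasoning-budget 0
```

With the budget set to 0, responses skip the `<think>…</think>` phase rather than streaming reasoning tokens first.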
