Anthony

245 posts

Anthony

@DePasqualeOrg

Building Local Chat: https://t.co/P6iTZarFB9 Open source: https://t.co/qWD5BRt0o6

Katılım Ağustos 2015

193 Takip Edilen575 Takipçiler

Anthony@DePasqualeOrg·12 Mar

@GergelyOrosz AI accelerates the first 90% of development, but not so much the last 90%.

English

242

Gergely Orosz@GergelyOrosz·12 Mar

One thing that endlessly frustrates with Anthropic, a $300B+ dollar company, where most code is written with AI: Their landing page for paying customers, Claude .ai has been broken for weeks UX-wise, and no one notices or cares or fixes: It "loses" stuff I type while it loads:

English

158

1.5K

279.7K

Anthony@DePasqualeOrg·2 Mar

@mattcassinelli Containers on iOS would be great. github.com/apple/containe…

English

146

Matthew Cassinelli@mattcassinelli·2 Mar

Can you imagine if Apple gave us Terminal on iPad. God that'd be sick.

English

385

36.3K

Anthony@DePasqualeOrg·28 Şub

@awnihannun Thanks for everything you’ve done, and especially for creating a thriving community around MLX!

English

1.1K

Awni Hannun@awnihannun·28 Şub

Today is my last day at Apple. Building MLX with our amazing team and community has been an absolute pleasure. It's still early days for AI on Apple silicon. Apple makes the best consumer hardware on the planet. There's so much potential for it to be the leading platform for AI. And I'm confident MLX will continue to have a big role in that. To the future: MLX remains in the exceptionally capable hands of our team including @angeloskath, @zcbenz, @DiganiJagrit, @NasFilippova, @trebolloc (and others not on X). Follow them or @shshnkp for future updates.

English

260

2.2K

396.9K

Anthony@DePasqualeOrg·20 Şub

@awnihannun I've been thinking the same thing. The compute power of all currently active Apple devices is equivalent to a non-trivial percentage of all data center capacity. This is probably the scenario that @exolabs is aiming for.

English

Awni Hannun@awnihannun·20 Şub

Inference compute scarcity seems plausible. Hyper growth in demand with limited hardware or data center energy supply means token shortage. This could be a reason to run more AI locally.

Tibo@thsottiaux

I am increasingly asked during candidate interviews how much dedicated inference compute they will have to build with Codex. Pairing this with usage per user growing significantly faster than the number of users, it's pretty clear that compute will be something that is scarce.

English

5.1K

Anthony@DePasqualeOrg·19 Şub

@anemll The main limiting factor for this now is the agentic capabilities of smaller on-device models.

English

Anemll@anemll·19 Şub

strong reasoning and built-in tools, running locally on the iPhone. autonomous, low-latency ops

English

493

Anemll@anemll·19 Şub

Siri should be an on-device version of OpenClaw

English

911

Anthony@DePasqualeOrg·16 Şub

@bcherny @peakcooper Some more feedback: Please implement more MCP features like reacting to "tools changed" notifications and automatically reconnect to MCP servers after disconnections.

English

492

Boris Cherny@bcherny·15 Şub

@peakcooper Actively working on improving this. Keep the feedback coming

English

531

39.2K

Cooper@peakcooper·15 Şub

Claude Desktop app takes a full 10 seconds to switch from the 'Code' tab to the 'Chat' tab. It then takes another 10 seconds for the input to become usable. It then takes another 10 seconds for your prompt to be submitted after you clicked Enter. I am convinced, absolutely convinced that nobody at anthropic actually uses this. It's impossible. I refuse to believe

English

865

158.7K

Anthony@DePasqualeOrg·2 Şub

@anemll It could be interesting to expose this on an MCP server in Swift. github.com/DePasqualeOrg/…

English

Anemll@anemll·31 Oca

My little remote harness works with iPhone mirroring too, which is useful for on-device ANE workflow. I give Claude a task, it implements and tests it with actual UI/device. Some UI inconsistencies are only visible on the real device—like iPhone 17 Pro's notoriously bad UI scaling (e.g., still not fixed in the X app after 6 months).

English

11.1K

Anthony@DePasqualeOrg·28 Oca

Here's my recent presentation at Swift Barcelona about building agentic apps with the new Swift AI and Swift MCP packages. youtube.com/watch?v=ekOzNd…

YouTube

English

417

Anthony@DePasqualeOrg·26 Oca

@nickoates_ It's even easier now on Apple devices with my Swift version. github.com/DePasqualeOrg/…

English

Nick Oates@nickoates_·25 Oca

idk how this is taking Apple so long. you literally just type `npm i ai` and you’re good to go.

9to5Mac@9to5mac

Apple to 'unveil' results of Google Gemini partnership as soon as next month: report 9to5mac.com/2026/01/25/app… by @mbrkhrdt

English

10K

Anthony@DePasqualeOrg·19 Oca

@rauchg I implemented almost all the missing functionality in the Swift SDK for MCP. Human feedback is still necessary for evaluating engineering tradeoffs and finding the right API design. github.com/DePasqualeOrg/…

English

135

Guillermo Rauch@rauchg·19 Oca

Skeptics are saying that agents / opus are not actually yielding new or improved software. That it's just hype. Reply with the most interesting things you've shipped or solved with the latest generation of ai coding tools. I'd love to hear the anecdotes and see the links.

English

321

675

148K

Anthony@DePasqualeOrg·17 Oca

@jordansblog Thanks for your feedback. I'm working on this open-source package, which could enable this feature in the future. github.com/DePasqualeOrg/…

English

Jordan@jordansblog·17 Oca

@DePasqualeOrg I'm trying local chat for the first time on MacOS. is there a way to enable spoken (TTS) responses? I'm a voiceover user, so this would be helpful. Thanks

English

Anthony@DePasqualeOrg·23 Tem

A new version of Local Chat is now available with support for several much-requested models, including: - Gemma 3n (text only) - SmolLM 3 - DeepSeek V3 Thanks to everyone in the growing MLX Swift community who helped port these models to Swift!

English

Anthony@DePasqualeOrg·12 Oca

The talk will include demos using the MCP package for Swift that I've been working on. github.com/DePasqualeOrg/…

English

178

Anthony@DePasqualeOrg·12 Oca

I'll be giving a talk about MCP in Swift at the SwiftBarcelona meetup on January 22. Come by if you're in town! meetup.com/swiftbarcelona…

English

405

Anthony@DePasqualeOrg·4 Oca

@bcherny @bcherny, Claude Code keeps asking me for permission to run commands that I've explicitly allowed in ~/.claude/settings.json. Is this a bug, or am I missing something?

English

188

Boris Cherny@bcherny·2 Oca

I'm Boris and I created Claude Code. Lots of people have asked how I use Claude Code, so I wanted to show off my setup a bit. My setup might be surprisingly vanilla! Claude Code works great out of the box, so I personally don't customize it much. There is no one correct way to use Claude Code: we intentionally build it in a way that you can use it, customize it, and hack it however you like. Each person on the Claude Code team uses it very differently. So, here goes.

English

1.3K

54.4K

8.1M

Anthony@DePasqualeOrg·29 Ara

@ldenoue In principle, any model can be implemented in MLX. It looks like this one hasn't implemented voice cloning, but it's on their roadmap.

English

Laurent Denoue@ldenoue·29 Ara

@DePasqualeOrg Thanks for your work. Do you know is this Soprano TTS model can run in MLX? x.com/wildmindai/sta…

Wildminder@wildmindai

Soprano: An instant, ultra-lightweight TTS model for realistic speech; generates 10 hours of 32kHz audio in <20s; streams with <15ms latency using just 80M params & <1GB VRAM. Has some limitations and drawbacks. github.com/ekwek1/soprano

English

118

Anthony@DePasqualeOrg·28 Ara

Loading MLX models is about to get a lot faster in Swift, thanks to the performance work I’ve done on swift-transformers. In my testing, I'm seeing load times go from ~2300 ms down to ~500 ms. This makes it possible to interact with the model immediately after app launch.

English

9.3K

Anthony@DePasqualeOrg·28 Ara

The results of this test are even better: loading time goes from ~3900–4600 ms to ~300–360 ms after all my optimizations. github.com/ml-explore/mlx…

English

564

Anthony@DePasqualeOrg·28 Ara

@ivanleomk Most recently a small base model for experimenting with typing completions. Performance during inference in Swift is generally pretty close to Python, and I’m on a mission to close the remaining gaps.

English

102

Ivan Leo@ivanleomk·28 Ara

@DePasqualeOrg What models are you currently running on MLX? Been tempted to use it for local apps but not sure how far it can go

English

239

Anthony@DePasqualeOrg·28 Ara

@guitaripod @awnihannun M3 MacBook Pro. The same model on an iPhone 16 Pro loads in ~600 ms after my optimizations.

English

167

Anthony@DePasqualeOrg·23 Ara

@EricCO2Removal That's a great tip. Keep in mind that this is a very new project, with many conceivable paths for future development. My current focus and point of differentiation at this early stage is implementing a wide range of models in MLX in Swift.

English

Eric Matzner@EricCO2Removal·23 Ara

@DePasqualeOrg How many models does one need once we hit a certain level ;) Need new features that make whichever models are there, better... If doing model, maybe some realtime ones (this has vad) github.com/collabora/Whis… Then it can be used widely for captioning/translation/live meeting notes.

English

Anthony@DePasqualeOrg·22 Ara

In the past week, I've added two new text-to-speech models (Chatterbox Turbo and CosyVoice 3) and one new speech-to-text model (Fun-ASR) to mlx-swift-audio. I've also improved performance across all models. You can try them out in the example apps in the repo (link below).

English

5.6K

Keşfet

@GergelyOrosz @mattcassinelli @awnihannun @angeloskath @zcbenz @DiganiJagrit @NasFilippova @trebolloc