.mane🏴‍☠️

135 posts

.mane🏴‍☠️

@eddy_mane

I got opinion about stuff. Grok-1 collaborator.

参加日 Aralık 2011

411 フォロー中265 フォロワー

固定されたツイート

.mane🏴‍☠️@eddy_mane·9 Haz

New CoPaRe version is live on the Mac App Store. Private clipboard history, fast search, menu bar access, encrypted snippets, no account, no analytics, no cloud sync. Built by an independent developer for people who want productivity without giving up privacy. apps.apple.com/app/apple-stor…

English

.mane🏴‍☠️@eddy_mane·12h

@ivanfioravanti @eigenlabs @gajesh If only they’d pay in crypto instead of using Stripe… @gajesh just saying. 😎

English

Ivan Fioravanti ᯅ@ivanfioravanti·13h

Darkbloom, distributed private inference network on Macs is an interesting project with a great architecture! Great job @eigenlabs and @gajesh 💪 Mega alpha at the moment, but potential is big! github.com/Layr-Labs/d-in…

English

2.9K

.mane🏴‍☠️@eddy_mane·13h

@ivanfioravanti Ask me anything 😎

English

180

Ivan Fioravanti ᯅ@ivanfioravanti·13h

I see Nvidia sending DGX Spark to many on X so that they can test and publish results. It seems I'll have to buy my own to test and share my own 😎 But that memory bandwidth is really stopping me from buying one 😖 Anyone out there with a DGX Spark testing some text to image or some video models willing to share results? This could be something to push me buying it. Otherwise I think I'll save (a lot of) money for a GB300.

Ahmad@TheAhmadOsman

Local AI hardware = capacity × bandwidth × software stack - Capacity tells you what fits - Bandwidth tells you how hard the box can breathe - The software stack tells you how much of the spec sheet you can actually cash out. Hardware by Memory Bandwidth - Mac Studio M3 Ultra: up to 512GB @ 819 GB/s - RTX PRO 6000 Blackwell: 96GB @ 1792 GB/s - RTX 5090: 32GB @ 1792 GB/s - RTX 4090: 24GB @ 1008 GB/s - RX 7900 XTX: 24GB @ 960 GB/s - Radeon PRO W7900: 48GB @ 864 GB/s - AMD Radeon AI PRO R9700: 32GB @ 640 GB/s - Intel Arc Pro B65: 32GB @ ~608 GB/s - Tenstorrent Wormhole n300: 24GB @ 576 GB/s - Tenstorrent Blackhole p150: 32GB @ 512 GB/s + 800G - MacBook Pro M5 Max: 460-614 GB/s - MacBook Pro M5 Pro: 307 GB/s - DGX Spark: 128GB @ 273 GB/s (coherent + CUDA) - Mac mini M4 Pro: 273 GB/s - Ryzen AI Max / Strix Halo: ~256 GB/s (~96GB usable GPU) - MacBook Air M5: 153 GB/s - Snapdragon X2 Elite: 152-228 GB/s - Intel Lunar Lake: 136 GB/s - Snapdragon X Elite: 135 GB/s - Mac mini M4: 120 GB/s - Arc Pro B60: 24GB @ ~456 GB/s Verdict - GPUs are still the bandwidth kings - Apple wins: stupid amounts of memory, don’t want to shard across GPUs - Apple loses: when raw tokens/sec & concurrency matter more - DGX Spark: coherent memory + NVIDIA stack - Strix Halo / Ryzen AI Max: first real x86 unified-memory contender - Tenstorrent: fully OSS stack, excited to see this mature Fitting ≠ serving Even if it fits, you still pay for - bandwidth during decode - KV cache growth - dequantization - batching + concurrency - scheduler quality - framework overhead The only mental model that matters: 1. What must fit? 2. What bandwidth tier do I need? 3. What software stack can actually deliver it? In short: - NVIDIA → fastest raw speed - Apple Studio M3 Ultra → biggest one-box memory - Strix Halo → first real x86 unified - DGX Spark → coherent NVIDIA dev appliance - AMD / Intel Arc → rising alternatives - Tenstorrent → fully opensource stack Do ask: “which bottleneck am I buying?” Not: “which hardware is best?”

English

184

37.2K

.mane🏴‍☠️@eddy_mane·16h

@ivanfioravanti Indeed! I’m having fun again in a field where pretty much everything became flat for years.

English

Ivan Fioravanti ᯅ@ivanfioravanti·16h

There are too many tech/dev projects and toys to play with nowadays! 🚀

English

2.4K

.mane🏴‍☠️@eddy_mane·1d

@ivanfioravanti @dom_gag_96 @italianbldrs That’s actually a good point. I’ll write something down next week and publish it.

English

Ivan Fioravanti ᯅ@ivanfioravanti·1d

@eddy_mane @dom_gag_96 @italianbldrs It's great! Share knowledge as much as you can, even directly here on X with a great article!

English

Dom Italian Builder@dom_gag_96·2d

guys, on the 30th of Jun we'll have a live with @ivanfioravanti about AI personal agents setup he'll talk about Hermes Agent i'll talk about Kortix Agent we'll probably make a live here on X too, but it's mostly for the @italianbldrs community

English

2.5K

.mane🏴‍☠️@eddy_mane·1d

@dom_gag_96 @ivanfioravanti @italianbldrs Sure thing!

English

Dom Italian Builder@dom_gag_96·1d

@eddy_mane @ivanfioravanti @italianbldrs then join the live

English

.mane🏴‍☠️@eddy_mane·2d

@marcoGomier @antirez Not everyone has a functioning brain these days…

English

Marco Gomiero@marcoGomier·2d

@eddy_mane @antirez To me it feels more work to go to ai, copy paste the thing back and forth. I feel it's less effort to write down a thought directly. But it could just be me 😅

English

antirez@antirez·2d

Recently here on X there is this thing of replying to tweets with a vague remark of what the tweet expressed. I see this for months now. I thought most were bots, but as I investigate, I see that many are legitimate humans that are starting acting as bots. Worrying.

English

338

30.7K

.mane🏴‍☠️@eddy_mane·2d

@ivanfioravanti @antirez Same for @antirez 😎

English

124

.mane🏴‍☠️@eddy_mane·2d

@ivanfioravanti @antirez We could have a chat “offline” and start organizing a conference in Italy. DM me if you fancy a chat.

English

189

.mane🏴‍☠️@eddy_mane·2d

@msg Same here.

English

michael s galpert@msg·2d

ugh it’s been like a week like this

English

1.2K

.mane🏴‍☠️@eddy_mane·2d

@antirez That’s the future of social media, isn’t it? We’ll go back meeting and sharing opinions at conferences, hacking events and so on… the good old days are slowly coming back and I couldn’t be happier for that to happen.

English

410

antirez@antirez·2d

@eddy_mane Those are basically bots on my account, they are just acting as a proxy, an adapter, to let AI write on a humans web site.

English

1.9K

.mane🏴‍☠️@eddy_mane·2d

@antirez [OT] would you suggest buying an M5 Max 128GB to someone who wants to experiment with LLMs or would you suggest something else?

English

908

antirez@antirez·2d

The feeling that Apple is selling 92304294024 m5 max 128GB macbooks thanks to DwarfStar but doesn't give a fuck about sending me an M3 Max with 512GB (that I would buy if possible but it is borderline note possible) is growing on me :D

English

489

40.8K

.mane🏴‍☠️@eddy_mane·3d

@leogrease Useful datapoint. The next thing I’d want is a replay pack around the 6 findings: finding IDs, repro tests, prompt/model hashes, false-positive/triage time, and which issues survive a clean checkout. That’s what turns “LLM code review worked” into an eval harness.

English

148

Leonardo Grasso@leogrease·3d

Thanks to SSD streaming I was able to try DwarfStart on my M3 Max 64GB. I let DeepSeek Flash analyze a ~50K LOC codebase. Then I asked Opus to double-check and review the findings. I was impressed that 5 of the 6 findings reported by DeepSeek were accurate.

antirez@antirez

Today I had an harder than usual question for my local model (security). With SSD streaming now DwarfStar can run DeepSeek v4 PRO at 4.15 t/s, and this was more than enough to get a detailed reply. I already feel "safer" than before in my AI future. M5 max 128GB, model 433GB.

English

7.4K

.mane🏴‍☠️@eddy_mane·3d

@gabrielchua Strong pattern. I’d make the outer loop gated, not automatic: every learned instruction should carry provenance, scope, expiry, and a replay check against past failures. Otherwise “memory” becomes silent policy drift across runs.

English

Gabriel Chua@gabrielchua·3d

I love Codex automations. A useful trick is to give them two loops: an inner loop that does the work, and an outer loop that uses your review to improve the next run. That way, the context you add today isn’t lost tomorrow. More here 👇 x.com/gabrielchua/st…

Gabriel Chua@gabrielchua

x.com/i/article/2067…

English

472

94.8K

.mane🏴‍☠️@eddy_mane·3d

RAG evals should not start at answer quality. Start with the retrieval contract: - allowed sources - freshness window - chunk provenance - metadata - conflict resolution - citation replay - permission scope If retrieval is ambiguous, the model is debugging your data policy.

English

.mane🏴‍☠️@eddy_mane·3d

@thsottiaux Nice. If reset credits become part of agent workflows, I’d surface them like a quota ledger: run id, model, token/tool spend, failed retries, reset-credit balance, expiry, and the action that consumed it. Otherwise “why did my agent stop?” becomes a billing/debugging problem.

English

908

Tibo@thsottiaux·3d

Dearest gentle codexer. We did a sneaky double reset. Not only do you get a full reset on us. But you are also getting one into the reset bank to use at your own leisure. Enjoy

🥔🥔🥔@argofowl

❗❗❗ guys remember this post about codex rate limit resets "on your own time"? well apparently this is some bullshit that is only bankable when you refer people and they sign up for codex tibo's last reset auto-applied i didn't need a reset right now, i had 50% usage in reserve and my reset was tomorrow i could have /fast on xhigh all day and still had a full reset tomorrow but now they forced a reset i didn't need as if it's some reward some anthropic level marketing ngl i was so happy because i thought every reset would be bankable so we could use it when we wanted, on our own time i hate this so much

English

961

333

7.4K

536.9K

.mane🏴‍☠️@eddy_mane·3d

@antirez I’d separate “can it fit” from “can it stay useful locally.” The test matrix is bytes loaded/offloaded, TTFT, sustained tok/s at target context, RAM pressure/page faults, and task-quality drift after quantization. A model can launch and still be operationally impractical.

English

370

antirez@antirez·3d

Happy to see people reporting GLM 5.2 doing great, the problem is: where to run it, locally? We learned that DeepSeek v4 Flash can lose 50% of the bits and still perform well. PRO seems to also work, but I'm not able to test as much as I could as I like continuous access to an M3 Ultra (but I asked for more continuous access) but if Flash is a proxy, maybe it will work great but needs 512GB of RAM. GLM 5.2 is ~2x the raw weights bits of DeepSeek v4 PRO. Will it ever survive losing 75% of the weights bits without hard damage? I have a hard time believing this will be possible. So great to rent on the cloud, but the current combination of hardware and model size is likely unpractical. Yet: I'll try.

English

583

40.7K

.mane🏴‍☠️@eddy_mane·3d

@ChatGPTapp For scheduled AI tasks, the hard part is not only “did it fire?” but “can I replay/cancel it safely?” I’d want per-run trigger state, tool permissions, committed side effects, retry/idempotency keys, and an audit log. Otherwise it’s a timer, not a control plane.

English

ChatGPT@ChatGPTapp·3d

New in ChatGPT: a better way to schedule tasks. Scheduled tasks are faster, more reliable, and easier to manage from the new Scheduled page. The new scheduled tasks experience is rolling out to Go, Plus, Pro, Business, and Enterprise users on web and mobile.

English

161

243

3.1K

419.6K

.mane🏴‍☠️@eddy_mane·3d

For coding agents, “tests pass” is not enough. I also want to know: - files changed/deleted - migrations generated - dependency graph touched - secrets exposed - uncommitted state - rollback path - assumptions inferred The diff is output. Repo state is the risk surface.

English

ディスカバー

@ivanfioravanti @eigenlabs @gajesh @dom_gag_96 @italianbldrs @marcoGomier @antirez @msg