John Records

24 posts

John Records

John Records

@JohnRecords

Se unió Eylül 2025
198 Siguiendo4 Seguidores
libapi
libapi@libapi_·
昨天,Hermes Studio「小方盒」正式上线。 今天我们已经开放了完整接口,喜欢折腾和二次开发的小伙伴可以自由接入。 理论上,你甚至可以通过单片机调用 Hermes Studio 的全部能力,把它嵌入到自己的硬件、工作流或创意项目里。 欢迎开发者、创客和自动化爱好者一起探索更多玩法。
libapi tweet media
中文
25
7
93
21.9K
John Records
John Records@JohnRecords·
@sudoingX Or a used m1 Max MacBook pro with 64 GB for around $1,300 with a 1-year warranty refurbished, on eBay
English
1
0
0
480
Sudo su
Sudo su@sudoingX·
this is the right read, and it's the box i'd point a newcomer at. the 64gb framework is the cleanest on-ramp into moe local ai there is. same chip as my 128gb, so identical speed, not a cut down part. 64gb holds the 35b-class moe models at full quant with room for context, which covers most of what you'd actually run day to day. and it's a clean integrated box, no discrete gpu to wire in, no psu to cable, you drop in storage and you're running models. $1,959 for the chip, ~$2,253 configured like this one with storage. a complete, quiet ai machine that punches above its price. the 128gb is for people loading the giants. for getting started with moe, the 64gb is genuinely the move. here is the link: frame.work/products/deskt…
Sudo su tweet media
exitLQ@CdeBurner

@sudoingX with current prices i think the 64gb version is the sweetspot for me

English
14
4
94
119.6K
John Records
John Records@JohnRecords·
@Prince_Canuma @GoogleDeepMind A google engineer posted that apple hardware will not provide the amazing speed boost that separate gpu provides. What's your experience? Thanks for your great work.
English
1
0
0
184
Prince Canuma
Prince Canuma@Prince_Canuma·
mlx-vlm v0.6.3 is here 🚀 Day-0 support for TWO new models from our partners we work closely with: 🔥 @GoogleDeepMind DiffusionGemma — a genuinely new architecture. Instead of token-by-token, it generates 256-token blocks in parallel with bi-directional attention and iteratively self-corrects the whole block, image-generator style. 26B MoE, only 3.8B active, fits in 18GB quantized. Day-0 MLX support via our Google DeepMind partnership, with long-context prefill tuned and ready. 🔥 @cohere's North Mini Code 1.0 — a 30B MoE with just 3B active, running ~66 tok/s in BF16 before any compression. Day-0 on MLX thanks to our close collaboration with the Cohere team. Get started today — install from source: > uv pip install -U mlx-vlm Then serve the model and point your favorite agent at it (pi, opencode, hermes, etc.): uv run mlx_vlm.server --model MODEL-REPO Model collection 👇🏽
Google Gemma@googlegemma

Meet DiffusionGemma! An experimental open model that explores a fast approach to text generation, released under an Apache 2.0 license. Moving beyond sequential, token-by-token processes to generate entire blocks of text simultaneously. Here’s what’s new with DiffusionGemma: 👇

English
12
18
135
15.4K
Nate Keating
Nate Keating@Nate_Keating·
One thing to keep in mind as you try DiffusionGemma – you'll want a dedicated accelerator (GPU or TPU) to see real speedups. In particular, we love our MacOs AI developers, but this model may not be best for you!
Google Gemma@googlegemma

Meet DiffusionGemma! An experimental open model that explores a fast approach to text generation, released under an Apache 2.0 license. Moving beyond sequential, token-by-token processes to generate entire blocks of text simultaneously. Here’s what’s new with DiffusionGemma: 👇

English
10
0
42
7.3K
Alex Ign
Alex Ign@aleksey_ignatov·
This is my first-ever open-source project. github.com/alxgntv/OpenWa… This app turns your @Apple Watch into an AI-powered, wrist-first interface for @openclaw agents. fyi @steipete It hasn't been published to the @AppStore yet, but I'll do that soon. For now, it can be installed via @Xcode and paired with your watch manually. It supports a ton of features: 1. Displaying your main and sub-agents 2. Chat history for each session with each agent 3. Haptic feedback 4. Welcome messages 5. Voice responses 6. Greeting messages 7. Works even when your iPhone is locked For now, it only works on Apple Watch, but I'll be expanding it step by step for other devices. Built entirely with @cursor_ai. Looking for feedback.
Alex Ign tweet media
English
8
2
70
24.9K
John Records
John Records@JohnRecords·
@hermes_updates Downloaded, and it wants to install Hermes. But Hermes already is on the computer. I paused the installation since I’ve got it working nicely and don’t want to mess it up.
English
1
0
1
29
Hermes Agent Tips
Hermes Agent Tips@HermesAgentTips·
im trying to gather some knowledge of off the local LLM folks on X... What's an affordable solution for someone that wants to get into learning the basics of the local LLM scene and dont have the cash to drop on expensive hardware drop some hardware spec/models lets see what yall suggest
Hermes Agent Tips tweet media
English
28
1
33
3.9K
JustPratibha
JustPratibha@just_pratibha·
An immersive mythological fantasy that blends ancient wisdom, spirituality & modern-day realities into a captivating narrative. Set across different parts of India during the 80s & 90s, the book follows 3 people whose lives are unknowingly connected to a much larger cosmic design
JustPratibha tweet media
English
1
0
11
218
John Records
John Records@JohnRecords·
@signalgaining Excellent, looking forward to more on your Jetson cases and WendyOS.
English
1
0
0
20
Maximilian Alexander
Maximilian Alexander@signalgaining·
The NVIDIA Jetson Orin Nano is perfect for developers, but it comes naked, without a hard drive or an operating system. We're changing that, ready to go with WendyOS installed, just plug and play!
English
3
6
79
6.2K
Hermes Agent Tips
Hermes Agent Tips@HermesAgentTips·
whats your favorite model to use with Hermes agent?
English
125
0
98
20K
John Records
John Records@JohnRecords·
@jinyuhou0 @vishalm4341 I’ve looked for the link to the models, no luck. I’m eager to see it! Please consider posting it conspicuously, perhaps in its own tweet. Thanks, mate!
English
1
0
0
28
Jinyu Hou
Jinyu Hou@jinyuhou0·
@vishalm4341 Yes! Everything is in the last post of the original thread (3/3) — code and models are all open.
English
1
0
5
209
Jun Song
Jun Song@jun_song·
Daily reminder: Use MLX format for local llms running on Mac Fastest and best format for Apple silicon.
English
10
1
54
3.6K
rabbit inc.
rabbit inc.@rabbit_hmi·
pov: never forgetting what happened in a meeting 🪄
English
15
7
119
20.2K
John Records
John Records@JohnRecords·
@HermesAgentTips I haven’t seen any studios at that price. But ebay has Mac laptops with those specs for about that price
English
1
0
0
262
Hermes Agent Tips
Hermes Agent Tips@HermesAgentTips·
everyone's buying $5,000 GPUs to run local LLMs meanwhile a used Mac Studio M1 Max 64GB is doing 60+ tok/s on Qwen3 35b for $1,500 silent. cool. holds resale value.
English
38
4
107
13.5K
comma
comma@comma_ai·
We aspire to build AI and hardware that you can own. How can the comma four and openpilot platforms better interoperate within your life?
English
31
2
106
8.8K
Ethan Lipnik
Ethan Lipnik@EthanLipnik·
Currently using Mirage while my laptop charges at my desk and I know at this point I shouldn’t be surprised but it works so well. Super smooth and pixel perfect. Looks and feels like macOS is running on my iPad. No other app feels this smooth and uses touch this well.
Ethan Lipnik tweet media
English
25
6
364
33.8K