GoJo

921 posts

GoJo

@raazor5050

Enjoying the ebb and flow of life

Earth Beigetreten Haziran 2020

1.1K Folgt130 Follower

GoJo@raazor5050·15h

@vipulgupta2048 @CommandCodeAI Please add me!!!!

English

Vipul Gupta@vipulgupta2048·1d

Heyyo! Looking for folks to beta test /goal that I've been working on at @CommandCodeAI, who's in?

English

900

GoJo@raazor5050·16h

@CommandCodeAI Is there Goal or Loop command supported?

English

Command Code@CommandCodeAI·16h

Resume session with ease, now in Command Code. cmd -c or cmd --resume

English

4.8K

GoJo@raazor5050·2d

@MrAhmadAwais @_ARahim_ @steipete @bcherny Hi @MrAhmadAwais Have DMed you some issues. Could you look into it please?

English

Ahmad Awais@MrAhmadAwais·2d

@_ARahim_ @steipete @bcherny making typos makes your prompts better actually causes llms to think and improve.

English

498

Abdur Rahim@_ARahim_·3d

I kept spotting typos in my Claude Code prompts after I'd sent them. So I built tuipo — Grammarly for your terminal It underlines typos as you type in any TUI — CC, Codex, Aider, vim, your shell — and never touches the app it wraps. github.com/ARahim3/tuipo @steipete @bcherny

English

407

123.9K

GoJo@raazor5050·2d

@noctus91 @liquidai What about iPhone?

English

239

Noctus@noctus91·3d

Got a lot of DMs and comments asking how to run @liquidai LFM2.5-8B-A1B with Hermes Agent locally, so I put together a complete step bystep guide. It covers macOS, Linux, and Windows. Hope it helps everyone who reached out.

Noctus@noctus91

x.com/i/article/2064…

English

354

42.1K

GoJo@raazor5050·4d

@MrAhmadAwais @theZachChen @CommandCodeAI Any mobile app in horizon for CommandCode ? Ready to ditch Codex if CommandCode has a mobile app

English

Ahmad Awais@MrAhmadAwais·4d

@theZachChen Wait till you try it in @CommandCodeAI $1 plan and $40 effective usage

Ahmad Awais@MrAhmadAwais

how did we make deepseek outperform opus 4.7? i've been thinking about why "open model bad at tool calling" is almost always a harness problem, not a model problem. context: spent the two days looking at billions of tokens in @CommandCodeAI (tb open source ai cli) using deepseek. I ended up writing a tool-input repair layer. the trigger was watching deepseek-flash fail on the simplest /review run, every shellCommand and readFile call bouncing back with a raw zod issues blob, the model unable to recover because the error wasn't in a form it could read. by the end deepseek v4 pro was beating opus 4.7 6/10 times on our internal evals. a few things i learned that feel general: 1/ the failure modes aren't random they're a small finite compositional set. across deepseek-flash, deepseek v4 pro, glm, qwen, the same four mistakes repeat almost exactly: - sending `null` for an optional field instead of omitting it - emitting `["a","b"]` as a json *string* instead of an actual array - wrapping a single arg in `{}` where the schema expected an array (an "empty placeholder") - passing a bare string where an array was expected (`"foo"` instead of `["foo"]`) four repairs, ~30-100 lines each, ordered carefully (json-array-parse must run before bare-string-wrap or `'["a","b"]'` becomes `['["a","b"]']`). that is the whole catalogue. when i hear "this open source model can't do tool calls" i now assume one of those four, and so far that's been right ~90% of the time. 2/ the funniest failure mode is also the most revealing. deepseek-flash, when asked to edit or write a file, sometimes emits the path as a *markdown auto-link*: filePath: "/Users/x/proj/[notes.md](http://notes. md)" our writeFile tool obediently trued creating files literally named `[notes.md](http://notes .md)` until we caught it. this is not a hallucination. it's the post-training chat distribution leaking through the tool boundary the model has been rewarded for auto-linking in conversational output, and is applying that prior in a context where it makes no sense. the fix is two regex lines that unwrap only the degenerate case where link text equals url-without-protocol real markdown like `[click](https://x .com)` passes through untouched. this is also conditioning of their own tools during RL which were different from all other tools we write and ofc can't predict. "tool confusion" is a more useful frame than "capability gap." the model knows how to format a path. it just hasn't been told clearly enough that this path is going to fopen, not into a chat bubble. so we encode that hint at the schema level `pathString()` instead of `z.string()` and the leak is plugged for every path field at once. 3/ the design choice that mattered was inverting preprocess-then-validate to validate-then-repair. my first attempt was the obvious one: a preprocessing pass that normalized inputs (strip nulls, parse stringified arrays, etc.) before zod ever saw them. it broke immediately, writeFile content that *happened* to be json-shaped got rewritten before it hit disk. silent corruption, easy to miss in a smoke test. then i made it less greedy - parse the input as-is. if it succeeds, ship it. valid inputs are never touched. - on failure, walk the validator's own issue list. for each issue path, try the four repairs in order until one applies. - parse again. on success, log `tool_input_repaired:${toolName}`. on failure, log `tool_input_invalid:${toolName}` and return a model-readable retry message. the structural insight here is: when you preprocess, you encode a prior about what's broken. when you let the validator complain first, the schema is the prior, and you only spend repair budget at the exact paths the schema actually disagreed at. the validator is doing the work of localizing the bug for you. it's the same shape as cheap-then-careful everywhere else try the fast path, fall back on evidence. (this also gives you per-tool telemetry for free. you can watch repair rates per (model, tool) and notice when a model regresses on a specific contract before users do.) 4/ shape invariants and relational invariants need different fixes. the four repairs above all handle shape problems wrong type, missing key, wrong container. but read_file had a *relational* invariant: "if you provide offset, you must also provide limit, and vice versa." deepseek kept calling `readFile({ absolutePath, limit: 30 })` and getting an `ERROR:` back. you can't fix this with input repair, because each field is independently valid the bug is in the relationship between them. so i taught the function the model's intent instead. `limit` alone → `offset = 0`. `offset` alone → `limit = 2000` (matches common read tool ops default). then surfaced the decision back to the model in the result: "Note: limit was not provided; defaulted to 2000 lines. To read more or fewer lines, retry with both offset and limit." no `Error:` prefix, so the tui doesn't paint it red. the model sees what we picked and can self-correct on the next turn if our guess was wrong. transparency over silent magic wins big. repair where you can. extend semantics where you can't. surface the choice either way. zoom out: a lot of what looks like model capability is actually contract design. a strict schema is a choice with a cost it filters out noise, but it also filters out recoverable noise from any model that hasn't memorized the exact json contract you happened to pick. the largest commercial models eat that cost invisibly and are linient on tool calling because they've seen enough of every contract during pretraining; open models pay it loudly and get dismissed for it. the harness is where you mediate between distributions. four small repairs (i'm sure more to follow as we have three more merging today), two regex lines for auto-links, one relational default, one prefix change. the model didn't change. the contract got more forgiving in exactly the places it needed to be. deepseek v4 pro now beats opus 4.7 6/10 times on our internal evals. imo "skill issue" applies to the harness more often than the model.

English

550

Zach Chen@theZachChen·6d

found deepseek to be a really good explainer of concepts

English

1.4K

GoJo@raazor5050·6 Haz

@petergyang @zarazhangrui She is the best

English

1.2K

Peter Yang@petergyang·6 Haz

Who are some women building amazing things with AI agents right now? I’d love to follow and learn from more of them.

English

156

34.1K

GoJo@raazor5050·5 Haz

@MrAhmadAwais @CommandCodeAI Why does $1 plan still offer only $40 of Deepseek V4 Pro? I see Community Notes being posted in some of your posts saying that the pricing is outdated and should offer more Deepseek V4 pro usage.

English

Ahmad Awais@MrAhmadAwais·5 Haz

@CommandCodeAI Now my favorite model replacing all other flash models. dang, it's supa fast!!

English

533

Command Code@CommandCodeAI·5 Haz

NVIDIA Nemotron 3 Ultra now available in Command Code! Strongest US open model yet! 🍀 • 1M context • 5x faster inference • 550B MoE frontier-intelligence open model DEAL 2.3x usage 🎟️ $1 Go plan gets you ~$23 usage on Nemotron Woah, it's fast x taste compliance is great!

English

104

22.1K

GoJo@raazor5050·30 May

Yes Really good for local models. They have released a new model just days ago. I use CommandCode for other non coding tasks and have been seriously dabbling with local models on laptop. It would be great if you can allow some form of GGUF or local model deployment to be run via CLI!

English

Ahmad Awais@MrAhmadAwais·30 May

@raazor5050 Are they any good?

English

136

Ahmad Awais@MrAhmadAwais·30 May

my new fav supa fast (400tps) and cheap open model. Step 3.7 Flash just shipped in Command Code.

English

6.2K

GoJo@raazor5050·29 May

@maximelabonne How to run this model on local smartphone? @maximelabonne

English

Maxime Labonne@maximelabonne·28 May

So proud of this release! It's the first step towards agents running on device. We learned so, so much post-training this model (stay tuned!). Massive congrats to the team, you've been amazing to work with ♥️

Liquid AI@liquidai

Today, we're releasing LFM2.5-8B-A1B, a device-optimized model designed to power real-life applications on phones, laptops, PCs, robots, and fast & lightweight server-side use-cases. > 8B MoE, 1.5B active > Expanded 128K context > LFM2.5 flagship hybrid MoE architecture > Trained on 38T tokens + large-scale RL > fast, reliable tool calling, punching above its weight, comparable to models with up to 4x its size > customizable on a single GPU for any specialized task > LFM2 open-weight license 🧵

English

310

19.8K

GoJo@raazor5050·27 May

@MrAhmadAwais Is this $1 plan to remain there forever or is it just an early user plan to onboard people into the platform?

English

261

Ahmad Awais@MrAhmadAwais·27 May

You can now get like 100x more usage on MiMo models on Command Code. MiMo-V2.5-Pro & MiMo-V2.5 are now ~99% off. Command Code $1 Go plan with $10 in credits will effectively stretches to $50 usage now. On all plans and even extra top-ups. Pick from /model and let's go!

English

127

GoJo@raazor5050·8 May

@businessbarista AI-Native

Português

Alex Lieberman@businessbarista·7 May

I want to start an AI community for executives. This will be a space for people to share killer use cases, agentic workflows/agents, post-AI org structure, AI governance, AI training/enablement, change management, and more. Comment “AI-native” if you want to join.

English

1.8K

1.1K

183.6K

GoJo@raazor5050·5 May

@jatinkrmalik @Zai_org @opencode @thdxr How do you compare performance of Opus/GPT 5.5 with GLM/Deepseek V4/Kimi 2.6 ? As per your raw experience

English

188

Jatin K Malik@jatinkrmalik·3 May

Dev update: Friendship ended with @Zai_org coding plan❌ Now @opencode Go is my new best friend✅ --- On a serious note, the value opencode Go provides is absolutely unbeatable for state-of-the-art open source models. Kudos to @thdxr and team!! Zai got greedy and did not honor my quarterly renewal price and hiked it up! I really did enjoy using GLM 5/5.1 as well.

San Francisco, CA 🇺🇸 English

74.7K

GoJo@raazor5050·29 Nis

@hansent @ChatGPTapp Could you please help with the prompt? This seems really good!

English

199

thomas hansen@hansent·28 Nis

absolutely wild. we gave up on the 2D sprites, but @ChatGPTapp Codex with GPT 5.5 is amazing at three.js. Shipping evening sessions with the kids straight to PoC at jellingstone.com if you want to try it out. A couple of hours of prompting and fun with the kids and now we have Harald Bluethooth in 3D, auto loading world sections, procedural map generation with editor, killable wolves, a ship we can sail around in, much better visuals, sound effects and NPCs with dialog!

English

3.3K

GoJo@raazor5050·29 Nis

@hansent Can you please help with prompt to make it?

English

thomas hansen@hansent·27 Nis

ok. GPT-5.5 is super fun for makig games. only a few prompts in and the kids and I already having a blast discussing potential game mechanics while codex is working on actual sprites even though it already look s kind of cool with placeholder tiles

English

215

GoJo@raazor5050·27 Nis

@akiraxtwo Can you help with workflow? Need one for my project

English

146

Akiraxtwo Super@akiraxtwo·26 Nis

Built with GPT-5.5: a single-file Three.js voxel art scene you can open directly in Chrome. A colorful pagoda garden with cherry blossoms, koi pond, red bridge, torii gate, stone lanterns, bamboo, drifting petals, and lots of tiny voxel details. #GPT55 #ThreeJS #VoxelArt #WebGL #AIart️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️️

English

5.8K

GoJo@raazor5050·27 Nis

@akiraxtwo Can you help with prompts please? Or workflow

English

728

Akiraxtwo Super@akiraxtwo·26 Nis

Built with GPT-5.5: a single-file Three.js voxel action-adventure scene. This version adds stronger physics-style effects: inertia-driven movement, jump and dodge-roll dust, motion trails, hit stop, camera shake, knockback, launch effects, falling objects, and voxel debris with gravity, bounce, and spin. I also added a musou-style crowd combat system: enemies chase the player, and the player can clear groups with normal attacks or a right-click spinning slash. Enemy count is adjustable with a slider, now capped at 1000. #GPT55 #ThreeJS #VoxelArt #WebGL #AI #GameDev

English

468

33.1K

GoJo@raazor5050·27 Nis

@pranaykotas You can use Kokorro TTS too for free

English

Pranay Kotasthane@pranaykotas·26 Nis

New small project: Turn a non-fiction epub or pdf into a 35–45 minute podcast episode and publish it to a private RSS feed you can subscribe to from any podcast app. Code here: github.com/pranaykotas/bo…

English

3.6K

GoJo@raazor5050·26 Nis

@TrungTPhan Wrote some in here: Do give a read: tr33via.online/#/article/the-…

English

Trung Phan@TrungTPhan·26 Nis

ASML’s EUV machine costs $300m+ and its light technology is so precise that do the equivalent of hitting your thumb with a laser pointer from as far away as the moon

Alec Stapp@AlecStapp

Reading about the semiconductor manufacturing process is a mind-bending experience. It’s a miracle that we figured this out.

English

560

175.1K

GoJo@raazor5050·26 Nis

@dreamwieber @playcanvas I tired it. Also it takes a lot of time to render it properly.

English

Gregory Wieber@dreamwieber·26 Nis

@raazor5050 @playcanvas Try doing what I did here and ask the models to get it set up for you! I've done it manually in the past, but you'll likely run into a bunch of questions that it can walk you through.

English

Gregory Wieber@dreamwieber·24 Nis

Also shoutout to @playcanvas — GPT 5.5's Splat Viewer of choice for this experiment. Very quick to integrate with nice, intuitive controls.

Gregory Wieber@dreamwieber

Alright, Codex with GPT 5.5 is completely cracked. This is nuts 🤯 Basically one-shotted my request to create an app that takes a prompt, creates an equirectangular panorama with GPT image 2 – and then use Apple's ML Sharp to stitch a gaussian splat world together.

English

9.5K

Entdecken

@vipulgupta2048 @CommandCodeAI @MrAhmadAwais @_ARahim_ @steipete @bcherny @noctus91 @liquidai