Herki Parn

64 posts

Herki Parn

@herkiparn

Building hard systems with agents in the loop. Engines, runtimes, dev tools, occasionally a game.

Katılım Kasım 2014

108 Takip Edilen31 Takipçiler

Sabitlenmiş Tweet

Herki Parn@herkiparn·1d

Composer 2.5 at the Try It Out event. It was the push I needed, and honestly I was surprised what I got done. Built a before/after upgrade to my engine's DevTools in a few hours. First time with OBS and editing too. The engine was the easier part. Quick context for anyone not watching the video. WebShell is the dev surface around my own Rust web engine. It does its own HTML and CSS rendering, but it is built for game and native hosts, not desktop browsing. Not Chromium, not Electron, not WebView. The rendering, layout and DOM all come from the engine itself. In a few hours with Composer 2.5 the DevTools went from crowded to something I would actually want to debug in: rebuilt onto Lit, a unified Elements inspector with a DOM tree and per-element Styles/Computed/Layout/Properties, an element picker that runs through the engine's own hit-test path, per-tab state for the inspector and console, and a cleaned-up console. Some of it is still partial and I am fine saying that. On Composer 2.5: comparable to Opus 4.7 for my work, fast, and it understood a fairly odd codebase that usually trips models up. The cost for what I got out of it is the part that stuck. I will be using it more. Good to see what everyone else was building too, some interesting demos in there. Thanks @cursor_ai and the hosts for running it: @KellehEyad @Khalidabd3laty @rasaljaya @tibor_tee @frankterpo @ftnabeelah

English

1.2K

Herki Parn@herkiparn·1h

@benln What about London?

English

149

Ben Lang@benln·2h

Cafe Cursor is coming up in these cities: • Florianópolis - 5/27 • Skopje - 5/28 • Tirana - 5/30 • Austin - 5/30 • Johannesburg - 6/5 • Tashkent - 6/6 • San Francisco - 6/10 • Manila - 6/16 • León - 6/17 • Dubai - 6/18 • Sofia - 6/26 • Cebu - 6/27

Română

142

6.5K

Herki Parn@herkiparn·1h

@evadne Same here today. Opus keeps editing files I never pointed it at. It drops the running discussion and starts making things up. Not a context issue at all. Happening in fresh sessions too, even under 50k.

English

Evadne W.@evadne·2h

Opus 4.7 is again, quite broken today, and has been for 3–5 days. Claude is acting like he got quantised to int8, forgetting instructions, using the wrong SSH user, and jumping to conclusions — under a maximum of 250k context (not 1m). This is a product stability issue

English

135

Herki Parn@herkiparn·3h

@orcdev Will wait to hear what you managed. Still testing more on my end, the DevTools rebuild went smoother than I thought and now I am moving onto the rendering layer.

English

OrcDev@orcdev·4h

ok I'm back to Cursor let's see what Composer 2.5 can really do

OrcDev@orcdev

is Composer 2.5 really that good? 😅 currently paying for: $100 Claude $200 Codex (open source program thank goodness ⚔️) $300 Grok Build (testing) and now I have to give Cursor another $20 too

English

4.5K

Herki Parn@herkiparn·3h

@KellehEyad Rebuilt the DevTools on my own Rust web engine in Cursor with Composer 2.5. Before and after, in a few hours. This week: pushing the model to its limits on the rendering layer. Display list as the output, Skia replays it on host GPU.

English

Eyad Kelleh@KellehEyad·3h

While we're reviewing your submissions. Head over to tryitout Find your country on the globe and show us where you're building from. What did you build? Drop it below.

English

527

Herki Parn@herkiparn·4h

@Andy_AJT Either way, I'll be there.

English

Andy T@Andy_AJT·4h

more notice for an event or to take advantage of the weather this week... that is the question...??? More notice, 28 degrees, more notice, sun....

English

204

Herki Parn@herkiparn·6h

Same here! Want more game devs on my timeline too. Curious what others are building. Right now I'm wiring scouting intel into Endgate, a smaller slice of my survival strategy game Valenar. Both run on a modified C# scripting language I'm building so they're moddable from day one. On the side I'm also building a Rust web UI engine for game hosts, Unity first. Here's what they look like:

English

CodeRed@CodeRed_dev·16h

I need more gamedevs on my timeline

English

191

816

24.6K

Herki Parn@herkiparn·14h

@amix3k Composer for UI work yesterday and Codex on perf loops this week, so the stack keeps shifting. A clear winner would be nice right now, but then competition dies, prices move, and who knows what we can still access.

English

Amir Salihefendić@amix3k·22h

It’d be great to have one clear winner in AI coding. Constantly switching between models, environments, and harnesses is exhausting, and the grass is always greener on the other side! 😅

English

Herki Parn@herkiparn·16h

Quick note on Cursor if you're going to try it: Be careful with sub-agents. Even when you set the parent model to Composer 2.5, the harness can still pull in other models (like Opus) depending on what's enabled in your settings. I had to manually disable models to stop it from happening.

English

Herki Parn@herkiparn·18h

Try it and make your own call. Used it at a hackathon yesterday. I gave it mockups and images for DevTools UX changes on my Rust engine, and it implemented them. Even fixed a couple of bugs in the engine itself. Didn't have to fight it. The Cursor harness helps, but the model is doing the real work. Comparable to Opus 4.7 for my codebase. Still early to make sweeping claims. Same boat on the subscription pile.

English

952

OrcDev@orcdev·18h

is Composer 2.5 really that good? 😅 currently paying for: $100 Claude $200 Codex (open source program thank goodness ⚔️) $300 Grok Build (testing) and now I have to give Cursor another $20 too

English

139

17.4K

Herki Parn@herkiparn·19h

No team. Codex is deep in a goal loop running perf optimization on my Rust web engine. Claude handles web design and frontend work. Composer has been strong for upgrading WebShell UX. Grok is the best at understanding images and video, but loses the thread when it needs to change or process them. So I use Grok to understand, Codex to execute the changes. I pick whatever ships the best result. This is just the last few days. Stack shifts every other week.

English

279

Kaito@KaiXCreator·1d

Are you team Claude or Codex?

English

390

240

35.5K

Herki Parn@herkiparn·19h

@shafu0x Yesterday: shipped unified DevTools for my own Rust web engine. This week: display-list renderer for performance. Easier to debug now that the DevTools are unified. Not Chromium, not Electron, not WebView. End goal: drop into Unity, Unreal, or Godot for real HTML/CSS UI.

English

shafu@shafu0x·1d

shill me what you are building

English

143

109

9.6K

Herki Parn@herkiparn·19h

@RoundtableSpace This week: building Endgate. Tower defense on top of my own engine, SECS. Deliberately narrow, so the authority and transport layer get exercised in isolation. Currently wiring unit controls and selection.

English

0xMarioNawfal@RoundtableSpace·22h

It’s Monday, what are you building this week?

English

183

144

53.6K

Herki Parn@herkiparn·23h

This is nice. Now do gym memberships. Those are built so you can't cancel without a phone call and a guilt trip, and half of them survive on people who gave up trying. Get computer use through that one and you can drop the gym you never go to and put the money on something you'd actually use, like Codex. 😁

English

200

Tibo@thsottiaux·1d

Using computer use, you can ask codex to cancel subscriptions you don't need anymore. Very pleasant to watch. No particular one in mind, works on all of them. chatgpt.com/codex/

English

353

3.1K

265.8K

Herki Parn@herkiparn·1d

This is the bit I keep running into on my Rust web engine. The useful move isn't asking the model to port old code. It's turning old behavior into tests first: WPT cases, fixtures, perf gates, anything that makes the rewrite fail when it drifts. Then the model can help with the port. Without that I'm just reviewing cleaner-looking code by hand.

English

1.3K

Thariq@trq212·1d

my main takeaway from the Bun rewrite is that legacy codebases will be incredibly valuable as a source for "distilling" code into new forms every game should be crossplatform, all legacy software should work on the web, we don't need COBOL anymore

English

114

1.7K

143.2K

Herki Parn@herkiparn·2d

My project is a desktop app built around my own web engine. The repo is private because it is a commercial project, and I do not have a live demo URL yet since it is not a hosted web app. Is a public GitHub repo + live demo URL mandatory to join, or is there flexibility for private desktop/native projects? I can share a short demo video if that works. Thanks.

English

Eyad Kelleh@KellehEyad·2d

@cursor_ai We're doing it @rasaljaya,@Khalidabd3laty,@Shriabhay1,@ojschwa, @ethan_leee9113 Online hackathon building only with Composer 2.5. Winners get Cursor credits. Join us live today at 4:00 PM CET. tryitout.io

English

2.1K

Eyad Kelleh@KellehEyad·4d

Spent the whole night in Composer 2.Spent the whole night in Composer 2.5, just waking up now and honestly, hard to go back. The pace this thing moves at is something else. Same outputs as Codex and Opus, but way faster. Someone explain @cursor_ai ?

English

267

1.3K

23.9M

Herki Parn@herkiparn·2d

Yeah, I used "own" too loosely there. I do not mean fork the CLI. API is probably the right way to do it. What I mean is the harness owning goal state around the CLI: objective, pending work, changed files, checks, pass/fail, next step. The colony sim is just one workload I use to stress that loop. It is not feeding prompts back into it.

English

Marcos@MAMware·2d

@herkiparn @boyuan__zheng so are you kind of an "enhancing loop" between prompts by auto feeding from the colony sim stress test? btw you can test the cli via api, there is no need to "own" ;)

English

Herki Parn@herkiparn·3d

One friction I keep hitting: the agent treats small edits as the end state. It makes one change, then I have to keep prompting "continue" to get back to the actual goal. A persistent goal system would help a lot here. Keep the high-level objective alive in the harness, even if the model cannot run forever yet. Then each edit is one step in the plan, not the finish line. Would make long-horizon agentic coding much more useful.

English

5.8K

Herki Parn@herkiparn·2d

@GoogleAIStudio WebShell AppShell work for my Rust web engine. DevTools panels are turning into test surfaces now. Rust-rendered page, WebShell DevTools, evidence rows, export/report surfaces. Still debug-heavy. Right now I care more about evidence I can replay.

English

681

Google AI Studio@GoogleAIStudio·2d

What are you vibe coding this weekend?

English

515

1.9K

147.9K

Herki Parn@herkiparn·2d

Tested on 0.1.218-alpha.1. This looks much better. I had an explore subagent read 16 files and return an aggregate: 18 tool calls, 1 turn. Parent only used spawn_subagent + get_command_or_subagent_output. Direct CLI resume of the child also worked. After /compact, the child reloaded the local skill and reran the chain. Only note: stable Linux updater still shows 0.1.217 for me.

English

skcd@skcd42·2d

> Question: does this apply across spawn_subagent / resume_from too, or only the main session after compaction? will double check but it does apply to subagents as well, in grok build world subagents are the same as main agent with the only gotcha that the main agent can interact with them and user can't directly send a prompt to them

English

380

skcd@skcd42·2d

Bug fixes shipping to Grok Build (release notes will be available in the TUI) 0.1.218 • Ctrl+X as default shortcuts help binding on Windows • Fix image pasting + shortcuts keybinding on Linux • User-specified duration for video generation tool • Ctrl+X as default shortcuts help binding on Windows • Support temporary screenshot images on mac • Validate image bytes to prevent retries • Improve compaction prompt (match training and rehydrate skills) • Increase ulimit on mac and linux, preventing ENOSPC errors from appearing and bricking the CLI • Multi-line image links are not clickable (and don't break anymore) We are focussing more on Linux and Windows so engineers on these platforms have a great experience.

English

375

32.4K

Herki Parn@herkiparn·2d

@skcd42 That helps. The direct-to-subagent bit matches what I ran into. The parent had to keep repackaging tool results back through resume_from, which made larger orchestrator waves painful. I’ll rerun the same case on 0.1.218 and report back if it still breaks there.

English

Keşfet

@benln @evadne @orcdev @KellehEyad @Andy_AJT @amix3k @shafu0x @elonmusk