matti timonen

178 posts

matti timonen

@timonen_matti

Cloud Adoption Lead@OP Financial Group

Oulu, Finland Katılım Ekim 2014

206 Takip Edilen72 Takipçiler

matti timonen@timonen_matti·2 May

@OrenMe @code Can you give an example how end users prompts or orchestration can impact caching?

English

Oren Melamed@OrenMe·2 May

New token controls dropping in @code insiders You can now view your cached tokens in the Agent Debug Logs and drill down using a new Cache Explorer View each turn, prompt signature(system, user, assistant, tool and drift), request components and cache performance stats and insights like "where the cache broke" etc This is a great tool to learn how to effectively re-construct your prompting for better model and cost utilization

English

5.1K

matti timonen@timonen_matti·30 Nis

@mattpocockuk Prd, adr’s are product level. Spec/why and to what end (epic), plan/how to achieve it i.e approach, tactics, phases, ordering (story), tasks/what exactly (step/task/issue). They essentially map to an system with a simple table since it’s just agreed terminology anyway.

English

Matt Pocock@mattpocockuk·30 Nis

I'm thinking of renaming /to-prd and /to-issues PRD is super-specific to a certain style of document Issues are specific to GitHub The important thing is that the PRD is a multi-session plan and the issue is the plan in that session. Some options: 1. /to-mission-brief, /to-session-briefs 2. /to-epic, /to-tickets Or, come up with your own!

English

180

17.8K

matti timonen@timonen_matti·27 Nis

@OrenMe A big part of my job for the past six months has been to maximise output quality of workflows. Next six months to minimize token usage. Fml.

English

715

Oren Melamed@OrenMe·27 Nis

I will totally need to reevaluate how I work with AI now But this makes a lot of sense, it is not really sustainable to manage economics when some people consume millions of tokens per single premium token Nevertheless it’s hard to grok for anyone who got used to taking advantage of sub agents not costing additional budget 🙃

GitHub@github

Starting June 1st, GitHub Copilot will move to a usage-based billing model as GitHub Copilot supports more agentic and advanced workflows. In early May, you'll see a preview bill experience, giving visibility into projected costs before the transition. 👉 Read more about the upcoming change: github.blog/news-insights/…

English

235

41.4K

matti timonen@timonen_matti·27 Nis

@github Are you kidding me? We have 2000+ licenses and there’s no benefit whatsoever to that commitment?

English

196

GitHub@github·27 Nis

English

529

935

2.9K

3.7M

matti timonen@timonen_matti·25 Nis

@github @OpenAIDevs Most people commenting still don’t really understand how ghcp pricing works. 1x user request is about 4 cents. So 30 cents for 7.5x. You can do a lot with one user request if you optimize towards that, which you can do with subagents (free) and askQuestions tool. Still cheap.

English

GitHub@github·24 Nis

🆕 @OpenAIDevs GPT-5.5 is now generally available and rolling out in GitHub Copilot. Our early testing shows ➡️ It delivers its strongest performance on complex agentic coding tasks ➡️ It resolves real-world coding challenges previous GPT models couldn’t Try it out in Copilot CLI or @code. 👇 github.blog/changelog/2026…

English

101

499

240.2K

matti timonen@timonen_matti·25 Nis

@MindTheGapMTG @github @OpenAIDevs Github copilot supports parallel subagents and nested subagents support, and also multiple concurrent session are working fine. Cli, sandboxed, and cloudagents as well. It’s not a single agent tool.

English

147

Chen Avnery@MindTheGapMTG·25 Nis

@github @OpenAIDevs "Complex agentic coding tasks" sounds great until you realize Copilot is still a single-agent tool. The bottleneck in production isn't the model - it's constraint files, recovery logic, state management. Ship GPT-10, won't matter without multi-agent coordination.

English

1.5K

matti timonen@timonen_matti·24 Nis

@d4m1n Yeah, but absolutely no one wants to use O4.7, cause it won’t do anything the way you want it done. The benches don’t really matter at that point.

English

Dan ⚡️@d4m1n·23 Nis

GPT 5.5 benchmarks BUT w/o hiding the ones where they lose to Opus 4.7 lol also check vs Mythos marketing bastards got me again

OpenAI@OpenAI

GPT-5.5 delivers this step up in intelligence without compromising on speed. GPT-5.5 matches GPT-5.4 per-token latency in real-world serving, while performing better across nearly every evaluation we measured. It also uses significantly fewer tokens to complete the same Codex tasks, making it more efficient as well as more capable.

English

122

136

1.5K

348.5K

matti timonen@timonen_matti·24 Nis

@OrenMe @code I accidentally discovered this just a few days ago as i was asking the agent to figure out how to get multi-model review working with my reviewer subagent, and then it just ran it. Had to look at debug logs to see if it really was that easy 😄

English

Oren Melamed@OrenMe·24 Nis

This dropped almost 2 weeks ago and not sure I saw this around - you can pass optional model name to subagent now in @code It opens the path to things like multi model review and more fleet like or multi agent multi model orchestration

English

2.3K

matti timonen@timonen_matti·24 Nis

@muzib__ @pierceboggan Same experience. pretty much the only 0x model that does things roughly right and can provide value in real workflows. Only available on consumer side though for more training data i guess. hopefully it'll be finetuned on to a more recent base model later.

English

Mozibur Rahman 🇧🇩@muzib__·23 Nis

Raptor mini is Github Copilot is surprisingly good, especially considering its 0x multiplier and 264K token window. Its really underrated. I am surprised that MS is not taking it seriously (or, are they?) When are we going to get Raptor Max with , lets say, 0.33x? @pierceboggan

English

4.4K

matti timonen@timonen_matti·24 Nis

@mattpocockuk I use orchestrator setup with a clearly mandatory arch, plan subagent steps and HITL defined. Started a run with Opus 4.7 with grill-me, and while it did ask me about 10 questions, it completely skipped repo discovery, and the required workflow. straight to building! That model🤯

English

906

Matt Pocock@mattpocockuk·24 Nis

Starting to notice that even with /grill-me, Opus 4.7 w/ Claude Code jumps straight to implementation 😡 Just WAIT until we're aligned, silly harness

English

109

739

151.5K

matti timonen@timonen_matti·19 Şub

@realmcore_ You’re right, I should have stopped at the restaurant because you’re hungry. But we are at B now so we accomplished the task correctly. Do you want me to find you a restaurant here?

English

matti timonen@timonen_matti·19 Şub

@realmcore_ We have our map from A to B, and it doesn’t include a stop at your requested restaurant, because we’ll get to B faster without it.

English

akira@realmcore_·19 Şub

Sonnet 4.6 is an incredibly weird model First time I have both immediately thought a model is dumb and smart. It's incredibly agentic. As in, it feels almost like it has a will of it's own. I don't think I trust it

English

4.9K

matti timonen@timonen_matti·19 Şub

@realmcore_ I get why anthropic is blocking them from other tools. It’s cause they’re turning unusable in others.

English

matti timonen@timonen_matti·19 Şub

@realmcore_ I just hit this. Been working on orch/subs dev system. keeping it general and trust model judgement to decompose/delegate. Really good results, until opus-4.6. It just ignored everything. Turns out the model thinks about time/complexity completely different. -> New scoping model

English

167

akira@realmcore_·18 Şub

Feels like everyone making their own agent stumbles across the same primitives and thinks they solved something Let me save you some time (read this, it's funny and useful): - You're going to make an agent - You're going to run it on benchmarks > It's going to suck - You're going to make a tool to analyze traces - You're going to say this helped you > It wont work - You're going to think about role based agents for solving a single task - You're going to make a workflow for solving a benchmark > Both will work. Neither are generalized - You'll think you made it > It will be nearly unusable by an end user > Back to square one - You're going to realize you're stuck with a for loop - You're going to think about swarms > In swarms single agent usability doesn't matter - But wait you need a task manager - But wait you need a merge queue - But wait you need compression for long jobs > Compression is a foot gun - But wait now you need an agent to manage it all - But wait now you need something that checks to make sure the manager is managing - You're going to go back to single loop agents - Well, subagents seem like the way to do all of this > Bam! Plot twist: subagents are hard to do well - You're going to think "Hmm well subagents isolate context" because said so - You're going to start to look at other agent implementations > How have they all solved compaction, multi-agent, task management, memory etc.? - You're going to realize it's all just tradeoffs, but most of them have only one side people care about - "Oh it's all just context engineering" > Yep. But it has to be good and it has to be general. > Back to the starting loop. Rinse and repeat. Congrats. Keep it simple. Keep it general.

English

708

56K

matti timonen@timonen_matti·18 Şub

@svpino Secrets are hard locally. Doing two things about this. I use cross-keychain package for all secrets on openclaw. JIT inject to any local build/task. Not great still, but better than nothing. And I’m building openid4vc wallet for agent credentials and 2fa mechanics, DPoA, etc.

English

Santiago@svpino·17 Şub

Your agents can work autonomously, but they still use a hardcoded API key stored in a configuration file. Let that sink in. We are building autonomous machines with access to our most critical infrastructure, and securing them with a password on a sticky note. This is crazy. Identity is one of the biggest problems with agents right now. Deployed agents touch real infrastructure: databases, APIs, file systems, MCP servers, and other internal services. These agents act autonomously, make decisions, and access live resources. The "solutions" I usually see: • Hardcoded API keys and persistent credentials in config files • Shared service accounts with way too many permissions • Long-lived credentials that never get rotated • Zero visibility into what agents accessed what data This might work for traditional software, but agents are a completely different animal. Agents operate continuously, make non-deterministic decisions, and follow different execution paths every time they run. The old identity models built around static roles and long-lived secrets simply don't fit.

English

150

18.4K

matti timonen@timonen_matti·17 Şub

@theo I got the 20 buck plan for testing 5.3-codex, and got to weekly limit in under 2 days. Openclaw 30 minute intervals on 2 projects (small iterations). codex only for coding tasks while openclaw main threads with gemini-3-flash. 200 plan needed for sure. 5.3-codex is good though.

English

181

Theo - t3.gg@theo·17 Şub

I've had 3 or more agents running in parallel with Codex for 2+ hours. I've used 8% of my 5-hour window. 2% of my weekly. I am literally trying to hit the limits and still can't.

English

322

2.8K

271.4K

matti timonen@timonen_matti·13 Şub

@techNmak While i use opencode, and pi in scripts , i still prefer to have vscode open all the time. Usually few of them. you can run multiple parallel sessions in one, although the session specific branch would be nice. With subagents it’s easy to get to a few hour sessions. Works fine.

English

Tech with Mak@techNmak·13 Şub

Unpopular opinion: VS Code is holding back AI coding. I’ve been spinning up multiple Cline instances across tmux panes and terminal tabs. Each agent has its own isolated state and can run a different task, branch, or idea in parallel. They just keep going while I focus on something else, no context collisions, no babysitting a single chat thread. You can pipe input, chain output, or run them headless in CI/CD. It fits into the workflow I already use in the terminal. Getting started literally took me 30 seconds: npm install -g cline AI is finally just another process in my workflow, instead of a panel or plugin.

Cline@cline

Introducing Cline CLI 2.0: An open-source AI coding agent that runs entirely in your terminal. Parallel agents, headless CI/CD pipelines, ACP support for any editor, and a completely redesigned developer experience. Minimax M2.5 and Kimi K2.5 are free to use for a limited time. From prompt to production. All in your terminal.

English

168

43K

matti timonen@timonen_matti·7 Şub

@latentsaint I prefer an IDE, and vscode is better than any of the forks. Gh copilot has really come a long way over the past 6 months. Not really missing anything worth crying over at this point.

English

212

Tom@latentsaint·6 Şub

does anyone use github copilot?

English

118

68.7K

matti timonen@timonen_matti·6 Şub

@nateberkopec The only reason to do it is context management. you need strict minimal input/output definition, and clear state sharing between the subagents, and it'll get complicated if you're freely adding agents. Ralph looping solutions actually need to enforce similar things.

English

Nate Berkopec@nateberkopec·5 Şub

Most of the multi-agent/orchestration stuff I see on here is just disappointing. I'm extremely skeptical that agent "roles" ("You are an expert frontend tester...") and "personalities" ("I have a team of 10 agents! Here's Bill, Jill...") are worth fuck-all.

English

301

29K

Keşfet

@OrenMe @code @mattpocockuk @github @OpenAIDevs @MindTheGapMTG @d4m1n @muzib__