Max Sumrall

332 posts

Max Sumrall banner
Max Sumrall

Max Sumrall

@MaxSumrall

@picnic

Amsterdam Katılım Mayıs 2008
1.2K Takip Edilen238 Takipçiler
Max Sumrall retweetledi
@Hatim
@Hatim@ItsDonHatim·
@mattpocockuk That grill-me skill + pi-interview extension from @nicopreme is chef's kiss. I can stay in a grilling session for hours. I started using it mainly for fixing bugs. It always finds bad design choices and asks the right questions. It's not just for building new features.
@Hatim tweet media
English
2
3
77
5.5K
Max Sumrall
Max Sumrall@MaxSumrall·
@teslaeurope why can HW3 owners subscribe to FSD? Especially those with EAP, what is the intent of the €49 subscription to FSD?
English
0
0
0
28
Max Sumrall
Max Sumrall@MaxSumrall·
@tvanschadewijk Interesting, there’s a minor lane shift there in the middle of the intersection which it seems to gotten confused by.
English
0
0
1
437
Thijs van Schadewijk
Thijs van Schadewijk@tvanschadewijk·
We are seeing lots of examples of FSD doing really well in the Netherlands, and that’s great. But from my own and my friends’ experience, I know that despite it doing great most of the time, it makes mistakes every now and then. I’d love for these to be shared too so we get a realistic view of EU FSD. Out of the 4 instances I experienced, I was only recording during one - and it was the most serious. The car turned left onto Wibautstraat, a big through-traffic street in Adam East. It then did something really weird: in the flow of traffic after a green light, it suddenly stopped in the middle of a crossing on the right side of the right lane (but blocking traffic from behind), despite the navigation directing it to continue on Wibautstraat. 🤷‍♂️
English
40
13
204
30.8K
Sundar Pichai
Sundar Pichai@sundarpichai·
Introducing Gemini on Mac. It’s the first time we’re bringing the @Geminiapp to desktop. The team built this initial release with @Antigravity, and it went from an idea to a native Swift app prototype in a few days. More features on the way!
Sundar Pichai tweet media
English
529
851
11.5K
892.1K
Lucas Meijer
Lucas Meijer@lucasmeijer·
@onusoz how do we integrate this in a daily workflow though? it's not yelling "now make it simple if you can" after every session. but what is it?
English
4
0
0
152
Lucas Meijer
Lucas Meijer@lucasmeijer·
Agents always want to add. it's low risk. We need systems that push back. That remove. That combine, that simplify.
English
36
30
296
13.2K
Max Sumrall
Max Sumrall@MaxSumrall·
@badlogicgames @lucasmeijer This was so much fun seeing in person with no expectations, and realizing there’s someone else who’s excited about pi and has similar feelings about using llms. So happy to have met @lucasmeijer
English
0
0
4
548
Mario Zechner
Mario Zechner@badlogicgames·
recommended viewing. one more time, on it's own. this is probably yhe most practical talk on using coding agents i've watched to date. watch it. by @lucasmeijer it's also a great demo of pi and captures exactly why i built it. youtu.be/fdbXNWkpPMY?si…
YouTube video
YouTube
English
17
33
568
40.6K
Max Sumrall retweetledi
antirez
antirez@antirez·
The difference between the two, for serious engineering work, is simply brutal. Claude Code with Opus is, when the task at hand is very complicated, borderline useless, while GPT 5.4 can do a reverse engineering mixing: hardware knowledges, major disassembly skills, and so on.
English
9
52
794
109.6K
emot-sun.gif industries
emot-sun.gif industries@jjpcodes·
plugged gstack into @comper_io . doesn't look that wild compared to openclaw honestly? (lots smaller! openclaw codebase is biiiig) like 80% of the code is in the skills; bunch of tests, skills do look like slop tho. not as bad as i thought tbh.
emot-sun.gif industries tweet mediaemot-sun.gif industries tweet media
English
2
1
3
235
Max Sumrall
Max Sumrall@MaxSumrall·
@VoxxedAmsterdam Come to my talk on day 2! It's tips for coding with LLMs: "What AI agents actually need from you". If you can't make it, the tl;dr is install pi from @badlogicgames and skills from @nicopreme, and don't clank it harder than the situation allows.
English
1
2
5
1.8K
Mario Zechner
Mario Zechner@badlogicgames·
@ferologics ain't nobody got time to watch videos. or implement operational transform.
English
2
0
7
1.1K
Mario Zechner
Mario Zechner@badlogicgames·
chat, is he serious?
Mario Zechner tweet media
English
33
0
96
19.4K
Max Sumrall
Max Sumrall@MaxSumrall·
@hibakod Because of this potential bug, OpenAI has reset limits for users.
English
0
0
0
38
Max Sumrall
Max Sumrall@MaxSumrall·
@onusoz Is the visualization just a one-off thing? Looks nice. Making flows seems kind of hard, looking at the examples. Obv not with an agent helping, but a built in viz could help.
English
1
0
0
1.9K
Onur Solmaz
Onur Solmaz@onusoz·
acpx v0.4 ships Agentic Workflows, or as I like to call them "Agentic Graphs" It let's you create node-based workflows on top of ACP (Agent Client Protocol), to drive any coding agent (Codex, Claude Code, pi) through deterministic steps This let's you automate routine, mechanical legwork like triaging incoming PRs, bugs in error reporting, and so on... For example, OpenClaw receives 300~500 new PRs per day. A lot of them are low quality, but they still relate to real issues, so you have to address them somehow You need to: - extract the intent - cluster them based on intent - figure out if the proposed changes are legit, or whether they are slop local solutions, like trying to catch flies instead of drying out the swamp - if the PR is too low quality or the intent is not clear, close them - run AI review on them them and address any issues that come up - refactor them if the changes are half-baked - resolve conflicts - and so on... So that when the PR is presented to the attention of the maintainer, all the routine legwork is done and the only remaining thing is the decision to (a) merge, (b) give feedback to the PR author, or (c) take over the PR work yourself I wanted to build this feature since a couple months now, since Codex got so good. OpenAI models are now good at judging implementation quality, so I found myself repeating the same steps I wrote above over and over I also tried putting all this in a single prompt. But I believe there are workflows that should not be a single prompt, but a sequence of prompts in the same session That is because like humans, LLMs are prone to PRIMING. I claim that putting all steps in the same prompt at the beginning of the context will generally give suboptimal results, compared to revealing the intention to the model step by step Creating such a workflow also gives more OBSERVABILITY into the each step that an agent is supposed to take. Agent generates JSON at the end of each step, and that structured data can be used to monitor thousands of agents running at the same time in an easier way, on a dashboard Similar features have been introduced in e.g. n8n, langflow. But AFAIK they are not integrating ACP like the way I do I wanted to have a fresh approach, and to build an API that I can develop freely the way I want, so I created a new workflow API inside acpx The video is from the workflow run viewer, but that is not where you build the workflow. You build it by using the acpx flow typescript API. See examples/pr-triage in acpx repo Before building that, I started from a Markdown file with a Mermaid chart of the flow I had in mind. The Markdown file acts as a spec for the flow, and I have built the workflow through trial and error. I call this process "workflow tuning" I started working on acpx repo PRs one by one, tuning the flow, slowly scaling to more PRs. Finally, when I felt confident, I ran it in parallel over all external open PRs in the acpx repo. I believe it already saved me hours this week My next goal, if well received, is to set this up on a cloud agent so that it can process the 300~500 PRs the OpenClaw repo receives every day, in real time, as they come in I believe this will save all open source maintainers around the world countless hours and make it much easier to herd and absorb external contributions from everyone!
English
80
99
1.1K
114.7K
emot-sun.gif industries
emot-sun.gif industries@jjpcodes·
thoughtful post. you're totally right, but i'd limit this to the subset of software that _actually matters_. (you sort of touch on this but not super explicitly). a lot of software doesn't matter, including a lot of the slop-projects that many of us ship. those are, well, tech demos at best, and it's fine, if not great, for them to be hastily thrown together slop. we can now validate faster - and that's a great thing! if you can clank out a slop-startup in 2-3 weeks instead of 9 months, get PMF for a tool and then re-write, thoughtfully iterate - and make a good version (that scales, has real users, etc), i'd say that on balance that's a good thing. obviously, Serious Business Software should probably not be clankin it to the max and bringing society down with them. and boasting about having INT_MAX commits per day is very silly. but at the same time, we can still have some fun clankin it for the majority of throwaway, inconsequential, toy software. as long as we recognize it for what it really is.
English
3
0
9
5.5K
Mario Zechner
Mario Zechner@badlogicgames·
i love this comment. it illustrates how we software people permantently get real-life wrong, then build systems that are supposedly helpful, when they are not. bonus points if you claim the system replaces humans. the world is more complex than it seems. domain expertise is still everything.
Mario Zechner tweet media
English
23
30
381
27.5K
Max Sumrall
Max Sumrall@MaxSumrall·
What week is it? Week 14? Or 13? You can just do things. So I made a menubar app that tells me what week it is. (Maybe it already exists, but building it with codex was easier). I even spent the time to get it notarized by apple, so it just works.
English
1
0
1
73