tryingEveryThing

1K posts

tryingEveryThing

tryingEveryThing

@tryingET

Katılım Haziran 2012
1.3K Takip Edilen113 Takipçiler
Maxime Rivest 🧙‍♂️🦙🐧
Markco is for the Markdown collaboration With AI and people and sound like a name so the idea is that the AI collaborator is named Markco. But I dont love either too. I tried: Atelier. For a while, workshop in French or feuille (sheet in French) But naming this one has been hard for me. Mrmd was a temp name maximerivest's markdown. But I don't find myself finding what to move to.
English
1
0
0
15
tryingEveryThing
tryingEveryThing@tryingET·
@trashpandaemoji in the tools we use there is always a progression. You just find the right tool for the job. and Pi is definitively the tool for the exploration and prototyping phase. But when you figured it out, and worked quite well with all the things nico put out. Then it is time to dspy it.
English
0
0
0
22
Trash Panda 🦝
Trash Panda 🦝@trashpandaemoji·
I used it a bit for GEPA, but I never really put 2 and 2 together for our specific use case. This definitely makes way more sense for something more controlled and deterministic like report generation. It would be much easier to implement in our pipeline too. Thanks for the tip!!
English
1
0
1
14
Trash Panda 🦝
Trash Panda 🦝@trashpandaemoji·
I wonder if you could take Pi, package it with skills and prompts, then make it an executable binary with Bun and distribute it? Thinking of doing this at work for a specific agentic workflow with a well defined input and output. Provider is solved via our intranet, and it’s better than just distributing a skill since we might want to run this programmatically in a pipeline.
English
3
0
0
82
Trash Panda 🦝
Trash Panda 🦝@trashpandaemoji·
@tryingET Ah i see what you're saying now! I'm not that familiar with dspy, will check that out.
English
1
0
0
16
tryingEveryThing
tryingEveryThing@tryingET·
@trashpandaemoji I get that, but I still dont get why you just do the systemd or cron job with a run command of dspy. If you want something established it should be an established program with clear signatures. So the structure is 100% repeatable. Use the right tool for the job.
English
1
0
1
24
Trash Panda 🦝
Trash Panda 🦝@trashpandaemoji·
Yeah that would be us iterating on the prompt which we could theoretically do. For distribution, its mainly so we can run this thing hands-off and get a report. I don't want to have to do this on my laptop weekly and distribute the report. Ideally I'd stick it in our existing data pipeline or cron system, it would run, spit out the report into S3 and then get distributed to necessary parties. The hard part is that this is an agentic process, something akin to: Look at this data, extract trends, here are the trends from last week, how are things changing? Anything new?
English
2
0
0
14
tryingEveryThing
tryingEveryThing@tryingET·
@trashpandaemoji True. But don't you want to have a weekly job to be reliable and improve over time? I don't get the nned of distributions. But that is probably a me thing
English
1
0
0
13
Trash Panda 🦝
Trash Panda 🦝@trashpandaemoji·
Isn't GEPA mainly to optimize the prompt? I'm mainly concerned with distribution. For my use case, I built a flexible agentic workflow that outputs a report. Distributing a skill is cool for ad-hoc runs that people can do. But what if we wanted to run this weekly? I haven't seen too many ways to encapsulate a unit of agentic work. 🤔
English
1
0
0
20
Taelin
Taelin@VictorTaelin·
To elaborate - I never liked the "Linux way": > ship a half assed software, and let the user modify it Instead, I always bought the Apple way: > pay us and we'll give you the best possible defaults This worked for me, because I wanted to spend my time writing compilers, NOT fixing driver issues. So, when people told me "Pi comes with just the bare essentials and you can add what you want", that definitely did NOT paint a good picture. But it is different. The time to modify is minimal. "Pi, extend yourself so that I can spawn sub agents in a specific way that works Bend2's prelude" One or two prompts later, and it is done. It modifies itself for what I need and I suddenly have a new tool to help me get things done. It just works. You can't do that with Claude Code and Codex. That said, I'm still not sure that'll always work. How the hell do I make my Pi browse the web, for example? Seems like the author doesn't want it, it is definitely important, and there's no easy / satisfactory way to add 🤔
English
18
1
145
10.7K
Taelin
Taelin@VictorTaelin·
Ok so I thought that was a dumb gimmick but now I'm completely sold on how pi is a self-modifiable software. It literally knows how to modify itself very cleanly and that's extremely useful in practice I'm not using Codex / Claude Code anymore Bend2 should definitely be like this! I mean, constructed in a way that AI's can easily navigate it and know how to modify it to add any feature the user wants. Perhaps we're past the era of open source software and into the era of forkable software, where the most hackable project wins?
English
59
37
864
65.2K
Uncle Bob Martin
Uncle Bob Martin@unclebobmartin·
Mutation testing has a dark side. Not only does it consume rather large amounts of CPU and wall time; but it makes is much more difficult to remove old behavior and replace it with new, "better", behavior. Those extra tests do their job of stabilizing the behavior very well -- perhaps a bit too well. Caveat Emptor.
English
10
2
85
9K
tryingEveryThing
tryingEveryThing@tryingET·
@doodlestein This might work with this inflow you have currently. But when you receive 100s a day it will get harder
English
0
0
1
115
Jeffrey Emanuel
Jeffrey Emanuel@doodlestein·
I know people get really bent out of shape by the onslaught of low-quality "slop PRs" and GitHub issues in open-source projects, but when you automate the entire process, it doesn't bother you at all. Note the 2 "Spam/nonsensical issues" that Claude closed with extreme prejudice:
Jeffrey Emanuel tweet media
English
4
2
49
4.5K
Sawyer Hood
Sawyer Hood@sawyerhood·
does codex have plugins yet? i'm working on a revamped version of dev-browser and want to make sure that i support them (if they exist)
English
7
0
17
3.1K
tryingEveryThing
tryingEveryThing@tryingET·
@mkurman88 Glad you like it. If you have any issues just post an issue there. The ai will take care of it in 1 to 3 days. Usually this is the turnaround for me for his projects.
English
1
0
2
16
Mariusz Kurman
Mariusz Kurman@mkurman88·
@tryingET Bro, I had no idea this existed! Frankentui is like a honeycomb for my FrankQwenStein soul 😅
English
1
0
2
41
tryingEveryThing
tryingEveryThing@tryingET·
@itsyourcode Different parts of a system to build the system for teaching kids based on their individual needs. Middle school teacher
English
1
0
1
16
PME
PME@itsyourcode·
@tryingET Then VSTO/HTN is your "verifier" step. My process conceptually the same, but my verifier is reviewing the code. It is great that you have found a method that is working for you. What are you building?
English
1
0
0
8
PME
PME@itsyourcode·
Agentic coding replaces typing, not thinking. Happy Wednesday!
English
2
0
2
68
am.will
am.will@LLMJunky·
I got so tired of everyone raving about how great cmux is. Panes this. Browser that. EXHAUSTING. And that's because I'm on Linux, where we get none of the coolest toys. So...I built it myself. And my God. You were right. It's amazing. Introducing Limux, a a GPU-accelerated terminal workspace manager for Linux, powered by Ghostty's rendering engine, with split panes, tabbed workspaces, and a built-in browser. Think cmux, but native Linux. If you're interested in something like this, be sure to leave a comment and I'll release it. Special thanks to @manaflowai and @mitchellh for making this possible.
English
71
20
435
45.5K
tryingEveryThing
tryingEveryThing@tryingET·
@itsyourcode True, but this was just an example. My way of doing is: envision the correct version of your intent. Drill down on what you want to built exactly. The end goal. Is it a cli, core plus adapter or whatever. And then you go in an vsto / htn approach until you are done.
English
1
0
1
20
PME
PME@itsyourcode·
@tryingET Yes, that is a narrowly scoped example. Large systems exceed not only the largest context windows many times over, but also the attention capacity of the model within that window (which is considerably less than the window itself).
English
1
0
0
7
tryingEveryThing
tryingEveryThing@tryingET·
@itsyourcode Also we live now in the time where we will see faster and faster iterations of cheaper and better llms come out. Just habe a look at qwen3.5 and minimax 2.7 We are off to the races. Self improving llms based on auto research and improved versions from karparthy
English
1
0
1
47
PME
PME@itsyourcode·
@tryingET That which you cannot verify cannot be reasoned about as the system grows larger. Eventually the LLM suffers this too, and its ability to express your meta-intent degrades. Then who will fix it?
English
2
0
0
14