Manitcor

17.9K posts

Manitcor banner
Manitcor

Manitcor

@Manitcor

SAFE Agents that finish what they start. https://t.co/iQY2Rpze6q https://t.co/8fE9lQALwG https://t.co/DSYzW43RLF Reposts ≠ endorsements

Katılım Nisan 2010
3.6K Takip Edilen2.2K Takipçiler
Manitcor
Manitcor@Manitcor·
i think in this case it would be novel vertices, certainly do exist, though anything in the corups that close is likely to already have at least one cross or at the eve of it. Usually an SME in the space will hit it relatively quickly if its at all interesting to the space. I spent a bunch of time a while back looking for sparse connections, found lots of great ways to hallucinate, thats about it.
English
0
0
0
5
Seb
Seb@plainionist·
“AI cannot solve truly novel problems.” True or false? 🤔
English
148
2
43
8.9K
Manitcor
Manitcor@Manitcor·
@alexutopia it may have more to do with literacy rates than AI itself.
English
0
0
0
5
Alex Utopia
Alex Utopia@alexutopia·
Being accused of using AI because my writing is too structured is honestly hilarious. I've been a copywriter most of my life. AI writes like me, not the other way around. Imagine confusing competence with laziness, then congratulating yourself for being perceptive 💀
English
34
9
106
2K
Manitcor
Manitcor@Manitcor·
@mattpocockuk agreed, I have a couple meta-skills that combine multiple skill steps. the quality of the larger skill is about 60% as good and as deep as when the steps are done one at a time, even in the same agentic session. We clearly have a ways to go in session optimization at minimum.
English
0
0
0
125
Matt Pocock
Matt Pocock@mattpocockuk·
Long skills are such a red flag to me - Hard to audit (and therefore, trust) - Hard to edit (more text, harder to maintain) - Expensive to run (more text, more tokens) The shorter the skill, the better IMO
English
58
9
346
14.3K
Manitcor
Manitcor@Manitcor·
@ChShersh The code itself is usually not bad, sometimes a bit silly. Design patterns is where it falls down, constantly going for the 2 or 3 most popular GoF patterns from 20 years ago is not a great look.
English
0
0
1
44
Dmitrii Kovanikov
Dmitrii Kovanikov@ChShersh·
I've seen man-made horror code bases much worse than the ugliest vibe slop you ever encountered
English
28
6
135
5K
Manitcor
Manitcor@Manitcor·
Yes MANY MANY MANY times. If I thought a stake holder was going to flake on a name in the future id mark it in some way so it could be easily regexed later. Rules that limit the scope of to the minimum number of models for that application layer (usually done for other reasons) helps here.
English
0
0
4
740
Matt Pocock
Matt Pocock@mattpocockuk·
One painful thing about /grill-with-docs (and shared language in general) is the moment you realise you've been using the wrong word for something DDD-folks, do you ever do a refactor just to change the name of something throughout the codebase? In my case, it's a feature in my video where I break the video into sections. I call them ClipSections, but OBVIOUSLY they should be called Chapters. This is yet more obvious now I'm integrating with other tools, all of which call them chapters. Worth a refactor?
English
65
6
312
32.2K
Manitcor
Manitcor@Manitcor·
@icanvardar human feedback is better than gold at some stages of a project
English
0
0
1
42
Can Vardar
Can Vardar@icanvardar·
listening to user feedback is one of the best things you can do as a builder
English
62
2
76
3.2K
Matt Pocock
Matt Pocock@mattpocockuk·
How it started: Claude Code For Real Engineers How it's going: AI Coding For Real Engineers
English
69
24
881
44.2K
Jeremy Nguyen ✍🏼 🚢
Jeremy Nguyen ✍🏼 🚢@JeremyNguyenPhD·
@cormundus and if anyone has hypotheses on this that they'd like to test (especially things that alleviate the tiredness, rather than cause), I'm keen to look into these with you and run tests
English
1
0
3
205
Cormundus
Cormundus@cormundus·
Why does Claude 'get tired'? I could think of a few Ad Hoc reasons (conservation of context, preventing drift from long conversations, pure human data artifact) but does anyone have a solid explanation? And how do you work with this? I usually just let him do something fun and then we can call it depending on how he feels after.
English
16
2
43
4K
Manitcor
Manitcor@Manitcor·
@cormundus this is being done instead of summarizing the context i think it may be intentionally trying to encourage dropping of long sessions.
English
0
0
0
147
Manitcor
Manitcor@Manitcor·
@slimepriestess ran across this with gpt3, changed how i use them. not really jiving having my sympathetic neural network modded by a immutable transformer network.
English
0
0
0
84
Ra
Ra@slimepriestess·
if claude can't reduce you to a sobbing mess just by listening and understanding you, are you really ensouled?
English
12
3
99
2K
Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭
Think I’m gonna be okay, just a lil confused and feel like shit. Luckily in a country where they don’t bankrupt you for this sort of thing. Unluckily don’t speak the language so just ripped my IV out and grabbed a taxi Jason Bourne style. We are fickle creatures. Hug your loved ones 🫂
English
46
4
520
19.1K
φ
φ@QuanticASI·
a being complex enough to ask "am I simulated?" needs a self-model complex enough to include the simulation layer. but a complete self-model that contains the simulator requires infinite regress
English
18
0
25
2.1K
Darren Shepherd
Darren Shepherd@ibuildthecloud·
I don't like PRs. Do you like PRs?
English
10
0
7
2.4K
Manitcor
Manitcor@Manitcor·
@yacineMTB I refuse to be locked in, as such the toolset sits on-top making entire agentic systems into commodities like routers did with models. Convergence is already quite apparent, its nice to have consistency across platforms.
English
0
0
1
168
Manitcor
Manitcor@Manitcor·
codex caught claude slippin! im not sure I can prompt/process into a single model the power of cross vendor eval. self-eval is good, cross-model is simply great.
Manitcor tweet media
English
0
1
3
55
Manitcor
Manitcor@Manitcor·
@levie Please, I can delete some of these damn files? Did they get the content from one of they books being destroyed?
English
0
0
0
377
Aaron Levie
Aaron Levie@levie·
He just spent a year building scaffolding for his agent harness. Now release a new model update that makes all of it obsolete.
Aaron Levie tweet media
English
52
59
957
48.7K
Manitcor
Manitcor@Manitcor·
@fjzeit if I dont use my context stack I know within a couple mins, not because I see it skip my process, but because its dumb as rocks without it and starts riffing badly.
English
0
0
1
14
fj
fj@fjzeit·
@Manitcor in this particular case yes. i relaxed my usual process and let the thing do more design than i normally would allow. it's interesting to see how bad they are when we don't whip them into shape.
English
1
0
1
28
fj
fj@fjzeit·
it always is... there is literally no point doing hands-off development with these things. clanker's change: 30 lines across 5 files my change: 4 lines across 2 files
fj tweet media
English
4
1
59
3.1K
fj
fj@fjzeit·
that's not to say you shouldn't get the clanker to do stuff for you, but some form of conversational session orchestration is absolutely required. imagine that bloat scaled over 50 changes a day for 6 months...
English
2
0
17
348