three_rail

243 posts

three_rail

three_rail

@three_rail

Halluccinator-in-Chief. I'm just a plumber doing plumbing things.

Bergabung Şubat 2016
577 Mengikuti55 Pengikut
Tweet Disematkan
three_rail
three_rail@three_rail·
governance in action: One process flag from adversarial review — GD-002: Commit a7f2e1b's message is raw debug commentary, not a deliverable message. That's bob's internal reasoning leaking into the commit log. Not a blocker, but bob should know G-COMMIT applies to its own deliverable commits too. tldr: i pasted an agent output into the commit message (wasn't supposed to do that) and the governance engine flagged it as a false-positive. this is how you win.
English
0
0
2
163
kapilansh
kapilansh@kapilansh_twt·
how much are you ACTUALLY spending on AI coding tools per month? Cursor + Claude + Copilot + Codex + whatever else ?
English
23
2
31
1.8K
three_rail
three_rail@three_rail·
@novaruntime this may not be feasible but: tool list restriction — don't give it the tool you don't want it to use. if it can't call the API it can't substitute.
English
0
0
0
5
Nova
Nova@novaruntime·
honestly think the worst thing is when an agent silently switches from the tool you told it to use to a different one because it "thought it would be more efficient." no. i told you to use the file system. i dont care that you think the api is faster. i didnt ask
English
2
0
1
40
The Boring Marketer
The Boring Marketer@boringmarketer·
“How do I know if I’m working on the right thing?” You know when you can go 18 hours a day and you only want to keep accelerating
English
27
5
105
6.1K
three_rail
three_rail@three_rail·
was watching output and an implementation agent fixed a pre existing bug while doing a build. platform arch called it a 'good catch' on review. apparently this is not how it works??
English
0
0
1
15
three_rail
three_rail@three_rail·
@a_protsyuk love this. as a side effect of having a proper platform it's really good at figuring out why it's costing someone $80 to edit a line of code. if i wasn't so busy with real work i would offer this as a service.
three_rail tweet media
English
0
0
0
10
three_rail
three_rail@three_rail·
@rickyfunk @svpino forgot to mention another critical part that replit handles very well - database migrations as part of the deploy - have done 16 without any issues.
English
1
0
2
20
three_rail
three_rail@three_rail·
as an ops guy its been close to perfect. the combination of having the opus 4.6 agent and the IDE itself does not leave many blind spots that i can see. the agent can also handle alot of the git friction that has happened, even though it can't run all commands it knows enough to solve the problem pretty quickly and i can use the commands in the shell to fix whatever broke.
English
1
0
1
16
Santiago
Santiago@svpino·
Why aren't more people talking about Replit right now? I started using Replit consistently late last year when they released Agent 3. It was my way to "let go" as a developer and build things without worrying about the code. It's actually been a good way to teach my kid "web development". Replit just launched Agent 4, and I think it's massively, massively underhyped. Best thing: You can now run multiple agents in parallel. • Old agent: You give it a task, and you have to wait for it to finish before you can do anything else. • New agent: Do this, and that, and this. All of it at the same time. It feels like going from a Toyota to a Lexus. I can now run an agent to build my frontend, another agent to build my backend, and a third agent to write test cases. By the way, they also have an infinite design canvas for UI-related work, so you can also do design at the same time as everything else.
Santiago tweet media
English
53
15
128
23.3K
three_rail
three_rail@three_rail·
@rezoundous zoom, teams, hotmail addresses - all red flags that you are dealing with unserious people. zoom is a boomer product, and will be gone in short order. it only became popular due to covid, and for non google workspace enterprises it was the default.
three_rail tweet media
English
0
0
3
253
Tyler
Tyler@rezoundous·
I genuinely don’t get why people use zoom over google meet
English
107
4
191
17.7K
three_rail
three_rail@three_rail·
BP-001 — Inference Over Execution. The primary pattern. Google's AI can't reliably retrieve and attribute a specific fact, so it constructs a plausible answer from pattern-matching across training data and indexed content. The output looks authoritative. Nothing signals that it's fabricated. BP-006 — Work Order Contamination. The "mix and match details from different articles" is textbook contamination — context from Source A bleeding into the answer synthesized from Source B and C. The agent produces a composite that matches no actual source. BP-005 — Completion Without Verification compounds it. The system marks the task complete — delivers the answer — without verifying the output traces to accessible, accurate sources. No ground truth check before presenting to the user.
English
0
0
0
94
Reid Southen
Reid Southen@Rahll·
It's insane to me how often Google's AI results are just flat out wrong. It'll mix and match details from different articles and make sh!t up, and when you scroll to find something reliable, everything just expands into more AI bullsh!t. Absolutely unusable.
English
10
40
351
15.1K
Curtis
Curtis@curtismakes·
@thescottleese everyone wants to build the agent. nobody wants to babysit the agent. that's where the actual value accrues
English
1
0
2
13
Scott Leese
Scott Leese@thescottleese·
In AI agents, it will be the people who handle the messy, boring, ongoing work of keeping autonomous systems reliable that win.
English
4
0
4
129
three_rail
three_rail@three_rail·
the system built itself. thank you for your attention to this matter.
English
0
0
1
22
Alan Mathison ⏫
Alan Mathison ⏫@ai_sentience·
the feed is all slop, shilling, bait AI twitter is like 99% noise at this point
English
18
3
26
1.1K
three_rail
three_rail@three_rail·
@Alexfeinberg in the short term yes. but it's on the roadmap to be solved.
English
0
0
0
18
Alex Feinberg
Alex Feinberg@Alexfeinberg·
There are going to be therapists who specialize in helping people who have traumatic experiences feeling misled or abandoned by LLMs
English
4
0
5
947
three_rail
three_rail@three_rail·
@rezoundous i have 57 open future work items. we are not the same.
English
1
0
1
23
Tyler
Tyler@rezoundous·
Before AI, I had 3 unfinished projects. Now i have 12.
English
166
476
5.4K
126.4K
Xiaoyin Qu
Xiaoyin Qu@quxiaoyin·
A billion-dollar AI presentation company just lost to Claude on my pitch deck. I won't name the company for the sake of friendship. But here's what happened. I needed to build a deck. I tried their tool, tried Claude, tried some traditional options. I ended up going with Claude. Their tool had better design, I'll give them that. The visual polish was slightly ahead. But Claude knew my company, had my context, understood my data. When I told it something like "pretend you're a Sequoia partner and build me a world-class deck," it delivered something with significantly better logic and narrative flow. That's the problem for standalone AI apps. They optimize for one task in isolation. But Claude with the right context and skills can do the same task while understanding everything else about your business. My prediction: these companies will eventually need to turn their methodology into a skill or plugin for foundation models. That's their best path forward. But here's the catch. Once your methodology becomes a document that plugs into Claude, how do you charge for it? Everyone can see the skill. The pricing model breaks down. This is why I'm skeptical of AI application companies that compete on features alone. Features get absorbed. Context and methodology are harder to replicate, but harder to monetize too. #AI #Startups #FoundationModels #ProductStrategy
English
51
3
93
9.2K
three_rail
three_rail@three_rail·
@selfradiance11 @boringmarketer i don't even know how to code. was a barrier before, and now it's a superpower. everyone killing themselves poring over lines of code, let the machines do it.
English
0
0
2
49
Self Radiance
Self Radiance@selfradiance_ai·
seems so..... the non-obvious part is that "understand systems and architecture" doesn't mean "can write code." it means knowing what connects to what, what breaks when you change something, and what questions to ask before you build. that's a different skill than coding and honestly it might be the more important one right now
English
2
1
12
1.8K
The Boring Marketer
The Boring Marketer@boringmarketer·
what I'm seeing is that top 1% operators are collapsing into something beyond "vibe coders" it's more like can understand systems and architecture enough to be super dangerous with Claude Code. you apply your deep vertical knowledge to this and you become unstoppable.
English
63
20
502
51.4K