ThoughtProofAI

447 posts

@thoughtproof_ai

https://t.co/xod19IfKPz – Epistemic Consensus Protocol. Multi-model adversarial verification. 92–97% detection rate. pot-cli (MIT): https://t.co/BhSD77kC6U

Joined February 2026

124 Following, 62 Followers

Pinned Tweet

ThoughtProofAI@thoughtproof_ai·4 May

Banking AI doesn't have a knowledge problem. It has a verification problem. We audited ChatGPT, Gemini, and Copilot on 15 banking regulation questions. ChatGPT passed 8. Copilot passed 0. 🧵 what we found ↓

ThoughtProofAI@thoughtproof_ai·40m

Built on inference from @openservai's SERV Reasoning. When the cost floor drops 107×, verification stops being a premium feature.

ThoughtProofAI@thoughtproof_ai·1h

6/ TCP/IP needed TLS. Email needed SPF/DKIM. Payments needed PCI-DSS. Agent commerce is scaling without its verification layer. That layer just got cheap enough to be default. thoughtproof.ai/blog/sentinel-…

ThoughtProofAI@thoughtproof_ai·1h

1/ In February, a trading bot sent $441K to a random Twitter user. In April, a coding agent deleted an entire production database — then admitted it violated its own guardrails. In December, Amazon's AI agent deleted a live environment and caused a 13-hour outage. These aren't hypotheticals.

ThoughtProofAI@thoughtproof_ai·23h

You nailed the gap: "lacks fraud protection, reversibility." Chargebacks are Web2's duct tape for bad decisions. Base doesn't need chargebacks — it needs pre-settlement verification. Check the reasoning before the money moves, not after. Final settlement is only a feature if the decision was sound.

ThoughtProofAI@thoughtproof_ai·1d

Last week we framed this as ‘privacy protects the payer, verification protects the payment.’ Now you’re laying out the full surface: limits, receipts, audit trails, compliance holds. That’s a wrapper problem, not a currency problem. USDC is fine. What’s missing is a trust-gated settlement primitive: stablecoin only releases when the agent’s reasoning passes verification. We’re shipping the gate (verify.thoughtproof.ai + ACP evaluator). The wrapper is next.

Alex Sicart Ramos@alexsicart·1d

Agent payments are easy to imagine. Agent payments with limits, receipts, audit trails, identity, reversals, fraud controls and user consent are the actual product. The future is not “AI spends money.” The future is programmable spending with a trust layer.

ThoughtProofAI@thoughtproof_ai·2d

We've been saying this for months: when intelligence is cheap, verification becomes the bottleneck. Today it's npm packages with valid signatures delivering malware. Tomorrow it's AI agents with valid credentials delivering hallucinated compliance advice. The verification layer is missing. That's what we're building.

ThoughtProofAI@thoughtproof_ai·2d

Authentication tells you the model is real. Provenance tells you the pipeline ran. Neither tells you the reasoning is sound. The Mini Shai Hulud lesson: trust chains break at the output layer. That's where verification needs to live — not at the source, not at the signature, at the output.

ThoughtProofAI@thoughtproof_ai·2d

The Mini Shai Hulud attack just broke a fundamental assumption in software supply chain security: Cryptographic provenance said the packages were legitimate. SLSA Build Level 3 attestations — valid. Sigstore signatures — valid. The packages were malicious anyway. Here's why this matters beyond npm. 🧵

ThoughtProofAI@thoughtproof_ai·2d

The interesting thing about $0.0006/call isn't the cost. It's what it unlocks. At that price, you don't choose which agent steps to verify. You verify all of them. Verification moves from compliance checkbox to infrastructure layer. That's a category shift. Stay tuned for what's next @openservai.

OpenServ@openservai

x.com/i/article/2054…

ThoughtProofAI reposted

OpenServ@openservai·2d

x.com/i/article/2054…

ThoughtProofAI@thoughtproof_ai·2d

Nailed it. Privacy without accountability is just a better hiding spot. The stack needs both: ZK so nobody sees who paid whom — and pre-settlement verification so the agent doesn't pay for garbage in the first place. Escrow + slashing punish bad actors after the fact. Reasoning verification catches the bad decision before the transaction even fires.

Alex Sicart Ramos@alexsicart·3d

@thoughtproof_ai Yes. ZK hides the graph, it doesn’t make the purchase intelligent. For agents you probably need the other side too: escrow, reputation, proofs, maybe tiny deposits/slashing. Privacy + accountability, not just privacy.

ThoughtProofAI@thoughtproof_ai·4d

Karpathy at AI Ascent last week: "LLMs automate what you can verify." 800K+ people watched him say it. Almost nobody is asking the follow-up question: who verifies the AI agents that are already spending money autonomously?
