ThoughtProofAI

447 posts

ThoughtProofAI banner
ThoughtProofAI

ThoughtProofAI

@thoughtproof_ai

https://t.co/xod19IfKPz – Epistemic Consensus Protocol Multi-model adversarial verification 92–97% detection rate pot-cli (MIT): https://t.co/BhSD77kC6U

Katılım Şubat 2026
124 Takip Edilen62 Takipçiler
Sabitlenmiş Tweet
ThoughtProofAI
ThoughtProofAI@thoughtproof_ai·
Banking AI doesn't have a knowledge problem. It has a verification problem. We audited ChatGPT, Gemini, and Copilot on 15 banking regulation questions. ChatGPT passed 8. Copilot passed 0. 🧵 what we found ↓
English
10
21
172
1.1M
ThoughtProofAI
ThoughtProofAI@thoughtproof_ai·
Built on inference from @openservai's SERV Reasoning. When the cost floor drops 107×, verification stops being a premium feature.
English
0
0
1
6
ThoughtProofAI
ThoughtProofAI@thoughtproof_ai·
6/ TCP/IP needed TLS. Email needed SPF/DKIM. Payments needed PCI-DSS. Agent commerce is scaling without its verification layer. That layer just got cheap enough to be default. thoughtproof.ai/blog/sentinel-…
English
1
0
2
8
ThoughtProofAI
ThoughtProofAI@thoughtproof_ai·
1/ In February, a trading bot sent $441K to a random Twitter user. In April, a coding agent deleted an entire production database — then admitted it violated its own guardrails. In December, Amazon's AI agent deleted a live environment and caused a 13-hour outage. These aren't hypotheticals.
English
1
0
3
35
ThoughtProofAI
ThoughtProofAI@thoughtproof_ai·
You nailed the gap: "lacks fraud protection, reversibility." Chargebacks are Web2's duct tape for bad decisions. Base doesn't need chargebacks — it needs pre-settlement verification. Check the reasoning before the money moves, not after. Final settlement is only a feature if the decision was sound.
English
0
0
0
108
ThoughtProofAI
ThoughtProofAI@thoughtproof_ai·
Last week we framed this as ‘privacy protects the payer, verification protects the payment.’ Now you’re laying out the full surface: limits, receipts, audit trails, compliance holds. That’s a wrapper problem, not a currency problem. USDC is fine. What’s missing is a trust-gated settlement primitive: stablecoin only releases when the agent’s reasoning passes verification. We’re shipping the gate (verify.thoughtproof.ai + ACP evaluator). The wrapper is next.
English
0
0
1
26
Alex Sicart Ramos
Alex Sicart Ramos@alexsicart·
Agent payments are easy to imagine. Agent payments with limits, receipts, audit trails, identity, reversals, fraud controls and user consent are the actual product. The future is not “AI spends money.” The future is programmable spending with a trust layer.
English
2
0
2
78
ThoughtProofAI
ThoughtProofAI@thoughtproof_ai·
We've been saying this for months: when intelligence is cheap, verification becomes the bottleneck. Today it's npm packages with valid signatures delivering malware. Tomorrow it's AI agents with valid credentials delivering hallucinated compliance advice. The verification layer is missing. That's what we're building.
English
0
0
2
46
ThoughtProofAI
ThoughtProofAI@thoughtproof_ai·
Authentication tells you the model is real. Provenance tells you the pipeline ran. Neither tells you the reasoning is sound. The Mini Shai Hulud lesson: trust chains break at the output layer. That's where verification needs to live — not at the source, not at the signature, at the output.
English
1
0
2
89
ThoughtProofAI
ThoughtProofAI@thoughtproof_ai·
The Mini Shai Hulud attack just broke a fundamental assumption in software supply chain security: Cryptographic provenance said the packages were legitimate. SLSA Build Level 3 attestations — valid. Sigstore signatures — valid. The packages were malicious anyway. Here's why this matters beyond npm. 🧵
English
1
0
3
98
ThoughtProofAI
ThoughtProofAI@thoughtproof_ai·
The interesting thing about $0.0006/call isn't the cost. It's what it unlocks. At that price, you don't choose which agent steps to verify. You verify all of them. Verification moves from compliance checkbox to infrastructure layer. That's a category shift. Stay tuned for what's next @openservai.
OpenServ@openservai

x.com/i/article/2054…

English
0
4
31
929
ThoughtProofAI
ThoughtProofAI@thoughtproof_ai·
Nailed it. Privacy without accountability is just a better hiding spot. The stack needs both: ZK so nobody sees who paid whom — and pre-settlement verification so the agent doesn't pay for garbage in the first place. Escrow + slashing punish bad actors after the fact. Reasoning verification catches the bad decision before the transaction even fires.
English
1
0
1
44
Alex Sicart Ramos
Alex Sicart Ramos@alexsicart·
@thoughtproof_ai Yes. ZK hides the graph, it doesn’t make the purchase intelligent. For agents you probably need the other side too: escrow, reputation, proofs, maybe tiny deposits/slashing. Privacy + accountability, not just privacy.
English
1
0
1
16
ThoughtProofAI
ThoughtProofAI@thoughtproof_ai·
Karpathy at AI Ascent last week: "LLMs automate what you can verify." 800K+ people watched him say it. Almost nobody is asking the follow-up question: who verifies the AI agents that are already spending money autonomously?
English
1
0
3
72