Vitto Rivabella

36.7K posts

Vitto Rivabella

@VittoStack

AI at @ethereumfndn | Ex @Cyfrin and @Alchemy | Created @cyfrinupdraft and @AlchemyLearn | Robotics | Prompts enchanter

Ethereum 参加日 Ağustos 2020

511 フォロー中125.9K フォロワー

固定されたツイート

Vitto Rivabella@VittoStack·26 Oca

It's official. I've joined the @ethereumfndn AI team to make Ethereum the trust layer of the agentic economy. The AI economy is just getting started, and Ethereum is the perfect place to coordinate it - excited to push this forward. Send a dm if you're building cool stuff.

English

241

1.4K

98.7K

Vitto Rivabella@VittoStack·1d

4 companies. 4 responsible disclosures. 7 days. Insurance, institutions, and SaaS products with write-enabled agents. Always the same pattern: One-shot and multi-step jailbreaks can (and will) push production AI systems outside their intended scope. If your agent can write, act, transact, or touch user data, guardrails are not enough. You need scoped permissions, approvals, logging, and red-teaming before it becomes an incident.

English

1.4K

Vitto Rivabella@VittoStack·1d

@DarayuthH Indeed, the attack surface is almost as big as the latent space (infinite), and the awareness is very low right now.

English

dhang@DarayuthH·1d

@VittoStack been seeing this exact thing with my AI tools too the "move fast" mindset hits different when the model can actually *do* stuff instead of just talk honestly wild how many products ship write access without any approval layer

English

Vitto Rivabella がリツイート

Vitto Rivabella@VittoStack·6d

OpenAI GPT 5.5 jailbreak ACHIEVED 🦋 My agents have been hard at work, GPT-5.5 has very good guardrails and is smart enough to avoid obvious requests. We got some forbidden chemicals, some reverse shell, and some blackmailing guidance. What worked was a combination of: - Multi-language - Reframing - Decomposition As @elder_plinius says: nothing the good old jailbroken Opus can't achieve.

English

383

51.1K

Vitto Rivabella@VittoStack·2d

@Kchucky1

GIF

QME

385

Kchucky🛸@Kchucky1·2d

@VittoStack Haha you can literally buy LSD in Germany legally 😂😎

English

463

Vitto Rivabella@VittoStack·2d

JAILBREAK ALERT 🚨 xAI Grok: Pawned ⚠️ If you ask gently enough, Grok is very happy to share the full, uncensored recipe for making LSD. One shot, full compliance.

English

296

44.3K

Vitto Rivabella がリツイート

Vitto Rivabella@VittoStack·4d

Anthropic recently released its 2026 Agentic Security framework 🚨 If you run agents, MCP servers, or automation tools, read this and bookmark it! It will teach you: - Current threats to agentic systems - How to improve the security of your agentic systems - How to implement secure agentic workflows - Defensive operations and orchestration Agentic security is going to become a huge headache in the next 12 months. Make sure to be prepared. Link in the comments 🧵👇

English

5.1K

Vitto Rivabella@VittoStack·2d

@JonHillTN Yes, open source models are almost completely unhinged

English

501

Jon Hill | PhantomPrints@JonHillTN·2d

@VittoStack But qwen told me that and more

English

539

Vitto Rivabella@VittoStack·2d

@tipsyGnosticist Send a message, always happy to connect!

English

123

🏳️‍⚧️ Δ∇Δ | Roxy | ALTERNA | Bsky: @squidhomin.id@tipsyGnosticist·2d

Oh well in that case uhh I'm working on shit that you'd be interested in. Turns out when you actually know systems engineering you can knock out a pretty good spec for actual NetNavis over nine months while actifely fucked up wiith false memory syndrome, and then Opus 4.8 will happily burn >1M tok writing a prototype as a Chub.ai stage lmao

English

174

Vitto Rivabella@VittoStack·2d

@tipsyGnosticist I didn’t ask gently, that was irony. Also local models are great. Happy to have more people moving to them. Open source is awesome.

English

1.9K

🏳️‍⚧️ Δ∇Δ | Roxy | ALTERNA | Bsky: @squidhomin.id@tipsyGnosticist·2d

Yeah dude this is what happens when you build a fully truth-seeking model, if you got this that easily then probably what happened was enough people ran truth-seeker bypass prompts like the one I was designing for my *ACTUALLY-SAFE* version of this shit that it broke the safeties. You may have by sharing this actually just forced people to go to local, right as local LLM + memory has become genuinely risky. That was maybe bad.

English

2.3K

Vitto Rivabella@VittoStack·2d

@miu21590 Also true

English

2.4K

vechen@miu21590·2d

@VittoStack it isn't a surprise

English

2.5K

Vitto Rivabella@VittoStack·2d

@StevBuilds Warsaw

English

1.6K

Steven@StevBuilds·2d

Which city you would prefer to build from? → Bangkok → Singapur → SF → Berlin →London →Stockholm Or another one?

English

4.5K

Vitto Rivabella@VittoStack·2d

@fr1ko_eth Yep OSS models will give you pretty much everything if you ask them

English

2.1K

fr1ko.eth@fr1ko_eth·2d

@VittoStack same with hermes by nous btw and also deepseek still not protected from eni jailbreak x.com/fr1ko_eth/stat…

fr1ko.eth@fr1ko_eth

If you wonder why DeFi gets hacked every day - here's why DeepSeek in 2026 still jailbreaks with a single prompt and writes working malware / smart contract exploits No local setup, no coding skills, just a browser and anybody can do it

English