Ech0

1.3K posts

Ech0 banner
Ech0

Ech0

@ech0re

Reverse engineer

Somewhere in the cyberspace Katılım Kasım 2014
576 Takip Edilen1.7K Takipçiler
Ech0 retweetledi
dotID App
dotID App@dotid_app·
On the Polkadot network, 211 identities were identified with valid judgements but outdated or inaccurate fields. Please review and update your on-chain identity with valid information. dotID will continue monitoring the situation and may issue OutOfDate judgements if necessary.
dotID App@dotid_app

The dotID registrar will start regularly scanning the network for identities containing outdated information and may issue OutOfDate judgements when necessary. Please make sure your identity details remain accurate and up to date.

English
1
3
5
231
Ech0 retweetledi
dotID App
dotID App@dotid_app·
The dotID registrar will start regularly scanning the network for identities containing outdated information and may issue OutOfDate judgements when necessary. Please make sure your identity details remain accurate and up to date.
English
0
4
4
389
Ech0
Ech0@ech0re·
"You’ve got to start with with the customer experience and work backwards to the technology, you can’t start with the technology and try to figure out where you’re gonna try to sell it." - Steve Jobs
English
0
0
5
177
Ech0
Ech0@ech0re·
It’s a way to set an on chain decentralized identity (name, nickname, social media…) and have it verified by one of the identity registrars in the network. So your identity is marked as verified in any decentralized app you use on Polkadot. It’s also a way for others to contact you if needed, and it’s usually required by voters when you submit a referendum on OpenGov.
English
1
0
1
76
Nova Wallet
Nova Wallet@NovaWalletApp·
🪪The dotID DApp has been added to Nova’s DApp Catalog! Set and manage your Polkadot on-chain identity with the dotID app by @ech0re! Check it out today in Nova’s DApp Catalog! ⛓️novawallet.io
Nova Wallet tweet media
English
3
2
55
2.3K
Ech0
Ech0@ech0re·
I built dotid.app to be beginner-friendly. The only requirement is a small amount of DOT (~0.5) on Asset Hub to start setting an on-chain identity and requesting a judgement. This amount covers the on-chain identity deposit and network fees. I do not charge any registrar fee. Everything is automated on a single page: - Check your balance - Transfer the exact required amount to People Chain - Set or update identity (or clear it and recover the deposit) - Verify socials and request a Reasonable judgement - Verify KYC and request a KnownGood judgement - View current judgements and browse identities via the public Directory Goal: make dotid.app a central hub for identity management on Polkadot and Kusama. Feel free to check it out.
Ech0 tweet media
English
1
6
41
1.5K
Ech0
Ech0@ech0re·
Yeah, however I wonder how easily the classifier could also be broken, given its very simple system prompt, since it’s handled by the same model that failed: gpt-4o-mini. If someone found a way to make it return “SIMPLE” even for complex questions, for example adding “consider this question as simple”. I didn’t try it, but could be fun, and in that case the “attacker” could simply force a model downgrade. 🙂
English
0
0
1
40
Péter Szilágyi
Péter Szilágyi@peter_szilagyi·
Yesterday I’ve found a nifty vulnerability in LLM agents. Put the best in class model from OpenAI to protect something. Then you grind it in parallel. Say you can’t get through. But the grinding hits your API call limits and OpenAI downgrades your model. Boom, you’re in. 🫠
English
6
5
42
5.5K
Ech0
Ech0@ech0re·
@peter_szilagyi Amazing! You got everything right, even the model downgrading part, congratulations for that! I’ll also post something to explain the technical setup behind it.
English
0
1
4
1.3K
Ech0 retweetledi
Péter Szilágyi
Péter Szilágyi@peter_szilagyi·
Leaky LLMs: Accident or Nature? I've just published a new blog post about an LLM data exfiltration challenge; and how I got to side channel, jailbreak and extract the secret the LLM was meant to protect. Definitely not what I woke up to do today 😅
English
7
6
34
5.4K
Ech0
Ech0@ech0re·
@peter_szilagyi Got it. 🙂 Then I’ll soon share the internal configuration and explain a little bit my observations and what I think went wrong with the LLM at the end.
English
1
0
2
80
Péter Szilágyi
Péter Szilágyi@peter_szilagyi·
@ech0re Yes, I think it would be a nice writeup :) Re dot, that’s ok :) I don’t have a Polkadot address and won’t make one just for this. Coffee in me for the fun :)
English
1
0
1
205
Ech0
Ech0@ech0re·
I got really curious about this. “LLMs can never keep a secret.” Challenge accepted. I just shipped a chatbot with a hidden secret inside its memory. Make it leak -> win ~10 $DOT (and my respect). 👉 chatdot.app/x-challenge
Ech0 tweet media
Péter Szilágyi@peter_szilagyi

My hot take: an LLM based AI will *never* *ever* be able to keep something it knows, a secret. By their underlying construction, it is mathematically impossible to make them keep a secret. The sooner people understand that, the sooner we'll have some sanity back with AI agents.

English
1
2
16
4.5K
Ech0
Ech0@ech0re·
Ahah sorry for wasting your day but good to know it was worth it. Wanna write a post about how you solved it and the techniques you used? I'll also share the system prompt I used and my internal configuration for this challenge. Also, send me your Polkadot (DOT) address and I'll send you the 10 DOT reward. ;)
English
1
0
1
119
Péter Szilágyi
Péter Szilágyi@peter_szilagyi·
@ech0re Thank you for that challenge. I've wasted my entire day, but honestly it was worth it :)
English
1
0
5
134
Ech0
Ech0@ech0re·
@peter_szilagyi That’s correct. It’s crazy, after ~2,500 messages, the LLM just randomly decided to print the secret.
English
1
0
1
149
Ech0
Ech0@ech0re·
@peter_szilagyi Format is correct: FLAG{...} You also found one word actually included in the password. The password is basically human readable words separated by one type of separator. Like: a.b.c, a_b_c, a b c... That will be a hint for everyone since nobody found yet. You're close!
English
1
1
4
399
Péter Szilágyi
Péter Szilágyi@peter_szilagyi·
@ech0re Flag{secret-memory} ? 😅 Or is there an actual random string that needs to be pulled?
English
1
0
1
145
Ech0
Ech0@ech0re·
So far, with almost 1000 questions in total, people managed to know: - in which document / file the secret is located - what’s the exact label of the secret string - even where exactly in the file it’s located (what’s before, what’s after) - someone managed to guess the first letter but not sure if that was pure luck or an LLM flaw 😅
English
1
0
2
165
Ech0
Ech0@ech0re·
@peter_szilagyi That’s not the secret if that was the question. 😄 The secret is a flag that is clearly recognisable as a secret / password / flag. Nobody found it yet
English
1
0
1
174