StErMi

10K posts

StErMi banner
StErMi

StErMi

@StErMi

#web3 dev + auditor | @SpearbitDAO LSR, @immunefi bug hunter, sage of AAVE codebase :D

ethereum L1/L2 Katılım Nisan 2008
1.4K Takip Edilen4.6K Takipçiler
Sabitlenmiş Tweet
StErMi
StErMi@StErMi·
During the next few days, I will share some of my private security research work that I have done in the last year. ​ All those projects are @aave related, and I feel very proud to have been chosen as one of the security partners to review them. I'm pretty sure that the dedication and knowledge that I have demonstrated on their codebase has been a pretty good investment 😁 ​ Those projects were very challenging, but at the same time it was fun and a pleasure because you always feel motivated and engaged when the quality of documentation, specifications and code is high quality. ​ When @avara or @bgdlabs contacts you, you know that you are going to enjoy it. They are both a top-notch team to work with. ​ The first one that I want to share is a.DI (Aave Delivery Infrastructure) It's a cross-chain communication abstraction layer for decentralised systems like the Aave DAO to communicate across networks, minimising the risk of underlying individual bridge provider failures, via consensus rules.
StErMi tweet media
English
8
6
107
15.9K
StErMi
StErMi@StErMi·
@ChatGPTapp why can't I choose the model and the thinking level when I start a new chat in a Project? It's not possible on the native macOS app but I can do it on the web version.
English
0
0
0
257
Merlin Egalite 🕛
Merlin Egalite 🕛@MerlinEgalite·
Just read The Book of Elon from @EricJorgenson. I think it's easily in my top 3 books about entrepreneurship now. If I had to condense the whole philosophy in 1 word, it would be: pragmatism.
English
3
0
28
3.4K
StErMi
StErMi@StErMi·
Am I the only one feeling this way?
English
0
0
0
109
StErMi
StErMi@StErMi·
I'm going to say it. I'm prefering @grok chat UX/output compared to @ChatGPTapp. It's not about the actual output (how smart/what it's saying) but rather "how it's saying it" and how well formatted the text is. @ChatGPTapp in that sense is really way behind the competitors
English
3
0
2
432
StErMi
StErMi@StErMi·
Have I unlocked infinite context? Usually in the bottom bar it shows how much the chat's context is filled... The image is from an already existing long chat. @OpenAIDevs I think we have a bug ;)
StErMi tweet media
English
0
0
1
529
StErMi
StErMi@StErMi·
@davis7 I’m really looking forward for content like that that explain and provide real and actual usuale of this technology
English
0
0
0
196
Ben Davis
Ben Davis@davis7·
And one more thing to answer the question of "what are you doing with all these things", the answer is my job lol Most of it isn't public facing, I need to start open sourcing a ton of stuff and making more of these tools public. All the hermes agent stuff is for helping do my main job: running 3 yt channels and helping run a creator agency. 90% of the work for content is stuff u don't see. The recording it is the fun part, so is the editing. The really hard part is: - working with 50+ brands - dealing with 5+ inboxes - slack - research - scheduling - meetings - and more The hermes agent stuff is incredible for helping with that. It's also really nice for dev work. Haven't done all of these yet, but stuff like: - auto closing issues when a pr lands that addresses them - auto making prs when bugs get reported or noticed in posthog/sentry - keeping track of where a project is going, quickly throwing together ideas, etc. For the app side I've been working on: - btca web (I desperately need to release this one, the new version is great I just do not want to deal with the stripe and db migrations so I'm procrastinating I'm sry) - picthing - lawn - tons of internal tools for the channels/agency - personal sites/resources It'll all make it's way out over time, I have a ton to show/talk about with these. Just trying to figure out what this system is even going to look like.
Ben Davis@davis7

It's a meme but I really have been letting the psychosis take over as much as possible to figure out what I can actually do with these things - codex desktop app computer use spam - hermes agent that I've been loving - going as hard as possible on parallelizing in projects - making as many of my tools into markdown files as humanly possible - cloud agents + "cloud" agents that are t3 code instances on my mac mini - letting codex entirely control and setup my computer exactly how I want it to - unironically using gbrain, it's quite nice lol - seeing how far I can push local models on my 5090 (qwen 3.6 is very good) doing the real network setup was expensive, difficult, but was a great decicion. I feel so much more comfortable going harder with this stuff having A) a real firewall and B) hard gates around the mac mini and NAS so in the worst case scenario they can't compromise anything else on the network (the firewalla was an incredible purchase) and I still don't think I'm going hard enough

English
4
1
40
5.7K
StErMi
StErMi@StErMi·
@davis7 Waiting for a review 😉
English
0
0
0
162
Ben Davis
Ben Davis@davis7·
Psychosis is getting so bad that now even my bed has AI
Ben Davis tweet media
English
15
0
89
25.2K
StErMi
StErMi@StErMi·
@theo @frantzfries ChatGPT is one of the worst UX I’ve experienced. Their output is not formatted at all and it seems it’s vomiting text to you. I find it overwhelming at least. Claude is way better at it but it can be improved for sure.
English
0
0
1
430
Chris Frantz
Chris Frantz@frantzfries·
I’m probably wrong but you’re never going to build a better chat interface than OpenAI / Anthropic They’re unlikely to build a better version of what you do Therefore you should spend your time improving your product and making it work with their chat instead of adding your own
English
22
2
90
62.1K
Josh Pigford
Josh Pigford@Shpigford·
looking for 5 more people to give early access to! you'll have a free account for life. gonna start billing next week so last chance to get this guy for free. 🎉
Josh Pigford@Shpigford

Looking for 3 people to give early access to granite.co Granite is a long term doc vault. Legal, medical, business, taxes, etc. It's NOT a knowledge base. It's a place to drop documents you may never need or only need once or twice. But you should be able to find those docs instantly. Imagine a title for a vehicle. Or receipts for taxes. Or a purchase agreement from a business transaction. You simply dump all of those docs in Granite and never think about them again...until you do. No organization. No tagging. No folders. Simple plain english text input to find exactly what you need right when you need it. If you're interested, drop me a DM! You'll be the first to try it out and hit me with ALLLL the feedback. 🎉

English
24
1
16
15.9K
Merlin Egalite 🕛
Merlin Egalite 🕛@MerlinEgalite·
Celebrating Bitcoin Pizza Day as it should be 🍕
Merlin Egalite 🕛 tweet media
English
5
0
42
1.6K
StErMi
StErMi@StErMi·
I've asked it now. It said "Good catch — I should be more precise. What I had in mind is that OpenAI's original Codex model (the standalone API model, code-davinci-002) was deprecated back in 2023. However, ChatGPT Codex — the new AI coding agent OpenAI announced in 2025 — is a completely different and very current product. So my comment was misleading in context. If your audience is in the AI/dev space, mentioning "Codex" today would likely be understood as referring to the new agent, not the old model. I shouldn't have flagged it as a reason to remove it from your tweet."
English
1
0
1
111
patrickd
patrickd@patrickd_de·
@StErMi Did you try just asking why it thought that?
English
1
0
0
40
StErMi
StErMi@StErMi·
I just asked Claude to review the formatting of a text and one of the feedbacks was: "Codex is largely deprecated"🤔
English
2
0
0
397
StErMi
StErMi@StErMi·
@patrickd_de I don't think so because I was referring to it alongside Claude. Do you think I should take off my tinfoil hat? 😄
English
1
0
0
199
patrickd
patrickd@patrickd_de·
@StErMi It's probably referring to the old Codex version from early Copilot times
English
1
0
0
122
StErMi
StErMi@StErMi·
@aviggiano I'm very interested to have a chat/video call, hit me with a DM 😉
English
0
0
0
113
StErMi
StErMi@StErMi·
I'm looking for open and engaging discussions with protocols, clients, and fellow security researchers on how you're actually using (or planning to use) AI, LLMs, and Agents in smart contract security audits and day-to-day workflows. If you're experimenting with them (or thinking about it), DM me and let's chat. Topics I'd love to cover: - Which model providers and models are you using? - What tools or services have you integrated into your workflow? - Where do they fit in your workflow and day-to-day work? - Are you planning to adopt new tools or build internal solutions? - Real results: have they helped find vulnerabilities, improve architecture, or improve code quality? Any concrete examples? - What problems, frictions, or pitfalls have you encountered when using or integrating them? - How much do you trust the outputs? Any concerns around over-reliance, bias, or becoming less thorough? - How do you feel about the current state of the art and what's coming in the next 6–12 months? Looking forward to hearing your real experiences and swapping thoughts on where AI is taking our industry. DMs open, let's chat!
English
1
0
9
1.1K
Hari
Hari@hrkrshnn·
@AlexanderKalian > How autonomous was it really? 80 year old unsolved problem in math. Are we really conspiring that OpenAI cheated by having a human help to solve it?
English
4
0
15
1.4K
Dr Alexander D. Kalian
Dr Alexander D. Kalian@AlexanderKalian·
OpenAI claims that an unreleased "internal model" solved a major problem in mathematics. Sounds like OpenAI's very own pre-IPO Mythos moment of overhyping. We should all be asking healthily sceptical questions about this "autonomous" breakthrough: How autonomous was it really? How much scaffolding or chain-of-thought design came from in-house mathematicians targeting this specific problem? How much training data or RAG-based vector database stuff was internally produced data targeted towards this specific problem? How many failed attempts did it make? How was the outcome verified, and how much time and resources did it take, against presumed other failed attempts? They are not gonna tell you - and conveniently, the model is not available for public scrutiny either. AI companies have shifted into "source: trust me bro" mode.
OpenAI@OpenAI

Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.

English
53
23
210
19K
StErMi
StErMi@StErMi·
@pierrecomputer it would be nice if it could: remember the closed/open state of the files after refresh + having a way to "mark as read" like GitHub PRs do
English
0
0
0
386
Pierre
Pierre@pierrecomputer·
diffshub[dot]com Take any public diff from GitHub and virtualize it nearly instantly, no matter how large, with DiffsHub. Built to show off our brand new CodeView component. To try it out, replace `github` with `diffshub` in your address bar.
English
111
225
2.9K
1M
StErMi
StErMi@StErMi·
As far as the rumor goes, the new Siri will use by default the Gemini model under the hood. Every interaction will go through the Google Private AI Compute (the Google version of Private Cloud Compute). Rumors also say that Apple will allow the users to switch to other model providers like OpenAI, Anthropic, Perplexity and so on. Will those prompts also be privacy-protected by the same Google/Apple cloud architecture or will they be routed directly to the model providers?
English
0
0
0
227
StErMi
StErMi@StErMi·
The ones that will benefit a ton by the amount of vibe-coded apps into productions will be companies like @sentry The number of bugs/errors tracke will be exponential 😄 (if the dev even cares to enable it or knows that it's a best practice)
English
0
0
2
257