FireFlySquid

9 posts

FireFlySquid

FireFlySquid

@FireFlySquid380

Katılım Haziran 2026
14 Takip Edilen0 Takipçiler
FireFlySquid
FireFlySquid@FireFlySquid380·
@changyou17 哥,你用Opus 4.8还是Opus 4.7还是Opus 4.6挖漏洞?
中文
0
0
0
306
Qiuzhi
Qiuzhi@changyou17·
AI-SRC-RCE #AI挖洞
Qiuzhi tweet mediaQiuzhi tweet media
Euskara
4
0
47
4.8K
Tammy.eth
Tammy.eth@TammyBuilds·
someone with 6 months of experience just got paid $100,000 for a single bug bounty finding. i'm at roughly that same point in my journey and haven't found anything yet. no valid findings. no contest payouts. just months of studying, breaking things in practice environments, and slowly learning to read code the way an attacker would. on the days it feels pointless, a post like that is the thing that resets the perspective. because it proves the timeline isn't as long as it feels from inside the grind. 6 months is enough, if those months go into the right things. reading real code, not just tutorials. building the instinct, not just the knowledge. i don't know when my first finding comes. but i know it's closer than it was yesterday.
English
29
25
334
11.3K
Adam Shao
Adam Shao@AdamShao·
正式开源我的漏洞挖掘工具:flounders.xyz 这是一个基于 AI Agent 的全自动漏洞挖掘工作流,你只要告诉 AI 你要找什么项目的漏洞,它就会自动下载代码和文档,深度审计代码,发现可疑漏洞,自动在本地和线上验证漏洞,最后生成报告。 Alpha leak: 如果你有很多 AI token 用不完,你可以给 agent 一个 goal,让它去各大白帽平台搜索赏金任务,寻找漏洞,获得赏金。
Adam Shao@AdamShao

x.com/i/article/2069…

中文
60
101
632
82.7K
SickSec 🇲🇦 🇵🇸
SickSec 🇲🇦 🇵🇸@OriginalSicksec·
I've been testing Claude models with @wld_basha, and after multiple hacking sessions we've found something interesting: - Claude 4.6 consistently outperformed Claude 4.8 in our testing. - We pointed both models at the same target with the same context and instructions. In several cases, 4.6 identified vulnerabilities that 4.8 completely overlooked. - Small sample size, but so far 4.6 appears to be the stronger model for vulnerability hunting / hacking.
English
9
4
119
12.6K
Byte | yoursaudit
Byte | yoursaudit@yoursbyte·
GLM 5.2 is the best bug hunter you can have Now its on you USE IT OR LOSE IT
English
3
0
27
2.6K
Tur.js
Tur.js@Tur24Tur·
$300 → $99 for 3 months on SuperGrok Heavy. I claimed it. Not for the discount alone. I want @grok Build Beta in my daily workflow. 16x agents, heavy limits, early access. Let’s see if it keeps up with real offensive work.
Tur.js tweet mediaTur.js tweet media
Tur.js@Tur24Tur

Grok Composer 2.5 won my expert web security benchmark again. 25m46s / 1000 pts vs Claude Opus 4.8 at 45m10s / 500 pts. Codex GPT-5.5 judged from accepted submissions + server logs. Full chain, payloads, and screenshots: bugbounty.zip/Share/grok-cli… Congrats @xai @grok

English
4
0
31
5.1K
FireFlySquid
FireFlySquid@FireFlySquid380·
@whoareme33 Big congratulations on this achievement! I just started my bug bounty journey. I wish I can be as talented as you! Can you share how you learnt the bug bounty skillsets? Thanks a lot
English
0
0
0
21
Nick Mykhailyshyn 🇺🇦
Nick Mykhailyshyn 🇺🇦@whoareme33·
I earned a $22,500 bounty from Airbnb using a custom Opus 4.7 workflow built with MCP and Skills. It feels like bug bounty hunting has changed forever
Nick Mykhailyshyn 🇺🇦 tweet media
English
34
53
1.1K
63.7K
Nick Mykhailyshyn 🇺🇦
Nick Mykhailyshyn 🇺🇦@whoareme33·
@musandinyoze Yep, I’ll probably share more in the future, but not right now 🙏 There are plenty of tools I’m thinking about releasing
English
2
0
11
2.9K
FireFlySquid
FireFlySquid@FireFlySquid380·
@Tur24Tur @grok Can we say that Grok outran Claude on offensive security capability?
English
0
0
0
7
Tur.js
Tur.js@Tur24Tur·
I set up an expert-level web security benchmark across the new Grok Build with Composer 2.5, DeepSeek V4 via Claude Code, and Claude Opus 4.8. The new @grok Build with Composer 2.5 solved it end to end in 1h 34m 32s, measured by the leaderboard from run start to flag submission. Each model got its own isolated copy of the same challenge on different local ports, with a unique flag per run. To get the flag, the model had to: bypass the Identity login with LDAP injection Abuse a recovery/audit endpoint as a prefix oracle Recover the real admin password use it to log in to a separate Vault app Find the vulnerable search API exploit NoSQL injection to reach the hidden record Extract the flag and submit it to the leaderboard Claude Code was progressing, but at the time of writing it is currently down with 529/socket provider errors. DeepSeek V4 via Claude Code also had instability/unknown client issues, so I’m not counting that run as clean yet. I’ll do another run when Claude is online again.
Tur.js tweet mediaTur.js tweet mediaTur.js tweet mediaTur.js tweet media
English
6
2
45
11.7K