no1x

1.6K posts

@no1x__

Analysis & Vibe Engineering: EWT, PA, Macro/BTC | Open Source Tools

Joined November 2023
332 Following · 164 Followers
dex
dex@dexhorthy·
have started doing some switch-hitting between claude and codex ever since we rolled our own alternative harness in march. Opus 4.7 now available in humanlayer ofc.

Recent vibes:
- opus 4.5 more reliable than 4.6 or 4.7 for most things. (fwiw i actually preferred intelligence of 4.1 to 4.5 but 4.5 is so much faster that it's worth the tradeoff)
- 1m context is useful for some cases but i still try to stay under 100k tokens for serious work on our large monorepo by externalizing task progress to MDs
- codex takes more time to review and research code before
- this means it needs less intentional steering to research and planning, it can tackle bigger things with just a simple prompt
- codex takes more time, this makes it slower, which can be annoying if you are used to claude just getting to work
- codex, esp 5.4, is less trigger-happy than claude models, it is more likely to ask "want me to run the tests" or "want me to go make this change" vs. just going to do the thing. This actually makes afk yolo-ing harder. but if you are working off plans it's fine.
- claude code cli/sdk continues to add features, flag on random new things, we disable a lot like auto memory and adaptive thinking while working. In general the more built in tools and agents, the less room for your instructions. If you don't own the harness you're at the whims of what the model provider thinks you need to get better results.
- this generally seems to be optimized for new users getting results, at the cost of giving flexibility to power users
- the frontier is jagged and opus 4.7 is good at a few random things that prev models were not as good at (no spoilers! go poke around!)

most people in the GC have been saying these things for weeks. but in case you were waiting to hear from me and @0xblacklight and team, here's what we got

thanks @nayshins @GeoffreyHuntley @SickBots @nisten and many others for contributing to the vibe report
dex tweet media
English
8
9
77
8.5K
no1x
no1x@no1x__·
@AlicanKiraz0 the more I look, the more I laugh; the more I laugh, the more I look
Translated from Turkish
0
0
1
23
Alican Kiraz
Alican Kiraz@AlicanKiraz0·
Friends, I don't know if it's just me, but I couldn't see any difference in the Opus 4.7 metrics: in the areas where 4.6 trailed GPT5.4, 4.7 still trails as well 🧐
Alican Kiraz tweet media
Translated from Turkish
28
0
68
10.4K
Claude
Claude@claudeai·
Introducing Claude Opus 4.7, our most capable Opus model yet. It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back. You can hand off your hardest work with less supervision.
Claude tweet media
English
4.3K
9.7K
75.1K
9.7M
Google Antigravity
Google Antigravity@antigravity·
Mission Report is out now! Catch up on everything new in Antigravity + Q&A!
English
339
61
683
102.3K
0xSero
0xSero@0xSero·
Another OpenCode Go giveaway. --------- 10 People will win this time. Winners will be chosen based on your comment. 10 most liked comments win.
0xSero tweet media
English
197
8
367
16.5K
Jay
Jay@jayair·
Kit built this, should we ship it? (sound on)
English
102
33
1.5K
152.2K
Alican Kiraz
Alican Kiraz@AlicanKiraz0·
Claude has been performing very poorly for the last 2-3 days; I keep having to revise the answers it gives. I think they dialed down extended thinking in preparation for Mythos.
Translated from Turkish
26
5
224
31.6K
Tibo
Tibo@thsottiaux·
I realize yesterday’s Codex reset came at a bit of an unfortunate time, given the last one was almost exactly a week ago. To really celebrate the 3M I’ll reset again tomorrow. Thanks for the feedback!
English
644
297
6.6K
548.2K
Tibo
Tibo@thsottiaux·
Three million people are now using Codex weekly - up from two million a little under a month ago. Incredible to see the growth. Thank you to all of you and to the ecosystem we’re part of. To celebrate, we’re resetting rate limits so you can keep building, and we’ll reset them every additional 1M users until we reach 10M, so we can keep celebrating along the way. Enjoy and thank you!
English
400
301
4.5K
520.7K
no1x
no1x@no1x__·
@mdisec what it's trying to say: Bro, is the jump in this chart a joke? The model has shot up to a beastly accuracy at finding vulnerabilities. If that's really the case, we should close up shop and go plant tomatoes in the village, man. Have you all lost your minds?
Translated from Turkish
0
0
4
707
Ahmet Göker🇹🇷🇳🇱
Ahmet Göker🇹🇷🇳🇱@_shadowintel_·
I think many people seriously underestimate how good AI is at finding vulnerabilities.
Translated from Turkish
6
1
61
9.5K
dawgyg - WoH
dawgyg - WoH@thedawgyg·
I'm working on doing the same lol, if what I'm doing works I'll let you know. But them only being able to exploit 1% of their vulns with opus 4.6, then 70% with Mythos, has me curious. I'm near 100% success with opus 4.6 on exploit dev for memory corruption vulns, so I really wanna see how it can get better. Only thing I can think of is time + token count reduced by a lot, maybe
English
2
0
25
5.8K
Douglas Day
Douglas Day@ArchAngelDDay·
Alright how do I get access to Mythos?
English
6
0
48
9.4K
corbin
corbin@corbin_braun·
claude mythos endgame for coding?
English
7
1
26
2K
Newton Cheng
Newton Cheng@newton_cheng·
Super proud of my team and everyone in the org that came together for this! This is probably one of the craziest things we’ve pulled off in my 3+ years here. And this is just the start -- we have a lot more planned in the next several months!
Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English
4
4
22
961
Md Ismail Šojal 🕷️
Md Ismail Šojal 🕷️@0x0SojalSec·
If this is real, it's a major hit for cybersecurity? 😑 Anthropic's secret model, Claude Mythos Preview, their most powerful unreleased model yet, autonomously identified high-severity zero-day vulnerabilities across every major operating system and web browser. Full unauthenticated root exploit on FreeBSD. They won't release it publicly. Instead, Anthropic launched Project Glasswing, a $100M defensive coalition with Apple, Google, Microsoft, Amazon, NVIDIA & more. They're racing to patch the world's infrastructure before these capabilities spread. This is the moment AI security crossed a dangerous line.
English
7
1
10
1.8K