nwptz.eth

56.1K posts

nwptz.eth banner
nwptz.eth

nwptz.eth

@nwptz

💼 Solo dev, experimenting with @zarosabe 🏰 I run a small game guild

Web3 Katılım Şubat 2012
739 Takip Edilen1K Takipçiler
nwptz.eth
nwptz.eth@nwptz·
using gpt-5.4-mini at least once a day just to enjoy the speed.
English
0
0
0
8
nwptz.eth
nwptz.eth@nwptz·
Just did minimal css fixes and it consumed 10% of my codex 5 hours limit. The heck.. When reset?
English
0
0
1
57
nwptz.eth
nwptz.eth@nwptz·
@JasonBotterill not something I preferred. I'd rather to have codex isolated chat feature rather than shared memory. Don't want to mix personal and work.
English
0
0
1
105
nwptz.eth
nwptz.eth@nwptz·
why my codex now doing a lot of exec command. sign of another reset from Tibo?
English
0
0
0
22
nwptz.eth
nwptz.eth@nwptz·
@phuctm97 same as human. if you told it to "think hardest" on task that not hard enough, the output is worse. select reasoning based on the scope of work, you will see the different.
English
0
0
1
1.3K
Minh-Phuc Tran
Minh-Phuc Tran@phuctm97·
Is it true that Codex GPT 5.4 with Extra High thinking effort is worse than with Medium/High thinking effort? If it’s true, that’d be very bad design. 😅
English
81
1
194
45K
nwptz.eth
nwptz.eth@nwptz·
One of the goal on @zarosabe ADE is to build environment for those who skeptic with AI eventually enjoying using AI.
Andrej Karpathy@karpathy

Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around recency and tier of use. I think a lot of people tried the free tier of ChatGPT somewhere last year and allowed it to inform their views on AI a little too much. This is a group of reactions laughing at various quirks of the models, hallucinations, etc. Yes I also saw the viral videos of OpenAI's Advanced Voice mode fumbling simple queries like "should I drive or walk to the carwash". The thing is that these free and old/deprecated models don't reflect the capability in the latest round of state of the art agentic models of this year, especially OpenAI Codex and Claude Code. But that brings me to the second issue. Even if people paid $200/month to use the state of the art models, a lot of the capabilities are relatively "peaky" in highly technical areas. Typical queries around search, writing, advice, etc. are *not* the domain that has made the most noticeable and dramatic strides in capability. Partly, this is due to the technical details of reinforcement learning and its use of verifiable rewards. But partly, it's also because these use cases are not sufficiently prioritized by the companies in their hillclimbing because they don't lead to as much $$$ value. The goldmines are elsewhere, and the focus comes along. So that brings me to the second group of people, who *both* 1) pay for and use the state of the art frontier agentic models (OpenAI Codex / Claude Code) and 2) do so professionally in technical domains like programming, math and research. This group of people is subject to the highest amount of "AI Psychosis" because the recent improvements in these domains as of this year have been nothing short of staggering. When you hand a computer terminal to one of these models, you can now watch them melt programming problems that you'd normally expect to take days/weeks of work. It's this second group of people that assigns a much greater gravity to the capabilities, their slope, and various cyber-related repercussions. TLDR the people in these two groups are speaking past each other. It really is simultaneously the case that OpenAI's free and I think slightly orphaned (?) "Advanced Voice Mode" will fumble the dumbest questions in your Instagram's reels and *at the same time*, OpenAI's highest-tier and paid Codex model will go off for 1 hour to coherently restructure an entire code base, or find and exploit vulnerabilities in computer systems. This part really works and has made dramatic strides because 2 properties: 1) these domains offer explicit reward functions that are verifiable meaning they are easily amenable to reinforcement learning training (e.g. unit tests passed yes or no, in contrast to writing, which is much harder to explicitly judge), but also 2) they are a lot more valuable in b2b settings, meaning that the biggest fraction of the team is focused on improving them. So here we are.

English
0
0
1
10
Pankaj Kumar
Pankaj Kumar@pankajkumar_dev·
Codex, please don't be Antigravity or Claude Code.
Pankaj Kumar tweet media
English
9
1
89
7.1K
nwptz.eth retweetledi
OpenAI
OpenAI@OpenAI·
We’re updating our ChatGPT Pro and Plus subscriptions to better support the growing use of Codex. We’re introducing a new $100/month Pro tier. This new tier offers 5x more Codex usage than Plus and is best for longer, high-effort Codex sessions. In ChatGPT, this new Pro tier still offers access to all Pro features, including the exclusive Pro model and unlimited access to Instant and Thinking models. To celebrate the launch, we’re increasing Codex usage for a limited time through May 31st so that Pro $100 subscribers get up to 10x usage of ChatGPT Plus on Codex to build your most ambitious ideas.
English
1.3K
1.4K
16K
5.2M
nwptz.eth
nwptz.eth@nwptz·
@vikhyatk Just change the email locally once. They won't overwrite it. It's likely happened if you let codex set up the project workspace and git for you.
English
0
0
0
471
vik
vik@vikhyatk·
sad to see codex adopt the most annoying claude code feature (advertising itself all over your commits, branch names, PR titles & descriptions)
English
18
3
218
35.6K
nwptz.eth
nwptz.eth@nwptz·
GPT-5.3-codex high is still my first choice for code. let GPT-5.4 do the planning.
English
0
0
0
37
nwptz.eth
nwptz.eth@nwptz·
@andreasrtobing_ yang paling sulit adalah ngajarin usernya. selevel openclaw juga tetep ga user friendly buat normies yang pengen "punya agent" buat bantu hidupnya. mau pake gemini cli free juga kalo bisa wrap jadi user friendly, banyak yang mau. makanya skrg lagi coba bikin wrapper buat normies
Indonesia
0
0
0
360
andreasrtobing.hl
andreasrtobing.hl@andreasrtobing_·
Gw lagi building sesuatu yang bikin gw sadar satu hal… AI agent itu bukan soal seberapa pinter dia. Tapi seberapa *berguna* dia buat workflow yang udah ada di hidup lu. Kebanyakan orang build agent dari nol. Nyari tools. Nyambung-nyambungin sendiri. Gagal. Ngulangin lagi. Padahal masalahnya bukan di agennya — masalahnya di *infrastruktur* yang ngehubungin semua bagian itu. Ini yang gw pelajarin setelah beberapa minggu eksperimen: Agent yang beneran berguna itu bukan yang paling canggih. Tapi yang bisa *di-deploy* cepet, bisa dipakai orang lain, dan ga butuh lu jelasin 30 menit sebelum dipakai. Gw penasaran, kalian yang lagi coba build AI agent, bagian mana yang paling bikin frustrasi? Setup awal? Nyambungin tools? Atau justru pas coba kasih ke orang lain buat dipake?
Indonesia
17
6
111
8.8K
nwptz.eth
nwptz.eth@nwptz·
@cryptodizcorvus bisa juga suruh agentsnya define sendiri low output mode. taruh di agents.md kalo gamau install2.
Indonesia
0
0
0
151
Diz
Diz@itsdizcorvus·
Banyak yang ngeluh karena penggunaan token LLM yang berlebihan, terus gimana caranya biar ga boncos?? Saran gw coba install RTK dulu sebelum interaksi sama modelnya. RTK (Rust Token Killer) ini tools yang ngefilter output command sebelum masuk ke otak AI kalian. Jadi informasi yang dimakan AI bisa turun 60-90% tanpa kehilangan konteks penting. Cara kerjanya simpel. Biasanya kalau AI jalanin perintah kayak cek status project, output mentahnya panjang banget padahal yang penting cuma beberapa baris. RTK otomatis buang bagian yang gak penting, compress sisanya, terus kasih versi ringkas ke AI. Informasinya sama, tapi jauh lebih hemat. Contoh nyata: dalam satu sesi 30 menit, pemakaian token bisa turun dari 118,000 jadi 23,900. Itu hemat 80%. Cara installnya gampang. Tinggal suruh AI agent kalian yang udah terintegrasi sama OpenClaw: "Install RTK (Rust Token Killer) di sistem ini. Caranya jalankan brew install rtk lalu rtk init -g. Setelah itu restart session." atau " Pelajari dan install RTK dari link ini {sertakan link rtk github}" *Linknya gw taruh di kolom komentar Buat yang pake OpenClaw atau AI coding tools tiap hari, ini berguna bange. API bill kalian bakal bilang makasih.🤣 DYOR, NFA.
Diz tweet media
Twips | AI & Crypto@TwipsX

Pernah ga kalian lagi chat-an sama AI, eh tiba-tiba berhenti dan diminta buat tunggu sampai beberapa jam kemudian? Sebenernya itu bukan bug, tapi "Token" kalian lagi habis. Nah, disini gua bakal kasih kalian 7 tips cara bikin token kalian jauh lebih efisien, jadi kalian bisa tetep pakai AI gratis lebih lama. Jangan lupa di save 🧵

Indonesia
10
48
428
16.5K
nwptz.eth
nwptz.eth@nwptz·
Imagine the hype if twitter blue allow X Oauth for @grok models. It's a mass onboarding in a short timeframe.
English
1
0
0
18
Charles Packer
Charles Packer@charlespacker·
I don't get how this works. Claude OAuth + OpenClaw (OSS) = banned Claude OAuth + "personal software" using Agent SDK = OK So if I fork OpenClaw to use Agent SDK under the hood, this is OK? Or is it only OK for "personal use", and if I sell it, I am banned from allowing end-users to connect their own Claude OAuth? I think the question people want to know is "can businesses allow end-users to bring-their-own Claude plan, if using the Agent SDK under the hood"?
Boris Cherny@bcherny

@EricBuess Yep, working on improving clarity here to make it more explicit

English
51
12
422
182.6K
Vaibhav (VB) Srivastav
Vaibhav (VB) Srivastav@reach_vb·
ICYMI: you can use your ChatGPT sub with OpenClaw, OpenCode, Pi, Cline and a lot more! Infact you can double down and build your own interfaces on top of the ChatGPT Sub via the Codex App Server too - it’s fully open source Enjoy your Claw & build things you want, when you want
Vaibhav (VB) Srivastav tweet media
Boris Cherny@bcherny

Starting tomorrow at 12pm PT, Claude subscriptions will no longer cover usage on third-party tools like OpenClaw. You can still use these tools with your Claude login via extra usage bundles (now available at a discount), or with a Claude API key.

English
42
28
453
69.8K
nwptz.eth
nwptz.eth@nwptz·
@shadcn in progress building one! codex-app-server is open source so this is possible!
English
0
0
0
361
shadcn
shadcn@shadcn·
I need Chat in Codex. Codex UI + ChatGPT.
English
106
19
1.2K
96.2K
nwptz.eth
nwptz.eth@nwptz·
@steipete Same nowadays. I created a simple rule called 'propose mode' - 1st cycle , full proposal after task given - any additional cycles, just list delta what changed/added in the proposal - agent will implement only when the user says 'proceed' Agent answers, no questions.
English
1
0
0
522