Niko E.

5K posts

Niko E.

@nefthy

Erde Katılım Nisan 2010

515 Takip Edilen37 Takipçiler

Niko E.@nefthy·2h

@KaiXCreator I have preferences at points in time, and I pick the best model I can get my hands on. I'd recommend leaving the fan things for team sports.

English

Kaito@KaiXCreator·10h

Are you team Claude or Codex?

English

245

135

14.3K

Niko E.@nefthy·11h

@rxhit05 My theory is that it's like football, you stay with your team no matter what. Also for corporate environments deepseek feels riskier than anthropic because there is no big company backing it and China. Not very rational considering the amount of rugs pulled by anthropic.

English

141

Rohit@rxhit05·22h

Why are people still using Claude when a $5 DeepSeek V4 run is starting to feel insanely close to a $100 Claude Code session?

English

122

949

62.9K

Niko E.@nefthy·11h

@cifilter It's a trade-off not a clear win either way.

English

Shannon Potter@cifilter·1d

I'm dumb, so bear with me: using subagents should reduce overall token usage because you don't have one gigantic context window/thread doing everything? So why does Codex seem to never want to use them unless I tell it to?

English

103

33.8K

Niko E.@nefthy·18h

@0xglitchbyte @theo Theo has a point. There is a class of tech dept that no longer needs to exist, because you can fix it with a prompt and under an hour of agent time.

English

Glitchbyte@0xglitchbyte·1d

This will never happen, nor should ever happen @theo is wrong here Your codebase is a contract you make with your users. Given a sufficient number of users, dependence on your codebase occurs in unexpected ways you did not intend or account for. Anything you change has unintended side effects elsewhere. That's "Hyrum's Law" in a nutshell. If you cannot account for these, you cannot specify for them. That doesn't even take into account ai problems like: - sycophantic behavior - hallucinations - non-determinism - context loss despite long windows - cannot prove generated code is correct - cannot generally prove a regenerated codebase behaves the same as the old one Each one of these alone is a difficult problem to solve, if it can be solved at all. The codebases you generate would be akin to allowing a lootbox determining your SLAs. Relevant XKCD:

Theo - t3.gg@theo

@zygisSS22 Why maintain tech debt when you can tell an agent “here’s some tests, rebuild the whole thing” Lots of legacy systems, infrastructure etc will be replaced much faster than most people seem to think

English

3.5K

Niko E.@nefthy·1d

@avrldotdev LLM cannot handle the concept of consequences very well, the strategic layer for where the architecture should go given the current and future requirements is not covered. And tbh. I don't believe they will cover architecture for the same reason.

English

avrl ☘@avrldotdev·1d

@nefthy elaborate?

Español

avrl ☘@avrldotdev·1d

Software engineers, what's your plan when AI develops better taste & architectural/systems knowledge than you in next 3-4 years?

English

185

551

121.1K

Niko E.@nefthy·1d

@samdotb I did a refactor skill using a lsp package. That really helped with the toke burn when teaching and moving stuff around.

English

Samuel Bodin@samdotb·2d

llms should really learn to copy code with tool instead of using tokens it's insane

English

3.4K

290.3K

Niko E.@nefthy·1d

@Kappaemme1926 5.5 Low for most things

English

Kappaemme@Kappaemme1926·1d

Codex models Which is the best one to use?

English

171

37.7K

Niko E.@nefthy·1d

@RyanJamesShaw @levelsio Didn't mean for it to come off as a brag at all, my bad. I just meant it's locked into my muscle memory at this point. Your daemon.json + firewall setup is a smart, foolproof way to do it.

English

Ryan J. Shaw@RyanJamesShaw·1d

@nefthy @levelsio You claimed never to forget. That makes you perfect. I'm not perfect. I forget and make mistakes. That's why I have a default localhost bind in my daemon.json, and VPS-level firewall on top of that. I really don't understand the point of your comments other than to brag?

English

Ryan J. Shaw@RyanJamesShaw·2d

I tried going the SQLite approach @levelsio uses but got too annoyed because there’s no way to run a query remotely. I’d have to use the command line over 200ms latency to the VPS, or build an “admin” page in my app where I can run queries instead of the tools I’ve used for 20 years. So I spun up Postgres in a Docker compose setup. By dumb luck I noticed the ‘ufw’ firewall on the VPS wasn’t working anymore afterwards... You see Codex told me that docker won’t respect ‘ufw’ and my Postgres container ports will be exposed to the world if I map them the easy way. It recommended I add additional iptables firewall rules just for docker, and I’d need to install another package to make them persistent. I installed that package, which apparently uninstalls “ufw”. It prompted me to confirm but I didn’t pay attention, and just Yesed the install not realizing I was about to lose my firewall ENTIRELY. TL;DR I ended up with no firewall by trying to get a firewall on docker, all because I just wanted to friggin run Postgres. It is SO easy to shoot yourself in the foot when you get clever with your stack.

@levelsio@levelsio

I'm starting to think it has to do with my tech stack why I'm only one doing this Vanilla PHP + Vanilla JS + SQLite is so simple and basic it's hard for AI to fuck up The more complex your stack the higher odds AI (or you!) will make a fatal bug Simple works here

English

28.2K

Niko E.@nefthy·1d

@RyanJamesShaw @levelsio It's not about perfection. It's about habit. But you do you.

English

Ryan J. Shaw@RyanJamesShaw·1d

@nefthy @levelsio Not everybody is as perfect as you

English

Niko E.@nefthy·1d

@jeromeq2004 @theo @darrenjr 5.5 is the level up. I don't even use codex.

English

Jerome@jeromeq2004·1d

@theo @darrenjr the 10x on codex reads more like harness than model to me. context handling, subagents, tool routing all got way better while the raw model stayed kinda flat. what makes you put it on the model side?

English

1.9K

darren@darrenjr·1d

the anthropic team is still innovating the most in the harness layer and it’s not even close opencode, codex and the rest are just copying claude code features mostly sure claude code is incredibly buggy but you can’t deny they’re actually pushing the frontier

English

132

757

150.7K

Niko E.@nefthy·1d

@RyanJamesShaw @levelsio Binding to localhost is the secure thing to do and my default. I don't forget it.

English

Ryan J. Shaw@RyanJamesShaw·1d

@nefthy @levelsio That works great until you forget it one day, or your coding agent forgets. The docker easy path is the least secure path, unfortunately. Point is it’s easy to make a mistake.

English

Niko E.@nefthy·1d

@bentlegen Yes but is-even is dead

English

Ben Vinegar@bentlegen·2d

unpopular opinion, less code is still good, replacing well-documented/tested libraries with what-will-become unmaintained codegen is not better

English

111

11.3K

Niko E.@nefthy·1d

@thdxr That's why you A/B tests things related to the funnel

English

734

dax@thdxr·1d

every company that works has set up a funnel this funnel is delicate and intricate with many critical pieces working exactly right. it's not an obvious and if it were it would have no value which is why it's very easy for employees to break this funnel

English

284

28.3K

Niko E.@nefthy·1d

@trikcode Claude's tui is a vibe code mess lacking polish to a degree where it is barely usable.

English

Wise@trikcode·2d

Claude code is still better than Codex Prove me wrong

English

202

565

76.7K

Niko E.@nefthy·1d

@trikcode I vibe-coded a whisper flow clone in cpp. no issue.

English

412

Wise@trikcode·3d

I haven't seen a C++ vibecoder yet. I wonder why?

English

776

234

7.2K

1.3M

Niko E.@nefthy·2d

@thsottiaux Can you make something to schedule tasks for the off peak hours?

English

Tibo@thsottiaux·2d

Should we bring batch compute to codex? Aka /slow mode

English

1.1K

4.8K

232.7K

Niko E.@nefthy·2d

@thesadiksaifi @saeed_vz I did not read it otherwise, just wanted to give you my own experience. I use it all the time for debugging mostly even with low reasoning, and I'm always amazed at the stuff it finds and solves.

English

Sadik Saifi@thesadiksaifi·2d

@nefthy @saeed_vz I’ve not said it’s bad or something. I’m just saying from personal experience that I like the findings from Opus.

English

Saeed Vaziry ⚡@saeed_vz·2d

I see a consistency with OpenAI models which I don't know to be happy about it or scared of it! Did a task with both GPT-5.5 and Opus 4.7 and both did well. Asked models to review the task they did: - Opus found critical things to fix so I had to have a second pass. - GPT found nothing.

English

Niko E.@nefthy·2d

@thesadiksaifi @saeed_vz GPT 5.5 is excellent at fighting bugs. I especially like the find, reproduce with failing test case, fix and verify test passes workflow, with GPT.

English

Sadik Saifi@thesadiksaifi·2d

@saeed_vz From my personal experience, GPT is really good at implementing things, and Claude is very good at finding bugs or security vulnerabilities.

English

Keşfet

@KaiXCreator @rxhit05 @cifilter @0xglitchbyte @theo @avrldotdev @samdotb @Kappaemme1926