Niko E.

5K posts

Niko E.

Niko E.

@nefthy

Erde Katılım Nisan 2010
515 Takip Edilen37 Takipçiler
Niko E.
Niko E.@nefthy·
@KaiXCreator I have preferences at points in time, and I pick the best model I can get my hands on. I'd recommend leaving the fan things for team sports.
English
0
0
0
5
Kaito
Kaito@KaiXCreator·
Are you team Claude or Codex?
English
245
3
135
14.3K
Niko E.
Niko E.@nefthy·
@rxhit05 My theory is that it's like football, you stay with your team no matter what. Also for corporate environments deepseek feels riskier than anthropic because there is no big company backing it and China. Not very rational considering the amount of rugs pulled by anthropic.
English
0
0
0
141
Rohit
Rohit@rxhit05·
Why are people still using Claude when a $5 DeepSeek V4 run is starting to feel insanely close to a $100 Claude Code session?
English
122
19
949
62.9K
Niko E.
Niko E.@nefthy·
@cifilter It's a trade-off not a clear win either way.
English
0
0
0
7
Shannon Potter
Shannon Potter@cifilter·
I'm dumb, so bear with me: using subagents should reduce overall token usage because you don't have one gigantic context window/thread doing everything? So why does Codex seem to never want to use them unless I tell it to?
English
31
0
103
33.8K
Niko E.
Niko E.@nefthy·
@0xglitchbyte @theo Theo has a point. There is a class of tech dept that no longer needs to exist, because you can fix it with a prompt and under an hour of agent time.
English
0
0
0
11
Glitchbyte
Glitchbyte@0xglitchbyte·
This will never happen, nor should ever happen @theo is wrong here Your codebase is a contract you make with your users. Given a sufficient number of users, dependence on your codebase occurs in unexpected ways you did not intend or account for. Anything you change has unintended side effects elsewhere. That's "Hyrum's Law" in a nutshell. If you cannot account for these, you cannot specify for them. That doesn't even take into account ai problems like: - sycophantic behavior - hallucinations - non-determinism - context loss despite long windows - cannot prove generated code is correct - cannot generally prove a regenerated codebase behaves the same as the old one Each one of these alone is a difficult problem to solve, if it can be solved at all. The codebases you generate would be akin to allowing a lootbox determining your SLAs. Relevant XKCD:
Glitchbyte tweet media
Theo - t3.gg@theo

@zygisSS22 Why maintain tech debt when you can tell an agent “here’s some tests, rebuild the whole thing” Lots of legacy systems, infrastructure etc will be replaced much faster than most people seem to think

English
6
2
47
3.5K
Niko E.
Niko E.@nefthy·
@avrldotdev LLM cannot handle the concept of consequences very well, the strategic layer for where the architecture should go given the current and future requirements is not covered. And tbh. I don't believe they will cover architecture for the same reason.
English
0
0
1
8
avrl ☘
avrl ☘@avrldotdev·
Software engineers, what's your plan when AI develops better taste & architectural/systems knowledge than you in next 3-4 years?
English
185
11
551
121.1K
Niko E.
Niko E.@nefthy·
@samdotb I did a refactor skill using a lsp package. That really helped with the toke burn when teaching and moving stuff around.
English
0
0
0
75
Samuel Bodin
Samuel Bodin@samdotb·
llms should really learn to copy code with tool instead of using tokens it's insane
English
94
59
3.4K
290.3K
Kappaemme
Kappaemme@Kappaemme1926·
Codex models Which is the best one to use?
Kappaemme tweet media
English
89
4
171
37.7K
Niko E.
Niko E.@nefthy·
@RyanJamesShaw @levelsio Didn't mean for it to come off as a brag at all, my bad. I just meant it's locked into my muscle memory at this point. Your daemon.json + firewall setup is a smart, foolproof way to do it.
English
0
0
1
14
Ryan J. Shaw
Ryan J. Shaw@RyanJamesShaw·
@nefthy @levelsio You claimed never to forget. That makes you perfect. I'm not perfect. I forget and make mistakes. That's why I have a default localhost bind in my daemon.json, and VPS-level firewall on top of that. I really don't understand the point of your comments other than to brag?
English
1
0
0
11
Ryan J. Shaw
Ryan J. Shaw@RyanJamesShaw·
I tried going the SQLite approach @levelsio uses but got too annoyed because there’s no way to run a query remotely. I’d have to use the command line over 200ms latency to the VPS, or build an “admin” page in my app where I can run queries instead of the tools I’ve used for 20 years. So I spun up Postgres in a Docker compose setup. By dumb luck I noticed the ‘ufw’ firewall on the VPS wasn’t working anymore afterwards... You see Codex told me that docker won’t respect ‘ufw’ and my Postgres container ports will be exposed to the world if I map them the easy way. It recommended I add additional iptables firewall rules just for docker, and I’d need to install another package to make them persistent. I installed that package, which apparently uninstalls “ufw”. It prompted me to confirm but I didn’t pay attention, and just Yesed the install not realizing I was about to lose my firewall ENTIRELY. TL;DR I ended up with no firewall by trying to get a firewall on docker, all because I just wanted to friggin run Postgres. It is SO easy to shoot yourself in the foot when you get clever with your stack.
@levelsio@levelsio

I'm starting to think it has to do with my tech stack why I'm only one doing this Vanilla PHP + Vanilla JS + SQLite is so simple and basic it's hard for AI to fuck up The more complex your stack the higher odds AI (or you!) will make a fatal bug Simple works here

English
17
0
17
28.2K
Jerome
Jerome@jeromeq2004·
@theo @darrenjr the 10x on codex reads more like harness than model to me. context handling, subagents, tool routing all got way better while the raw model stayed kinda flat. what makes you put it on the model side?
English
1
0
1
1.9K
darren
darren@darrenjr·
the anthropic team is still innovating the most in the harness layer and it’s not even close opencode, codex and the rest are just copying claude code features mostly sure claude code is incredibly buggy but you can’t deny they’re actually pushing the frontier
English
132
11
757
150.7K
Ryan J. Shaw
Ryan J. Shaw@RyanJamesShaw·
@nefthy @levelsio That works great until you forget it one day, or your coding agent forgets. The docker easy path is the least secure path, unfortunately. Point is it’s easy to make a mistake.
English
1
0
0
35
Ben Vinegar
Ben Vinegar@bentlegen·
unpopular opinion, less code is still good, replacing well-documented/tested libraries with what-will-become unmaintained codegen is not better
English
12
5
111
11.3K
Niko E.
Niko E.@nefthy·
@thdxr That's why you A/B tests things related to the funnel
English
0
0
0
734
dax
dax@thdxr·
every company that works has set up a funnel this funnel is delicate and intricate with many critical pieces working exactly right. it's not an obvious and if it were it would have no value which is why it's very easy for employees to break this funnel
English
18
5
284
28.3K
Niko E.
Niko E.@nefthy·
@trikcode Claude's tui is a vibe code mess lacking polish to a degree where it is barely usable.
English
0
0
0
5
Wise
Wise@trikcode·
Claude code is still better than Codex Prove me wrong
English
202
10
565
76.7K
Niko E.
Niko E.@nefthy·
@trikcode I vibe-coded a whisper flow clone in cpp. no issue.
English
0
0
0
412
Wise
Wise@trikcode·
I haven't seen a C++ vibecoder yet. I wonder why?
English
776
234
7.2K
1.3M
Niko E.
Niko E.@nefthy·
@thsottiaux Can you make something to schedule tasks for the off peak hours?
English
0
0
0
3
Tibo
Tibo@thsottiaux·
Should we bring batch compute to codex? Aka /slow mode
English
1.1K
60
4.8K
232.7K
Niko E.
Niko E.@nefthy·
@thesadiksaifi @saeed_vz I did not read it otherwise, just wanted to give you my own experience. I use it all the time for debugging mostly even with low reasoning, and I'm always amazed at the stuff it finds and solves.
English
1
0
1
24
Sadik Saifi
Sadik Saifi@thesadiksaifi·
@nefthy @saeed_vz I’ve not said it’s bad or something. I’m just saying from personal experience that I like the findings from Opus.
English
1
0
0
23
Saeed Vaziry ⚡
Saeed Vaziry ⚡@saeed_vz·
I see a consistency with OpenAI models which I don't know to be happy about it or scared of it! Did a task with both GPT-5.5 and Opus 4.7 and both did well. Asked models to review the task they did: - Opus found critical things to fix so I had to have a second pass. - GPT found nothing.
English
1
0
4
1K
Niko E.
Niko E.@nefthy·
@thesadiksaifi @saeed_vz GPT 5.5 is excellent at fighting bugs. I especially like the find, reproduce with failing test case, fix and verify test passes workflow, with GPT.
English
1
0
1
25
Sadik Saifi
Sadik Saifi@thesadiksaifi·
@saeed_vz From my personal experience, GPT is really good at implementing things, and Claude is very good at finding bugs or security vulnerabilities.
English
1
0
2
74