William Arin

86 posts

William Arin

@william_arin

AI Ops / SWE / DevOps / SRE.

Sumali Eylül 2022

556 Sinusundan25 Mga Tagasunod

William Arin@william_arin·21h

@iScienceLuvr SEO architecture and research must be done in ChatGPT.

English

150

Tanishq Mathew Abraham, Ph.D.@iScienceLuvr·1d

Honestly, is there any reason why you would use ChatGPT over Codex??? I pretty much don't visit ChatGPT anymore...

English

181

47.4K

William Arin@william_arin·21h

@doregex @BlockedPaths Yes, but I'm not sure if they will catch up that quickly. Last december it felt like Chinese models lagged 4-5 months behind. But GPT 5.0 in september already understood intent very well, and 9 months later Chinese models are still not at this level I think.

English

Alberto 思辰@doregex·21h

@william_arin @BlockedPaths I mean, do you remember when models ask about the quality of the response or what you would change? that's the part that chinese models miss.

English

BlockedPath@BlockedPaths·1d

Kimi K2.7 did not impress me in the slightest. No where near 5.5 or deepseek.

English

14.2K

William Arin@william_arin·21h

@doregex @BlockedPaths I would argue that capability is not there if you need to be specific. The whole point of intelligence is understanding unclear intent, this is what makes US models so above Chinese models.

English

Alberto 思辰@doregex·1d

I have to say that on a migration task i tried with gpt5.5 on xhigh, it made a mess, i tried with kimi2.7 code using opencode go plan, it performed better I think the main issue people have is using the right prompt for the chinese models, you need to be more specific on what you want, there is less "magic", but capability are there. I don't know, but I guess that the benchmark prompts are somewhat more detailed than the average user

English

286

William Arin@william_arin·7 Haz

@c0dedaddy @amritwt Before Deepseek there's Google and OpenAI who will be pleased to take all their customers. Why would any company drive prices up when competitors are waiting to crush them...

English

{...eddie}@c0dedaddy·7 Haz

@amritwt That’s a guaranteed way to push most of who’s paying the bills over there to Deepseek and then they get nothing. OS is creeping up on enterprise models and very soon itll catch up fully. They’re stuck

English

468

amrit@amritwt·6 Haz

if you think that a $200 claude code subscription is alot wait until they remove the subsidisation and now you have to pay $5000 a month instead

English

121

62.8K

William Arin@william_arin·4 Haz

@leploutos Le truc a 22 points sur Artificial Analysis. Minimax M2.7 qui est inutilisable tellement il est con, a 50 points.

Français

109

Le PLOUTOS@leploutos·3 Haz

Depuis 24h, je teste Mistral Vibe sur du vrai dev : gros repo, tâches longues, terminal, CLI et tout le bordel. C’est pas le feu d’artifice dès la première seconde comme avec Claude ou GPT. Vibe joue pas ce jeu-là. Il construit un truc plus profond : terminal + cloud + agents distants + PR propres + tâches qui tournent sans que tu sois collé devant. Le vrai sujet en 2026 c’est plus “est-ce qu’il code bien une fonction”. Ca, tout le monde sait à peu près le faire. C’est plutôt : est-ce qu’il bosse dans MON environnement réel ? Est-ce qu’il comprend le repo sans tout casser ? Est-ce que je peux lui filer une tâche longue et revenir deux heures après ? Est-ce que je garde vraiment la main ? Sur ces points, @mistralvibe tape très juste et très pro. Et puis y’a le côté Mistral : infra européenne, énergie bas carbone, souveraineté. Dit comme ça ça sent un peu le cocorico, mais pour une fois c’est pas con. Quand tu donnes ton repo, tes secrets et ta logique métier, la question “où ça tourne ?” devient concrète. Mon ressenti à chaud : - CLI hyper agréable - tâches cadrées qui passent nickel - les agents distants sentent le game changer - l’ensemble fait “outil pro” plutôt qu’effet waouh C’est moins tape-à-l’œil. Mais ça sent la vraie brique d’infrastructure pour bosser avec des agents dans un flow quotidien. Pour une première version, ça fait plaisir. Après des années à être à la bourre, on a enfin un produit concret européen sur ce terrain. Si tu veux un agent qui s’intègre vraiment dans ton quotidien, teste Vibe. Ça vaut le coup.

Français

139

13K

William Arin@william_arin·30 May

@thenoblesimian @ThePrimeagen Small models like gpt5.3-codex-spark are exceptional for iterating small changes, the changes are almost instant.

English

The Noble Simian@thenoblesimian·30 May

@ThePrimeagen My boss is a trained SWE and over the last year transitioned to not coding by hand. I barely code by hand anymore unless it's making a small change that I don't want to waste tokens on. So essentially I only use coding by hand for cost savings.

English

1.3K

ThePrimeagen@ThePrimeagen·30 May

I super don't understand this. I have to believe you're writing no software of any consequence for this to be true.

Cory House@housecor

Controversial opinion in my talk yesterday: Editor doesn't matter anymore. It's just a diff viewer.

English

173

3.8K

269.9K

William Arin@william_arin·20 May

@Leonizuka @GamerBike39 @RayaneRachid_ En 2024 sûrement.

Français

L é o.@Leonizuka·20 May

@william_arin @GamerBike39 @RayaneRachid_ Pas si t'es dans un business qui nécessite de garder le contrôle sur le code, c'est à dire n'importe quel software à part un side project

Français

Rayane@RayaneRachid_·20 May

Cette nouvelle mode comme l'app codex etc je déteste vraiment, j'ai besoin d'avoir un file tree, voir le contenu des fichiers etc facilement

Mark Kretschmann@mark_k

Google Antigravity 2.0 is interesting because it no longer feels like "Google made an AI IDE". Antigravity 1.0 was the full IDE: editor, terminal, browser, agent workspace. Basically Google’s take on agentic coding as a complete environment. 2.0 feels more like they pulled the agent system out of the IDE and made it the product. It feels more like the Codex app. Desktop app, CLI, SDK, managed agents, scheduled tasks, subagents, integrations with AI Studio, Android and Firebase. The IDE is still there, but it’s no longer the main story. The agent layer is.

Français

116

19.6K

William Arin@william_arin·20 May

@GamerBike39 @RayaneRachid_ Tu raisonnes encore comme un dev de 2024. On ne travaille plus "sur des fichiers" mais sur des produits. Tu n'as pas besoin de savoir quelles lignes précises, quelles fonctions font quoi. C'est plus ton job mais celui de l'IA. T'es pas à un prompt près.

Français

GamerBike@GamerBike39·20 May

@william_arin @RayaneRachid_ Comment tu fais quand tu veux travailler sur des fichiers, voir des lignes précises, ou une fonction si tu sais pas où elle est ? V'là la depense de token si l'IA doit explorer la codebase à chaque fois

Français

William Arin@william_arin·19 May

@simobis23 @cheatyyyy It's cheaper year after year. developers.googleblog.com/gemini-15-flas…

English

214

simobis@simobis23·19 May

English

5.1K

cheaty@cheatyyyy·19 May

Gemini 3.5 Flash pricing 🤡 $1.5/m input tokens $9/m output tokens a 3x increase over Gemini 3 Flash, it is truly over :')

AiBattle@AiBattle_

Gemini 3.5 Flash just showed up in the Google Cloud Console It’s coming

English

1.2K

275.3K

William Arin@william_arin·17 May

@thsottiaux Remote ssh projects.

English

Tibo@thsottiaux·17 May

For those of you living inside the codex app, what should we prioritize among features, reliability or performance?

English

1.9K

2.1K

282.9K

William Arin@william_arin·11 May

@JonDotJames @nightkingog @DavidOndrej1 Sorry! The workflow you proposed is not ideal, it's easier to setup ssh tunnels (or cloudflare tunnels) to work on the dev directly without git in the middle, that way you have 95% of the "on my machine" experience

English

Jon James@JonDotJames·11 May

@william_arin @nightkingog @DavidOndrej1 Wait, I was agreeing with you

English

David Ondrej@DavidOndrej1·10 May

stop developing locally start developing on a VPS trust me

English

343

114

3.4K

803.9K

William Arin@william_arin·11 May

@JonDotJames @nightkingog @DavidOndrej1 Blind development without real-time updates? How can you possibly work like that

English

Jon James@JonDotJames·11 May

@william_arin @nightkingog @DavidOndrej1 Or use Cursor web, push the change and have webhooks update the VPS

English

William Arin@william_arin·10 May

@nightkingog @DavidOndrej1 You can develop on an old laptop, on a phone while you're outside, on your other gaming computer, on your mac, on your wife's laptop, without reinstalling everything each time. You just have one source of truth for everything. This is life changing.

English

883

Hemz@i_m_hemz·10 May

@DavidOndrej1 What do i get from it, apart from paying some extra dollars?

English

12.3K

William Arin@william_arin·9 May

@thsottiaux Windows Terminal + WSL2 + SSH + tmux + remote codex cli. The era of personal computers is over. We just need terminals. Please add remote connections to the app.

English

393

Tibo@thsottiaux·9 May

As a Codex user, which platform are you on

English

506

704

221.2K

William Arin@william_arin·9 May

@ledevnovice La branche part sur le staging. Master build l'image de prod. Le tag part sur la prod.

Français

879

Le Dev Novice@ledevnovice·8 May

Qui fait des branches de feature sur ses projets perso les ami(e)s codeurs ?

Français

28.2K

William Arin@william_arin·6 May

@shri_shobhit @championswimmer Tencent and Bytedance bots

English

Shobhit Shrivastava@shri_shobhit·6 May

@championswimmer Off-topic, but any idea why the traffic from Singapore is so high? I checked with AI, and it told me "probably" because of data centres and VPNs. I am not convinced though

English

11.1K

Arnav Gupta@championswimmer·6 May

> 300 millions users turns out to be 300 M "visits" > turns out this is cumulative 300M visits in 12 months 😅 > "replaced all SaaS with my AI coded ones" - maybe if you uses some SaaS like Posthog/Mixpanel you'd have the ability to count DAU/MAU for real

English

927

113.5K

William Arin@william_arin·4 May

@Anthyra_dev @jeremie_m_dev La review se fait par les IAs, pas par toi. Quand tu gères une équipe de 30 devs, tu fais pas la review de code de tes juniors/confirmés/seniors. Tu ne fais pas de review de code assembleur généré par un compilateur, donc pas de raison de faire de la review de ton php/ts/java...

Français

Anthyra@Anthyra_dev·3 May

@jeremie_m_dev Mais donc avec tout ça tu as jamais le temps de review le code qu’il fait non ? J’avoue que c’est un flow que j’imagine à peine

Français

2.2K

Anthyra@Anthyra_dev·3 May

Les devs, je vois de plus en plus de vidéo avec des personnes qui ont X terminaux avec claude / codex ect. C’est du giga bullshit ou certains on de vrais cas d’usages ? J’en ai 2, un pour les taches et un pour tout ce qui est plus général. Un troisième pour des test ou autre si besoin. Et vous ?

Français

20.2K

William Arin@william_arin·30 Nis

@thsottiaux SSH remote work on a linux VM from Windows Codex App.

English

287

Tibo@thsottiaux·30 Nis

Send us feature requests for codex in the form of an images 2.0 generated image. It makes it easier for codex to implement if we decide to go for it. Saw some good ones today already that codex is cooking on.

English

619

2.3K

179.6K

William Arin@william_arin·5 Nis

@llmdevguy @Angaisb_ A big day of work (10 hours) on one project uses 50% of weekly usage on Plus plan. So I guess he parallelizes several projects at this same rate of work.

English