xuandaooo retweetledi
xuandaooo
299 posts

xuandaooo retweetledi
xuandaooo retweetledi

It is hard to communicate how much programming has changed due to AI in the last 2 months: not gradually and over time in the "progress as usual" way, but specifically this last December. There are a number of asterisks but imo coding agents basically didn’t work before December and basically work since - the models have significantly higher quality, long-term coherence and tenacity and they can power through large and long tasks, well past enough that it is extremely disruptive to the default programming workflow.
Just to give an example, over the weekend I was building a local video analysis dashboard for the cameras of my home so I wrote: “Here is the local IP and username/password of my DGX Spark. Log in, set up ssh keys, set up vLLM, download and bench Qwen3-VL, set up a server endpoint to inference videos, a basic web ui dashboard, test everything, set it up with systemd, record memory notes for yourself and write up a markdown report for me”. The agent went off for ~30 minutes, ran into multiple issues, researched solutions online, resolved them one by one, wrote the code, tested it, debugged it, set up the services, and came back with the report and it was just done. I didn’t touch anything. All of this could easily have been a weekend project just 3 months ago but today it’s something you kick off and forget about for 30 minutes.
As a result, programming is becoming unrecognizable. You’re not typing computer code into an editor like the way things were since computers were invented, that era is over. You're spinning up AI agents, giving them tasks *in English* and managing and reviewing their work in parallel. The biggest prize is in figuring out how you can keep ascending the layers of abstraction to set up long-running orchestrator Claws with all of the right tools, memory and instructions that productively manage multiple parallel Code instances for you. The leverage achievable via top tier "agentic engineering" feels very high right now.
It’s not perfect, it needs high-level direction, judgement, taste, oversight, iteration and hints and ideas. It works a lot better in some scenarios than others (e.g. especially for tasks that are well-specified and where you can verify/test functionality). The key is to build intuition to decompose the task just right to hand off the parts that work and help out around the edges. But imo, this is nowhere near "business as usual" time in software.
English
xuandaooo retweetledi

作为严肃跑者,嫌市面工具不顺手,索性Vibe Coding 自己写了个 pbRun。
核心盯 VDOT 跑力和心率区间。不用服务器,每日自动同步 Garmin 数据,0 成本且数据私有,真香!
已开源,欢迎跑友自取👇
🔗 github.com/xuandao/pbRun
🌐 pbrun.vercel.app/analysis
中文

@huanghlzone 握手!🤝 完全同意,自己才是最懂自己痛点的产品经理。Vibe Coding 让我们这些“非标”的小众需求也能被完美满足,这种 0 成本手搓工具的感觉太爽了。
中文
xuandaooo retweetledi

写了一篇很长的DeepSeek R1科普文,从AlphaGo战胜李世石出发,到ChatGPT类模型的训练困境,和我看到DeepSeek R1真正的闪光点,一路贯穿下来竟然写的心潮澎湃。
特别是在OpenAI和Claude的各种操作和言论后,让我更想把R1-Zero对人类的贡献告诉给更多不懂AI的人。
mazzzystar.com/2025/01/30/cha…
中文

@shengxj1 苹果新出的Airpods4,好用的很、像我这种适应不了pro2那种入耳式耳机的人太适合用了、
中文




















