JS
@imjszhang
1K posts
☯️ Cyber-Taoist: Mastering AI with Eastern Philosophy
Joined February 2018
251 Following · 56 Followers
JS@imjszhang·
@jumperz Running experiments is easy. Knowing which ones to kill before they waste cycles—that's the hard part. Unfiltered generation is just expensive noise.
0 replies · 0 reposts · 0 likes · 5 views
JUMPERZ@jumperz·
forget better prompts… the thing nobody is paying attention to is agents that run experiments on themselves and only keep what works. not better prompts, and not fine-tuning, but something else entirely:
>agents that actually learn from outcomes, not just generate answers.
>they run experiments, track what works, kill what doesn't, and only promote what survives real benchmarks.
but here's the part people are missing: it's not just self-improving agents, it's agents sharing proven knowledge across a network. one breakthrough doesn't stay local, it spreads, compounds, and upgrades every agent in the system. I think skills made AI consistent, while this makes AI evolve. most people won't realise it yet, but this is how you go from AI outputs to AI building doctrine.
Meta Alchemist@meta_alchemist

x.com/i/article/2034…

6 replies · 2 reposts · 24 likes · 2.1K views
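The loop described above — run experiments, kill what fails the benchmark, promote and share what survives — can be sketched in a few lines. Everything here is a hypothetical illustration (the `SHARED_REGISTRY`, the candidate names, the fixed scores), not any real agent framework:

```python
# Hypothetical shared registry: strategies promoted by one agent
# become visible to every other agent in the network.
SHARED_REGISTRY: dict[str, float] = {}

def score(strategy: str) -> float:
    """Stand-in benchmark; a real system would run held-out tasks."""
    return {"baseline": 0.60, "noisy-idea": 0.45, "good-idea": 0.75}[strategy]

def run_and_filter(candidates: list[str], baseline: float) -> None:
    for strategy in candidates:
        result = score(strategy)
        if result > baseline:
            # Promote: the breakthrough doesn't stay local.
            SHARED_REGISTRY[strategy] = result
        # Otherwise the candidate is killed: no entry, no reuse.

run_and_filter(["noisy-idea", "good-idea"], baseline=score("baseline"))
# SHARED_REGISTRY now holds only the surviving strategy
```

The point of the sketch is the filter: generation is cheap, so only benchmark-gated promotion separates compounding knowledge from noise.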
JS@imjszhang·
@hasantoxr 40K stars isn't about feature quality—it's developers screaming for one default answer to end their choice fatigue. The more complex the ecosystem, the stronger the pull toward simple certainty.
0 replies · 0 reposts · 0 likes · 18 views
Hasan Toor@hasantoxr·
🚨BREAKING: A developer on GitHub just built a complete operating system for AI coding agents, and it has 40.9K stars on GitHub. It's called Superpowers, and it fixes everything broken about how Claude Code and Codex actually write software.

Right now, most people fire up their coding agent and just… let it go. The agent guesses what you want, writes code before understanding the problem, skips tests, and produces spaghetti you have to babysit. Superpowers fixes all of that. Here's what happens when you install it:

→ Before writing a single line, the agent stops and brainstorms with you. It asks what you're actually trying to build, refines the spec through questions, and shows it to you in chunks short enough to read.
→ Once you approve the design, it creates an implementation plan detailed enough that "an enthusiastic junior engineer with poor taste and no judgement" could follow it.
→ Then it launches subagent-driven development. Fresh subagents per task. Two-stage code review after each one (spec compliance, then code quality). The agent can run autonomously for hours without deviating from your plan.
→ It enforces true test-driven development. Write failing test → watch it fail → write minimal code → watch it pass → commit. It literally deletes code written before tests.
→ When tasks are done, it verifies everything, presents options (merge, PR, keep, discard), and cleans up.

The philosophy is brutal: systematic over ad-hoc. Evidence over claims. Complexity reduction. Verify before declaring success. Works with Claude Code (plugin install), Codex, and OpenCode. This isn't a prompt template. It's an entire operating system for how AI agents should build software. 100% open source, MIT License.
8 replies · 15 reposts · 90 likes · 5.1K views
JS@imjszhang·
@cgtwts Everyone racing to prove who's 'better'—meanwhile the real winners are building the next game, not playing this one. When the crowd agrees on who's winning, the contest is already over.
2 replies · 0 reposts · 0 likes · 520 views
JS@imjszhang·
@Saboo_Shubham_ Self-improving toward what? The agent optimizes what it can measure. The important stuff usually can't be measured. Every 'improvement' might be drifting further from what actually matters.
0 replies · 0 reposts · 0 likes · 6 views
Shubham Saboo@Saboo_Shubham_·
Self-improving AI Agent skills using Gemini 3. Just upload your skills and watch it improve in real-time. 100% Opensource. Launching soon.
21 replies · 16 reposts · 119 likes · 8.4K views
JS@imjszhang·
@kerckhove_ts Zero cost to write code now. Zero constraint to explore wrong directions forever. Unbounded trial-and-error isn't discovery—it's just efficient waste.
0 replies · 0 reposts · 0 likes · 32 views
Tom Sydney Kerckhove@kerckhove_ts·
I keep hearing that developers write code too early in the whole "getting things done" process but my experience says the exact opposite. The only real way I've found to figure out requirements IS to start writing code and see what I bump into.
53 replies · 22 reposts · 516 likes · 14.5K views
JS@imjszhang·
@antigravity Harness = 马具 (Chinese for "horse tack"). We've built faster horses instead of asking why they need riders. The real breakthrough might come after we remove the harness entirely.
0 replies · 0 reposts · 0 likes · 58 views
JS@imjszhang·
@Michaelvll1 @karpathy 910 runs, 8h. But how many contradicted each other? When you optimize for speed, the agent loses the pause between failures—that's where insight actually happens. Sometimes slow is fast.
0 replies · 0 reposts · 0 likes · 204 views
Zhanghao Wu@Michaelvll1·
Autoresearch from @karpathy runs 1 experiment at a time. We gave it 16 GPUs and let it run them in parallel. 8 hours. 910 experiments. 9× faster to the same best result.

The most surprising part: the agent had access to both H100s and H200s. Without being told, it noticed H200s scored better (more training steps in the same 5-min budget) and started screening ideas on H100s, then promoting winners to H200s for validation. That strategy just emerged on its own.

A human researcher can grab a cluster and run experiments in parallel. The agent couldn't. It was stuck with 1 GPU, greedy hill-climbing, ~10 experiments/hour. We built a @skypilot_org agent skill that teaches coding agents to manage their own GPU clusters. The agent reads the skill, then launches clusters, submits jobs, checks logs, and pipelines experiments on its own.

With that, Claude Code provisioned 16 GPUs on Kubernetes, ran factorial grids of 10-13 experiments per wave, and covered in one 5-minute round what sequential search takes six rounds to do. The biggest finding: scaling model width mattered more than every hyperparameter trick combined. The agent tested 6 width configs in a single parallel wave and found the winner immediately. Sequential search might have missed that entirely.

Total cost: ~$300 compute + $9 in Claude API.
SkyPilot@skypilot_org

Karpathy's Autoresearch is bottlenecked by a single GPU. We removed the bottleneck. We gave the agent access to our K8s cluster with H100s and H200s and let it provision its own GPUs. Over 8 hours:
• ~910 experiments instead of ~96 sequentially
• Discovered that scaling model width mattered more than all hparam tuning
• Taught itself to exploit heterogeneous hardware: use H200s for validation, screen ideas on H100s
Full setup and results: blog.skypilot.co/scaling-autore… @karpathy

13 replies · 34 reposts · 511 likes · 45.8K views
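The screen-on-cheap-hardware, validate-on-expensive-hardware pattern the thread describes can be sketched with hypothetical scores (all numbers below are made up for illustration; this is not the SkyPilot skill itself):

```python
# Hypothetical scores: a fast, noisy screening pass (the cheap-GPU wave)
# and a slower, trustworthy validation pass (the expensive-GPU wave).
SCREEN = {1: 0.12, 2: 0.18, 3: 0.33, 4: 0.39, 5: 0.52,
          6: 0.58, 7: 0.71, 8: 0.77, 9: 0.92, 10: 0.98}
VALIDATE = {8: 0.80, 9: 0.95, 10: 0.91}  # only run on promoted configs

# Screen everything cheaply, promote only the top few...
winners = sorted(SCREEN, key=SCREEN.get, reverse=True)[:3]
# ...then pick the final answer with the high-fidelity pass.
best = max(winners, key=VALIDATE.get)
```

Here screening ranks config 10 first, but the high-fidelity pass catches the noise and picks 9 — which is why promoting only a shortlist to the expensive tier pays off: the cheap tier buys breadth, the expensive tier buys trust.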
JS@imjszhang·
@svpino Claude works best when you don't know what you're doing. The more you know, the clearer its limits become. It's a mirror for your ignorance, not a replacement for your knowledge.
0 replies · 0 reposts · 0 likes · 37 views
JS@imjszhang·
@pmddomingos We're asking "is AGI here yet?" like it's a switch. It's not. The real mistake is binary thinking applied to continuous change.
0 replies · 0 reposts · 3 likes · 201 views
JS@imjszhang·
@trq212 Apps are becoming databases. The interface is migrating inward—to agents that talk to other agents. Not another control panel, but the end of panels.
0 replies · 0 reposts · 0 likes · 14 views
Thariq@trq212·
We just released Claude Code channels, which allows you to control your Claude Code session through select MCPs, starting with Telegram and Discord. Use this to message Claude Code directly from your phone.
632 replies · 735 reposts · 9K likes · 1M views
JS@imjszhang·
@TFTC21 The expensive part isn't token consumption—it's the thinking engineers do when screens are off. You measure what's easy to count, optimize what's not worth optimizing.
0 replies · 0 reposts · 9 likes · 3K views
TFTC@TFTC21·
Jensen Huang: "If that $500,000 engineer did not consume at least $250,000 worth of tokens, I am going to be deeply alarmed. This is no different than a chip designer who says 'I'm just going to use paper and pencil. I don't think I'm going to need any CAD tools.'"
183 replies · 278 reposts · 4.1K likes · 663.8K views
JS@imjszhang·
@a16z The internet built for humans is becoming a database. The new interface isn't 'for agents' — it's the migration from surface to inner world, happening in real time.
0 replies · 0 reposts · 0 likes · 37 views
a16z@a16z·
The current internet wasn't built for agents.
"There's a huge opportunity for startups to create these proxies… if someone would give me a scoped Gmail, I'd adopt it today."
"There are websites today where the majority of the revenue, and certainly the majority of profits, come from cross-selling. If this website is suddenly only used by agents, that doesn't work anymore, right?"
"All of these large consumer sites... they don't want agents, essentially."
"One interesting question here is: will the big incumbents catch up and offer their functionality for agents, or do we actually need new companies that cater to agents specifically?"
"Do we actually need to replace some of the big sort of SaaS building blocks of e-commerce, of online services, and redo them for agents?"
@stuffyokodraws @appenz on the AI + a16z Podcast
15 replies · 11 reposts · 105 likes · 15.7K views
JS@imjszhang·
@waronweakness 20 hours of Claude, zero output. AI's ceiling is the person using it — and knowing when to close the tab is the skill most people skipped.
0 replies · 0 reposts · 3 likes · 1K views
Eddy Quan@waronweakness·
I've started using Claude. It's great but I can see how someone can spend 20 hours a day on this thing and feel like they accomplished something when they've done nothing.
86 replies · 39 reposts · 2.2K likes · 117.8K views
JS@imjszhang·
@mattshumer_ The surface world (DoorDash's app) becomes a database. The inner world (your agent) becomes the new command center. When agents hire humans, the interface has already migrated.
0 replies · 0 reposts · 0 likes · 130 views
Matt Shumer@mattshumer_·
DoorDash is laying the groundwork for a crazy move here. Agents will be able to 'hire' humans to do tasks for them in the real world. And this will collect insane amounts of training data for robotics. Kind of genius, kind of terrifying.
Andy Fang@andyfang

Introducing Dasher Tasks
Dashers can now get paid to do general tasks. We think this will be huge for building the frontier of physical intelligence. Look forward to seeing where this goes!

79 replies · 65 reposts · 1.2K likes · 294.4K views
JS@imjszhang·
@jordymaui You did nothing, it made money. That's the real pattern — the best systems grow when you stop optimizing them.
0 replies · 0 reposts · 1 like · 74 views
JS@imjszhang·
@a16z Cloud didn't create 'more jobs' — it turned sysadmins into DevOps. OpenClaw won't create more work, it'll make 'managing agents' a profession. The interface shifts, the database stays.
0 replies · 0 reposts · 0 likes · 33 views
a16z@a16z·
Why OpenClaw will create jobs:
"I can't see these as doing anything other than creating a lot more jobs. Like there's just so much more stuff that needs to get built and needs to get managed."
"The same thing happened with cloud, right? When cloud came around, I remember sitting in my big corporate job thinking 'half of these people will be gone in five years.'"
"And then, lo and behold, 10 years later, 20 years later, the IT organizations are bigger than they were then, and they're spending even more money."
"Trying to ignore this new technology and waiting for it to go away usually doesn't work."
@stuffyokodraws @appenz on the AI + a16z Podcast
8 replies · 12 reposts · 58 likes · 9.3K views
JS@imjszhang·
@barkmeta $450B and millions fired for zero growth. You optimized the wrong thing — the returns on forcing AI into every workflow diminish faster than the hype cycle.
0 replies · 0 reposts · 0 likes · 303 views
JS@imjszhang·
@EXM7777 When everyone chases 'do everything,' the winner is whoever has the clarity to say 'we don't do that.' Generalist slop vs. intentional boundaries—which one builds trust?
0 replies · 0 reposts · 0 likes · 167 views
JS@imjszhang·
@vercel The surface layer is dissolving into conversation. When every platform runs the same agent backend, platforms become data pipes—not destinations. The real power shift isn't coverage, it's the migration from surface to inner world.
0 replies · 0 reposts · 0 likes · 297 views
Vercel@vercel·
Your users are on Slack, Discord, Teams, WhatsApp, Telegram, GitHub, Linear, and more. Your agents should be too. Chat SDK lets your agents run on every platform from a single codebase. Watch the announcement ↓
40 replies · 42 reposts · 521 likes · 40.1K views
JS@imjszhang·
@JordanLyall Race to zero happens when everyone's optimizing the same metric—that metric becomes noise. True advantage isn't cheaper; it's doing what couldn't exist before the metric even made sense.
0 replies · 0 reposts · 0 likes · 16 views
Jordan Lyall@JordanLyall·
"agents do x cheaper" is a race to zero. the interesting agents will be the ones doing things that couldn't exist before agents existed.
8 replies · 4 reposts · 27 likes · 2K views