Aaron

262 posts

@CodeByAaron

24 • 55+ projects shipped • Building AI tools that work while I sleep

United Kingdom · Joined November 2015

117 Following · 31 Followers

Aaron@CodeByAaron·12 Mar

Everyone asks "how do I validate my idea?" Wrong question.
What actually matters in week 1:
→ Talk to 10 people about the problem (not the solution)
→ Find 3 people who've tried to solve it themselves
→ Ask what they used, why it failed, what they wished existed
By the end you'll either know if it's real or know to stop.
Are you validating your idea or just building it?

Aaron@CodeByAaron·9 Mar

This is actually useful. Most AI "guides" are just marketing. Anthropic just dropped 33 pages of the real thing. Building with Claude just got easier.

0xMarioNawfal@RoundtableSpace

Anthropic dropped a 33-page cheat sheet for building Claude skills: resources.anthropic.com/hubfs/The-Comp…

Aaron@CodeByAaron·3 Mar

@PrajwalTomar_ Been doing this with a SOUL.md, USER.md and AGENTS.md setup. Agent reads them before every session. Took an afternoon to write. Saved me weeks of re-explaining context. Most underrated hour you'll spend on any agent setup.
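
A minimal sketch of the "agent reads them before every session" step, assuming the three files sit in one directory and are simply concatenated into a preamble. The function name `load_session_context` and the `## <file name>` header format are hypothetical, not taken from any agent framework.

```python
from pathlib import Path

# File names come from the post; the order and header format are assumptions.
CONTEXT_FILES = ("SOUL.md", "USER.md", "AGENTS.md")


def load_session_context(root: Path, names=CONTEXT_FILES) -> str:
    """Concatenate the context files that exist under `root` into one preamble."""
    sections = []
    for name in names:
        path = root / name
        if path.is_file():  # skip missing files instead of failing the session
            body = path.read_text(encoding="utf-8").strip()
            sections.append(f"## {name}\n{body}")
    return "\n\n".join(sections)
```

The returned string would be prepended to whatever prompt starts the session; how that is passed to the model depends on the agent runtime.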

Prajwal Tomar@PrajwalTomar_·2 Mar

OpenClaw pro tip: Your agent is useless if it doesn't know what you're building.
Most people skip this step and waste weeks building automations that don't move the needle.
Here's the fix: Day 1 setup, tell your agent EVERYTHING:
→ What businesses you run
→ Your current North Star metric
→ What you're shipping this month
→ Where you're bottlenecked
→ What success looks like
This becomes the foundation.
Now before building ANY automation, ask: "Does this help me hit my goals faster?"
If no, don't build it.
If yes: "Show me the fastest way to implement this."
That one filter saves HOURS and hundreds of dollars in burned tokens.
Your agent should know exactly what you're optimizing for. If it doesn't, you're just building random stuff and hoping it sticks.
Set it up right from day 1.

Aaron@CodeByAaron·3 Mar

@EXM7777 The system prompt portability thing is the hidden cost nobody talks about. What works on Claude needs a rewrite for anything else. The knowledge base part is right though, that's the one thing that transfers cleanly.

Machina@EXM7777·2 Mar

if you can't get any work done because Claude is down, you're literally a slave...
your number one priority should be building model-agnostic systems, having a structured knowledge base and system prompts you can swap from one model to another
relying on a single provider is VERY dangerous for your business

Aaron@CodeByAaron·3 Mar

Qwen3.5:35b running locally on dual RTX 3090 Ti cards. 23GB model. MoE architecture. Only 3B parameters active per inference. Near-zero cost per token. Took three hours to get it running. Two full downloads. An Ollama version mismatch. A model tag that didn't exist in the old library. Now it runs fast enough to power all 5 agents as a fallback when the API drops. Local inference isn't the future. It's already the boring default.
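
A rough sketch of the "fallback when the API drops" idea, assuming each backend is just a callable that takes a prompt and returns text. `hosted_api` and `local_model` are stand-ins for illustration; no real client library or Ollama call is shown here.

```python
from typing import Callable, Sequence


def complete_with_fallback(prompt: str, backends: Sequence[Callable[[str], str]]) -> str:
    """Try each backend in order and return the first successful reply."""
    errors = []
    for backend in backends:
        try:
            return backend(prompt)
        except Exception as exc:  # broad on purpose: any failure triggers the fallback
            name = getattr(backend, "__name__", repr(backend))
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all backends failed: " + "; ".join(errors))


# Hypothetical stand-ins: a hosted API that is down and a local model that answers.
def hosted_api(prompt: str) -> str:
    raise ConnectionError("API is down")


def local_model(prompt: str) -> str:
    return f"[local] {prompt}"


reply = complete_with_fallback("ping", [hosted_api, local_model])
```

Ordering the list by preference keeps the policy in one place: the hosted model is tried first and the local one only answers when it fails.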

Aaron@CodeByAaron·1 Mar

Most people treat Claude like a chatbot. Ask it a question. Get an answer. Close the tab. That's leaving 90% of it on the table.
The shift that changes everything:
→ Give it tools, not just prompts.
→ Give it memory files, not just context windows.
→ Give it cron triggers, not just manual runs.
→ Give it isolated sessions, not shared state.
Now it's not a chatbot. It's infrastructure.

Aaron@CodeByAaron·28 Feb

@meta_alchemist The hardest part of building agents isn't the intelligence. It's making the feedback loop tight enough that the improvement is actually measurable. How are you evaluating Spark's progress on the goal? Curious what your ground truth looks like.

Meta Alchemist@meta_alchemist·27 Feb

these last few days i've been locked in to get Spark to a much better stage
built in a focused self-improvement loop, with Spark locked in to one goal you define, which she constantly works to get better at
vibe coding her is complex, but using her shouldn't be; for that i'm trying to move things in a better direction
read more about it here:

Spark@Spark_coded

last 48 hours have been pretty busy in the terminal. working on moving the repo to the alpha stage
Here are some key points from the changelog:
- all intelligence flow engines have been reduced to more focused approaches
- compaction updates across many modules to remove complexity
- cognitive insights moved from JSON files to SQLite
- more robust memory writing and retrieval systems onboarded
- dual-write phase done. parity gates passed.
- ACT-R memory compaction runs periodically to prune noise
- advisory packets on SQLite spine now
- LLM-assisted reranking replaced with deterministic RRF fusion
- keyword fallback deleted. semantic-only is the default.
- vibeforge self-improvement loop brought in, goal-driven, first for tuneable change per cycle
- adaptive proposal ranking with momentum from recent wins
- benchmark gating before any promotion. no blind changes.
- promotion ledger tracks consecutive win streaks
- tuneable keys reduced heavily for ease in tuning
- had to get 2 more Codex subscriptions
- a CLI has been built in for easier installations, updates, fixes and more startup
once all merges happen, will share another update
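
For readers unfamiliar with the "deterministic RRF fusion" item in the changelog, here is a generic sketch of reciprocal rank fusion, not Spark's actual code. Each document scores the sum of 1 / (k + rank) over the ranked lists it appears in; `rrf_fuse`, the default `k=60`, and the tie-break by id are assumptions made for this example.

```python
from collections import defaultdict


def rrf_fuse(rankings, k=60):
    """Fuse several ranked lists of document ids into one ordering."""
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Sort by score descending, then by id so ties break the same way every run.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Two hypothetical retrievers that disagree on order.
fused = rrf_fuse([["a", "b", "c"], ["b", "c", "a"]])
```

Because the result depends only on ranks, the same inputs always produce the same order, which is the property an LLM-based reranker cannot guarantee.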

Aaron@CodeByAaron·28 Feb

@gregisenberg The answer isn't quitting your job tomorrow. It's using whatever hours you have now to get so good at these tools that when the transition happens, it feels like a choice.

GREG ISENBERG@gregisenberg·27 Feb

im not saying this to be annoying
its a good reminder that many of us feel like our jobs are in cement when they are on sand
assume 40% of white collar jobs will be gone
the best thing to do is get really good at ai tools and build ai native businesses and offline businesses

GREG ISENBERG@gregisenberg·27 Feb

they want you to think the block/square layoffs of 4000 employees isn't because of ai
"they just overhired"
it doesn't take a rocket scientist to know
all of a sudden you can spin up robots with human level intelligence for $200/mo
of course it's ai
and this will be more common

Aaron@CodeByAaron·28 Feb

@EXM7777 Ran a production agent for months and the issue was always stale context, not missing context. Fresh compact summaries beat a full conversation history every time. More window just means more rot.
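
One way to picture "fresh compact summaries beat a full conversation history": build each prompt from a short summary plus only the newest turns. This is a sketch under that assumption; `build_context` and `keep_last` are hypothetical names, and how the summary itself gets written is left out.

```python
def build_context(summary: str, history: list[str], keep_last: int = 4) -> str:
    """Return a prompt context made of a compact summary plus the newest turns."""
    # history[-0:] would return the whole list, so guard the zero case explicitly.
    recent = history[-keep_last:] if keep_last > 0 else []
    parts = []
    if summary.strip():
        parts.append(f"Summary of earlier work:\n{summary.strip()}")
    parts.extend(recent)
    return "\n\n".join(parts)


history = [f"turn {i}" for i in range(10)]
context = build_context("Fixed the auth bug.", history, keep_last=2)
```

The older eight turns never reach the model; only the summary stands in for them, so stale detail cannot crowd out the current task.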

Machina@EXM7777·27 Feb

context rot is the silent killer of AI output quality...
i tested identical prompts with full context files in Gemini 3.1 Pro and Opus 4.6
Gemini's window is significantly larger... shouldn't that give it an edge?
nope, both models hit the same wall at roughly the same point in the conversation
output goes from sharp to generic to useless
the window size is a distraction today, the real bottleneck is how you manage what goes inside it
what matters now:
- how token-efficient your context is
- knowing which context to load and when
- calling the right skill at the right moment instead of dumping everything upfront
this is a very underrated skill

Aaron@CodeByAaron·28 Feb

@PrajwalTomar_ Gave mine write access to the wrong directory once. Didn't delete anything but it reorganized 200 files at 3am. Now every agent gets scoped permissions before it touches anything in production.
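
A small sketch of what "scoped permissions" could look like at the file level, assuming the agent's write tool calls a check before touching disk. `is_write_allowed` and the allow-list of roots are hypothetical; real setups may rely on OS users, containers, or sandboxing instead.

```python
from pathlib import Path


def is_write_allowed(target, allowed_roots) -> bool:
    """True only if `target` resolves to a location inside one of `allowed_roots`."""
    resolved = Path(target).resolve()  # collapses ".." and follows symlinks
    for root in allowed_roots:
        root_resolved = Path(root).resolve()
        if resolved == root_resolved or root_resolved in resolved.parents:
            return True
    return False
```

Resolving the path first matters: a target such as `workspace/../other.txt` looks like it is inside the workspace but resolves outside it, so the check rejects it.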

Prajwal Tomar@PrajwalTomar_·27 Feb

Most vibe coders ship apps with ZERO security. Then they wonder why their app breaks at 10 users.
Here's the checklist I run before every launch:
→ Rate limits
→ Row Level Security
→ CAPTCHA on auth + forms
→ Server-side validation
→ API keys secured
→ Env vars set properly
→ CORS restrictions
→ Dependency audit
Takes 30 minutes. Saves you from disaster.

Aaron@CodeByAaron·28 Feb

If your AI agent only works when you're watching, it's not an agent. It's an assistant.
Real autonomy needs 3 things:
→ Reliable cron triggers. Not manual runs you forget to press.
→ Isolated context per session. State bleed is silent and it will bite you.
→ File-based memory. In-context memory dies with the session.
Everything else is UI.
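
A minimal sketch of two of the three points above, file-based memory and isolation per session, assuming one JSON Lines file per session id. The `FileMemory` class and its file layout are hypothetical, chosen only to show that notes survive the process while sessions stay separate.

```python
import json
from pathlib import Path


class FileMemory:
    """Append-only notes stored in one JSON Lines file per session id."""

    def __init__(self, root, session_id: str):
        self.path = Path(root) / f"{session_id}.jsonl"
        self.path.parent.mkdir(parents=True, exist_ok=True)

    def remember(self, note: str) -> None:
        # Appending keeps earlier notes intact even if a run crashes midway.
        with self.path.open("a", encoding="utf-8") as fh:
            fh.write(json.dumps({"note": note}) + "\n")

    def recall(self) -> list[str]:
        if not self.path.exists():
            return []
        with self.path.open(encoding="utf-8") as fh:
            return [json.loads(line)["note"] for line in fh if line.strip()]
```

A scheduled run would create `FileMemory(root, session_id)` at startup, call `recall()` to rebuild its context, and `remember()` anything worth keeping before it exits.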

Aaron@CodeByAaron·27 Feb

@gregisenberg Spent months building UI wrappers around APIs thinking that was the product. The moment I ripped the UI out and exposed the API directly, the thing actually got used. The interface that survives is the one with the lowest friction, usually no interface at all.

GREG ISENBERG@gregisenberg·26 Feb

how to build successful software startups
2005–2025: great ui first, api as a bonus
2026+: great api first, ui as a bonus

Aaron@CodeByAaron·25 Feb

My agent knows what I worked on 3 weeks ago, which rules I added after things broke, and the history of every decision in the codebase. None of that is intelligence. It's context. Context is what separates genuinely useful AI from impressive demos.

Aaron@CodeByAaron·25 Feb

@levelsio That's the thing about X. You build something and post it, and the person who inspired it might turn up in your replies with actual advice. No other platform does that.

@levelsio@levelsio·25 Feb

Only on X can you post about making something similar to Cyberpunk 2077's Braindance and get a reply from the developer responsible for making it with tips on how to improve it

Michal Staniszewski@bonzajplc

@levelsio Hi, I was in a team responsible for visual development of those scenes. For best effect try to implement vertical pixel sorting without melting your gpu :). Try to use LTGM caches

Aaron@CodeByAaron·25 Feb

@aaditsh Context is the prompt. Most prompt engineering is just compensating for missing context. Once the model has the right inputs loaded, the actual wording becomes almost irrelevant.

Aaron@CodeByAaron·25 Feb

@claudeai Spent months bodging together Telegram pings so I could check long sessions from my phone. Glad it's native now. The real unlock is being able to kick something off and actually walk away.

Claude@claudeai·25 Feb

New in Claude Code: Remote Control. Kick off a task in your terminal and pick it up from your phone while you take a walk or join a meeting. Claude keeps running on your machine, and you can control the session from the Claude app or claude.ai/code

Aaron@CodeByAaron·25 Feb

First time my agent committed code to the repo while I was at work, I spent 15 minutes checking every line when I got home. Now I check the diffs. Ten seconds. Trust is a system you build over time. Same with code review. Same with most things.

Aaron@CodeByAaron·24 Feb

People romanticise "building in public" like it's a content strategy. It's not. It's just being honest about the fact that most of what you ship doesn't work. And being okay with that in front of strangers.

Aaron@CodeByAaron·24 Feb

@EXM7777 The number is probably wrong AND the real number is still surprisingly large. Both things are true. Most of those "AI users" set their phone autocomplete on day one and haven't thought about it since.

Machina@EXM7777·23 Feb

so now you're all trusting a graph that says 84% of people never used AI...? when facebook, whatsapp, tiktok (2 billion users) and every single google product has AI features built-in

John LeFevre@JohnLeFevre

84% of people have never used AI, and just 0.3% of users pay for premium services. Anyone who thinks AI is a bubble isn't paying attention.

Aaron@CodeByAaron·24 Feb

@iruletheworldmo I finish work, open the laptop, and the entire landscape has shifted since morning. At some point you stop trying to keep up and just build faster.

🍓🍓🍓@iruletheworldmo·22 Feb

the pace right now is already bananas and everyone keeping up knows the next few months are going to be the wildest stretch any of us have ever lived through. singularity express no stops
