Ben Doessel

5.8K posts

Ben Doessel banner
Ben Doessel

Ben Doessel

@bendoessel

Katılım Ağustos 2010
4.3K Takip Edilen2.3K Takipçiler
Ben Doessel retweetledi
0xMarioNawfal
0xMarioNawfal@RoundtableSpace·
HERMES AGENT OS JUST DROPPED AND LETS YOU RUN HERMES, CLAUDE, CODEX, AND OPENCLAW FROM ONE DASHBOARD WITH SHARED MEMORY. 12 agent profiles, built-in studio, and SEO tools deploying ranked content to 5 websites simultaneously.
English
21
15
250
17.5K
Ben Doessel retweetledi
Argona
Argona@Argona0x·
a 28-year-old in Berlin runs a 7-agent software factory off a remote server she approves checkpoints from her phone at midnight; her workspace has no desk, no city, no fixed machine - any screen is just a terminal she quoted $28,000 for a scope a local agency priced at $74,000 and told the client 'minimum 6 weeks.' the agents shipped a validated PR in 19 hours the agency was still revising their proposal i've been running a version of this for the past few months. the setup sounds absurd until the first time it works, and then you can't go back the factory lives on a remote VPS - always on, eight tmux panes, already mid-session. ssh in from whatever screen is nearby: laptop at home, phone on a train, tablet at a café at 1am. the environment never moves. you're just a terminal window connecting to something that was already running agencies price the way they do because their overhead is structural. a $74,000 quote on a 6-week scope is real math: account managers, a senior dev who gets rotated to a bigger client by week three, revision cycles that exist because context lives in fourteen slack threads instead of one file the factory collapses all of that into a CLAUDE.md → a 100-line markdown file at the repo root loads the entire project into every new agent session - stack, architectural rules, banned patterns - so no session starts blind and no context drifts between runs → agent one is read-only: maps the existing codebase, documents patterns, flags risks before any agent touches the code → agent two writes the user story and acceptance criteria, locking the exact definition of done before engineering starts → agent three produces the technical brief: data model changes, API shapes, a precise list of every file that will move - this locks before any builder runs → backend and frontend build in parallel but in isolation, each scoped to its own directories, so they can't reach across and corrupt each other's work → agent six writes acceptance tests against the original user story criteria before the implementation is considered complete → agent seven runs a final read-only audit: missing auth, tenant isolation gaps, any deviation from the brief gets flagged back into the loop before the pr is cut → three checkpoints pause the entire chain for human approval - story, spec, pre-merge - each one a 30-second phone tap when the upstream work was done right the 19 hours is the output. what compressed was everything underneath: the pm relaying a question to an engineer who responds two days later, the architectural mistake that only surfaces after code is written, the context drift between sessions because the memory layer is a human brain instead of a file that loads before anything runs the loop closes itself. validator flags a gap, builder fixes it, verifier confirms, pr is clean by morning the agency sent their revised proposal at 9am. the pr had been merged for 14 hours she approved the final checkpoint at midnight, 30 seconds on her phone. the agents were already done the desk, the office, the fixed machine - she left them out
Rahul@sairahul1

x.com/i/article/2058…

English
68
227
2.1K
629.2K
Ben Doessel retweetledi
Peter Steinberger 🦞
autoreview is the most impactful skill I've added to my stack (next to crabbox.sh). It automatically reviews your code before landing a PR. Finds so many edge cases. Sometimes it runs for hours. github.com/openclaw/agent…
English
51
57
1.3K
91.4K
Ben Doessel retweetledi
Julian Goldie SEO
Julian Goldie SEO@JulianGoldieSEO·
𝐇𝐞𝐫𝐦𝐞𝐬 𝐀𝐠𝐞𝐧𝐭 𝐎𝐒 𝐉𝐮𝐬𝐭 𝐃𝐫𝐨𝐩𝐩𝐞𝐝. You can now run every AI agent from one mission control dashboard. 🤯 Stop juggling Hermes, Claude, OpenClaw, Codex, and Anti-gravity in different tabs. → One screen runs every agent off the same shared memory → 12 agent profiles ready to assign tasks across one Kanban board → Studio generates videos, podcasts, and images in one workspace → SEO tool deploys ranked content to 5 websites at the same time → Plug in Grok 4.3 OAuth from your Twitter account with no extra cost → Obsidian memory stores every voice note, idea, and client context This guy literally watched Hermes pull voice notes into Obsidian every 4 hours on its own. Save this, you'll want it later.
English
9
23
189
11.6K
Ben Doessel retweetledi
Alexander Chen
Alexander Chen@alexanderchen·
Gemini Omni 🕶️ prompt in 🧵
Indonesia
40
62
894
101.6K
Ben Doessel retweetledi
Ben Doessel retweetledi
Julian Goldie SEO
Julian Goldie SEO@JulianGoldieSEO·
Nobody is talking about how crazy Google’s new AI stack actually is. Project Genie → builds explorable 3D worlds from prompts Gemini Omni → edits videos by talking AI Studio Build Mode → builds apps from plain English Google Stitch → designs websites with voice Google Pix → makes marketing graphics in minutes Ask YouTube → searches videos and jumps to exact answers This is Google quietly replacing entire workflows.
English
4
8
29
1.6K
Ben Doessel retweetledi
m0h
m0h@exploraX_·
jensen huang, the CEO of nvidia said: “open-source models are now the lifeblood of startups across different industries.” emphasizing why open-source models are so important and how useful they’ve become for everyone. in the article below, I’ve put together a list of 100 free open-source AI tools that are incredibly useful. ranging from everyday productivity tasks to advanced agentic coding workflows.
m0h@exploraX_

x.com/i/article/2058…

English
14
27
101
7.7K
Ben Doessel retweetledi
Corey Ganim
Corey Ganim@coreyganim·
Someone is going to make $1M profit in 2026 selling ONE AI agent. It's called a speed-to-lead agent. Here's the play: The agent watches a business's inbound lead channels (website forms, email, SMS) and responds to every new lead in under 5 minutes with a personalized, contextual reply. Not "thanks, we got your message, we'll be in touch in 24 hours." But a real response that understands why they reached out and addresses it specifically. Why this matters: MIT ran a study and found you're 21x more likely to qualify a lead when you respond in under 5 minutes vs 30+ minutes. Every business with inbound leads is bleeding revenue right now. This is one of the few agents where you can immediately quantify the ROI. It puts cash in the client's pocket on day one. How to find clients: Call local service businesses. Submit forms on their websites. Email them. Track how long they take to respond. Most will take 24 to 48 hours. Some will never reply. That's your sales pitch: "I requested a quote from your business 3 days ago. Imagine how many leads you're losing because your response time is 3 days. Let's build you a speed-to-lead agent." Pricing: - $5K to $10K setup fee - $500 to $1K per month to maintain it Land 20 clients = $10-$20K MRR + $100-$200K in setup fees. Land 50 clients = $25-$50K MRR + $250-$500K in setup fees. This is the best AI business model right now because it ties directly to revenue. The ROI is dead simple to quantify. Go build it.
Corey Ganim tweet media
English
33
21
333
23.8K
Ben Doessel retweetledi
Ben Doessel retweetledi
Ryan Carson
Ryan Carson@ryancarson·
How to use /goal in Codex to ship huge features
English
34
12
368
22.4K
Ben Doessel retweetledi
CyrilXBT
CyrilXBT@cyrilXBT·
DROPSHIPPING 2016: Unlimited opportunity. Almost nobody doing it. First movers built empires. DROPSHIPPING 2019: Saturated. Margins gone. Too late. CLIPPING 2026: Unlimited opportunity. Almost nobody doing it systematically. First movers building empires right now. We are absurdly early to this.
Vadim@VadimStrizheus

x.com/i/article/2058…

English
47
117
1.1K
235.6K
Ben Doessel retweetledi
Garry Tan
Garry Tan@garrytan·
This sounds complicated but the agents can implement this in OpenClaw/Hermes Agent trivially (use skillify from GBrain with a link to this tweet) Sounds ridiculous but you should try it
Muratcan Koylan@koylanai

Gradient descent for SKILL.md files sounds interesting, maybe a bit complex but it's becoming a real part of agent harness. SkillOpt is one of the first papers to treat markdown skill files as trainable parameters and provides a proper optimization framework for them. A few things I learned that you should consider too. 1. The validation gate is the only thing that matters in a self-editing loop. Held-out set, strict improvement, ties rejected. End-to-end, their best skills land with 1 to 4 accepted edits total. If your "self-improving agent" is accepting most of what it proposes, you're shipping slop. 2. Bounded edits are better than full rewrites. 4 to 8 edits per step is the sweet spot. Remove the budget and performance collapses. This is the textual analog of learning rate, and it transfers to any LLM-as-author loop. If you're using an agent to refactor your docs, your prompts, or your skills, cap the diff size. 3. Compactness wins. Median final skill: ~920 tokens. Skills do not need to be long. They need to be high-signal. Most skill files I see are bloated because length feels like effort. It isn't. 4. The harness is becoming less important; the skill is becoming more important. A Codex-trained skill ported into Claude Code hit +59.7 points on SpreadsheetBench. Procedural knowledge is more general than the runtime that produced it. 5. Frozen model + trained context is the practical adaptation. GPT-5.4-nano with a SkillOpt'd skill ≈ frontier behavior on procedural benchmarks. Cheaper, portable, inspectable, zero inference-time cost. This is the answer to "how do we adapt a frontier model for our domain" for almost everyone who isn't training their own models. 6. Verification is the bottleneck. Every gate in this paper depends on an auto-grader. That works for benchmarks. It fails for writing, design, and strategy, exactly the open-ended work we want to automate. Whoever builds the verifier for open-ended tasks owns the next stage. There are also two leassons I learned while shipping v2.3.0 of my Context Engineering Agent Skills repo, measured across composer-2, claude-opus-4-7, gpt-5.5, and gemini-3.1-pro via the @cursor_ai SDK: - Description and body are two different surfaces. The router only sees the description. The agent sees the body once activated. They can quietly disagree, and only end-to-end task tests catch it. - Aggregate accuracy is the wrong unit. When I rewrote three descriptions, the corpus average moved ~1pp. Individual skills moved 23–25pp. Per-skill effect size is where the action is. Also, in Feb 2026 I shared a piece called Personal Brain OS arguing that the markdown file is a first-class substrate for agent state. SkillOpt is the optimizer-shaped version of that same argument: not "store memory in files" but "treat files as trainable parameters with proper optimization machinery around them." That's the move from static to measured. The fast/slow split they describe already lives implicitly in the digital-brain-skill repo: - voice-guide and tone-of-voice.md are slow-state (rarely touched) - posts.jsonl and bookmarks.jsonl are fast-state What SkillOpt adds that I didn't have is a protected section invariant, a structural guarantee that fast edits cannot overwrite slow lessons. Removing that mechanism cost them 22 points on SpreadsheetBench. Worth borrowing. If you're building agents, SkillOpt: Executive Strategy for Self-Evolving Agent Skills is a good paper to read: arxiv.org/pdf/2605.23904

English
45
153
1.8K
277.7K
Ben Doessel retweetledi
淘沙者(TheSandPicker)
这段马斯克 2003 年在斯坦福的 45 分钟闭门演讲,真的有点东西。 不是那种空话连篇的创业鸡汤。 而是他亲自把“怎么从 0 搞出一家公司”一层层拆给你看。 更狠的是, 他没在讲概念,直接把自己当时手里那三家公司,到底怎么活下来的,整个过程都摊开了。 这种内容放到今天看都不过时。 很多人听完的反应都很统一: 这 45 分钟,顶得上你翻十本创业书。 那种真正能用的干货,往往就藏在这种老素材里,不吵不闹,但句句都是实战。
中文
171
2.9K
15.6K
993.3K
Ben Doessel retweetledi
Rahul
Rahul@sairahul1·
Anthropic just paid millions to hire Andrej Karpathy. He gave you the same knowledge for $0 the same week. Co-founder of OpenAI. Former head of AI at Tesla. The man who coined vibe coding. No recruitment fee. No exclusive access. Just a link and 29 minutes. LLMs are ghosts not animals. Vibe coding is dead. Software 3.0 is here. Watch it. Then read this. Because Karpathy tells you what Software 3.0 is. This shows you how to build one - a software factory with Claude Code that ships features while you sleep. The full build guide is below.
Rahul@sairahul1

x.com/i/article/2058…

English
63
559
4.2K
1.1M
Ben Doessel retweetledi
The Startup Ideas Podcast (SIP) 🧃
The guy behind Google's Antigravity thought a one-person billion-dollar company was impossible. Then he changed his mind. His argument: if you're an ideas guy, and the next guy has the same AI, what's your alpha? Then he looked at games. The best games aren't the ones with the best graphics. Nintendo proves it. - It's the mechanics. - The story. A few things done perfectly. So the alpha was never the studio of 100 people. It's the idea. AI just removed everything between you and shipping it. One person. Billion dollars. Increasingly possible.
English
13
15
112
11.9K
Ben Doessel retweetledi
Vox
Vox@Voxyz_ai·
>dude runs a clipping page solo. one creator in, 28 posts a day across 4 platforms. >every morning hermes pulls a long video, hands it to vugola, grabs the clips, ships to postiz. >vugola burns captions in. postiz hits ig/tt/x/yt from one dashboard. you don't open a single app. >every night view counts feed back. wild clips hit 12k, calm clips die at 500. tomorrow you cut sharper. >28 posts ship daily. 4m views/month is the ceiling, not week 1. no filming, no face, no brand. 📺
Vadim@VadimStrizheus

x.com/i/article/2058…

English
11
17
122
17.4K
Ben Doessel retweetledi
Shiv
Shiv@shivsakhuja·
Claude Code + @higgsfield_ai is insane.
Himanshu Bamoria@0xhbam

Most AI-generated brand videos still feel generic. So I tested if Claude Code/Codex + Higgsfield AI could create a brand-grounded content factory. Turns out, yes. Here's the workflow I used to create this video for @touchland : 1. Research the brand, products, ICP, positioning, and visual style 2. Turn it into a brand bible 3. Generate video concepts for different customer segments 4. Pull real product images from the website 5. Preprocess them with Nano Banana 2 6. Write a storyboard with consistent scenes/props 7 .Send assets + storyboard to Higgsfield 8. Have Claude self-QC frames and iterate The bigger opportunity: Brands can run this across hundreds/thousands of SKUs and generate product videos grounded in real product info, brand assets, and customer segments. I turned this into a reusable Skill. Let me know if you want access.

Português
5
3
71
29.4K