Ben Doessel

5.8K posts

Ben Doessel

@bendoessel

Katılım Ağustos 2010

4.3K Takip Edilen2.3K Takipçiler

Sabitlenmiş Tweet

Ben Doessel@bendoessel·7 Şub

The best way to show off Grok's imagination? Let my 4-year-old son direct it. Created with Grok Imagine 1.0

Creators@XCreators

You thought the fun was over? 🏈 This weekend, video takes center stage on the timeline. We’re awarding $1M, $500K, and $250K to the top three videos about @grok, created with Imagine 1.0.

English

1.3K

Ben Doessel retweetledi

0xMarioNawfal@RoundtableSpace·12h

HERMES AGENT OS JUST DROPPED AND LETS YOU RUN HERMES, CLAUDE, CODEX, AND OPENCLAW FROM ONE DASHBOARD WITH SHARED MEMORY. 12 agent profiles, built-in studio, and SEO tools deploying ranked content to 5 websites simultaneously.

English

250

17.5K

Ben Doessel retweetledi

Argona@Argona0x·1d

a 28-year-old in Berlin runs a 7-agent software factory off a remote server she approves checkpoints from her phone at midnight; her workspace has no desk, no city, no fixed machine - any screen is just a terminal she quoted $28,000 for a scope a local agency priced at $74,000 and told the client 'minimum 6 weeks.' the agents shipped a validated PR in 19 hours the agency was still revising their proposal i've been running a version of this for the past few months. the setup sounds absurd until the first time it works, and then you can't go back the factory lives on a remote VPS - always on, eight tmux panes, already mid-session. ssh in from whatever screen is nearby: laptop at home, phone on a train, tablet at a café at 1am. the environment never moves. you're just a terminal window connecting to something that was already running agencies price the way they do because their overhead is structural. a $74,000 quote on a 6-week scope is real math: account managers, a senior dev who gets rotated to a bigger client by week three, revision cycles that exist because context lives in fourteen slack threads instead of one file the factory collapses all of that into a CLAUDE.md → a 100-line markdown file at the repo root loads the entire project into every new agent session - stack, architectural rules, banned patterns - so no session starts blind and no context drifts between runs → agent one is read-only: maps the existing codebase, documents patterns, flags risks before any agent touches the code → agent two writes the user story and acceptance criteria, locking the exact definition of done before engineering starts → agent three produces the technical brief: data model changes, API shapes, a precise list of every file that will move - this locks before any builder runs → backend and frontend build in parallel but in isolation, each scoped to its own directories, so they can't reach across and corrupt each other's work → agent six writes acceptance tests against the original user story criteria before the implementation is considered complete → agent seven runs a final read-only audit: missing auth, tenant isolation gaps, any deviation from the brief gets flagged back into the loop before the pr is cut → three checkpoints pause the entire chain for human approval - story, spec, pre-merge - each one a 30-second phone tap when the upstream work was done right the 19 hours is the output. what compressed was everything underneath: the pm relaying a question to an engineer who responds two days later, the architectural mistake that only surfaces after code is written, the context drift between sessions because the memory layer is a human brain instead of a file that loads before anything runs the loop closes itself. validator flags a gap, builder fixes it, verifier confirms, pr is clean by morning the agency sent their revised proposal at 9am. the pr had been merged for 14 hours she approved the final checkpoint at midnight, 30 seconds on her phone. the agents were already done the desk, the office, the fixed machine - she left them out

Rahul@sairahul1

x.com/i/article/2058…

English

227

2.1K

629.2K

Ben Doessel retweetledi

Peter Steinberger 🦞@steipete·10h

autoreview is the most impactful skill I've added to my stack (next to crabbox.sh). It automatically reviews your code before landing a PR. Finds so many edge cases. Sometimes it runs for hours. github.com/openclaw/agent…

English

1.3K

91.4K

Ben Doessel retweetledi

Julian Goldie SEO@JulianGoldieSEO·1d

𝐇𝐞𝐫𝐦𝐞𝐬 𝐀𝐠𝐞𝐧𝐭 𝐎𝐒 𝐉𝐮𝐬𝐭 𝐃𝐫𝐨𝐩𝐩𝐞𝐝. You can now run every AI agent from one mission control dashboard. 🤯 Stop juggling Hermes, Claude, OpenClaw, Codex, and Anti-gravity in different tabs. → One screen runs every agent off the same shared memory → 12 agent profiles ready to assign tasks across one Kanban board → Studio generates videos, podcasts, and images in one workspace → SEO tool deploys ranked content to 5 websites at the same time → Plug in Grok 4.3 OAuth from your Twitter account with no extra cost → Obsidian memory stores every voice note, idea, and client context This guy literally watched Hermes pull voice notes into Obsidian every 4 hours on its own. Save this, you'll want it later.

English

189

11.6K

Ben Doessel retweetledi

Google AI@GoogleAI·14h

x.com/i/article/2059…

ZXX

171

1.1K

94K

Ben Doessel retweetledi

Alexander Chen@alexanderchen·1d

Gemini Omni 🕶️ prompt in 🧵

Indonesia

894

101.6K

Ben Doessel retweetledi

Garry Tan@garrytan·1d

Ultimately the golden age of abundance will be this kind of tech built and deployed 1000x

Afshine Emrani MD FACC@afshineemrani

1/5 I'm a cardiologist. I have spent twenty years watching cholesterol destroy arteries, trigger heart attacks, and kill people I care about. Today, Eli Lilly presented data that may begin to end that era. VERVE-102. A single infusion. One dose. It uses base editing to permanently turn off the PCSK9 gene in your liver. Presented today at the European Atherosclerosis Society Congress: 88% reduction in PCSK9. 62% reduction in LDL cholesterol. Sustained up to 18 months. No treatment-related serious adverse events. One infusion. Not daily pills you forget to take. Not monthly injections. One dose — and your cholesterol may stay low for the rest of your life.

English

113

2.3K

690.8K

Ben Doessel retweetledi

Peter Steinberger 🦞@steipete·1d

@Dimillian yo u need crabbox.sh - it includes a parallels adapter + checkpoints for macOS now.

English

263

34K

Ben Doessel retweetledi

Julian Goldie SEO@JulianGoldieSEO·1d

Nobody is talking about how crazy Google’s new AI stack actually is. Project Genie → builds explorable 3D worlds from prompts Gemini Omni → edits videos by talking AI Studio Build Mode → builds apps from plain English Google Stitch → designs websites with voice Google Pix → makes marketing graphics in minutes Ask YouTube → searches videos and jumps to exact answers This is Google quietly replacing entire workflows.

English

1.6K

Ben Doessel retweetledi

m0h@exploraX_·1d

jensen huang, the CEO of nvidia said: “open-source models are now the lifeblood of startups across different industries.” emphasizing why open-source models are so important and how useful they’ve become for everyone. in the article below, I’ve put together a list of 100 free open-source AI tools that are incredibly useful. ranging from everyday productivity tasks to advanced agentic coding workflows.

m0h@exploraX_

x.com/i/article/2058…

English

101

7.7K

Ben Doessel retweetledi

Corey Ganim@coreyganim·1d

Someone is going to make $1M profit in 2026 selling ONE AI agent. It's called a speed-to-lead agent. Here's the play: The agent watches a business's inbound lead channels (website forms, email, SMS) and responds to every new lead in under 5 minutes with a personalized, contextual reply. Not "thanks, we got your message, we'll be in touch in 24 hours." But a real response that understands why they reached out and addresses it specifically. Why this matters: MIT ran a study and found you're 21x more likely to qualify a lead when you respond in under 5 minutes vs 30+ minutes. Every business with inbound leads is bleeding revenue right now. This is one of the few agents where you can immediately quantify the ROI. It puts cash in the client's pocket on day one. How to find clients: Call local service businesses. Submit forms on their websites. Email them. Track how long they take to respond. Most will take 24 to 48 hours. Some will never reply. That's your sales pitch: "I requested a quote from your business 3 days ago. Imagine how many leads you're losing because your response time is 3 days. Let's build you a speed-to-lead agent." Pricing: - $5K to $10K setup fee - $500 to $1K per month to maintain it Land 20 clients = $10-$20K MRR + $100-$200K in setup fees. Land 50 clients = $25-$50K MRR + $250-$500K in setup fees. This is the best AI business model right now because it ties directly to revenue. The ROI is dead simple to quantify. Go build it.

English

333

23.8K

Ben Doessel retweetledi

Mario Nawfal@MarioNawfal·1d

A coder turned everyday video into an explorable 3D environment with AI-driven 4D Gaussian Splatting. No fancy gear needed. Just feed in a clip, and the system builds a full three-dimensional scene you can move through.

Mario Nawfal@MarioNawfal

🇺🇸 A robot just worked an 8-hour shift. Fully autonomous. Sorted packages at human speed and then diagnosed its own maintenance issues Figure AI's Helix-02 replaced 109,000 lines of code with a neural network trained on human movement. One system. Legs, arms, fingers, eyes. All running together like a person. Get this: in a warehouse in America, a robot just clocked out after a full shift. And immediately clocked back in because it doesn't sleep.

English

45.4K

Ben Doessel retweetledi

Ryan Carson@ryancarson·1d

How to use /goal in Codex to ship huge features

English

368

22.4K

Ben Doessel retweetledi

CyrilXBT@cyrilXBT·1d

DROPSHIPPING 2016: Unlimited opportunity. Almost nobody doing it. First movers built empires. DROPSHIPPING 2019: Saturated. Margins gone. Too late. CLIPPING 2026: Unlimited opportunity. Almost nobody doing it systematically. First movers building empires right now. We are absurdly early to this.

Vadim@VadimStrizheus

x.com/i/article/2058…

English

117

1.1K

235.6K

Ben Doessel retweetledi

Garry Tan@garrytan·1d

This sounds complicated but the agents can implement this in OpenClaw/Hermes Agent trivially (use skillify from GBrain with a link to this tweet) Sounds ridiculous but you should try it

Muratcan Koylan@koylanai

Gradient descent for SKILL.md files sounds interesting, maybe a bit complex but it's becoming a real part of agent harness. SkillOpt is one of the first papers to treat markdown skill files as trainable parameters and provides a proper optimization framework for them. A few things I learned that you should consider too. 1. The validation gate is the only thing that matters in a self-editing loop. Held-out set, strict improvement, ties rejected. End-to-end, their best skills land with 1 to 4 accepted edits total. If your "self-improving agent" is accepting most of what it proposes, you're shipping slop. 2. Bounded edits are better than full rewrites. 4 to 8 edits per step is the sweet spot. Remove the budget and performance collapses. This is the textual analog of learning rate, and it transfers to any LLM-as-author loop. If you're using an agent to refactor your docs, your prompts, or your skills, cap the diff size. 3. Compactness wins. Median final skill: ~920 tokens. Skills do not need to be long. They need to be high-signal. Most skill files I see are bloated because length feels like effort. It isn't. 4. The harness is becoming less important; the skill is becoming more important. A Codex-trained skill ported into Claude Code hit +59.7 points on SpreadsheetBench. Procedural knowledge is more general than the runtime that produced it. 5. Frozen model + trained context is the practical adaptation. GPT-5.4-nano with a SkillOpt'd skill ≈ frontier behavior on procedural benchmarks. Cheaper, portable, inspectable, zero inference-time cost. This is the answer to "how do we adapt a frontier model for our domain" for almost everyone who isn't training their own models. 6. Verification is the bottleneck. Every gate in this paper depends on an auto-grader. That works for benchmarks. It fails for writing, design, and strategy, exactly the open-ended work we want to automate. Whoever builds the verifier for open-ended tasks owns the next stage. There are also two leassons I learned while shipping v2.3.0 of my Context Engineering Agent Skills repo, measured across composer-2, claude-opus-4-7, gpt-5.5, and gemini-3.1-pro via the @cursor_ai SDK: - Description and body are two different surfaces. The router only sees the description. The agent sees the body once activated. They can quietly disagree, and only end-to-end task tests catch it. - Aggregate accuracy is the wrong unit. When I rewrote three descriptions, the corpus average moved ~1pp. Individual skills moved 23–25pp. Per-skill effect size is where the action is. Also, in Feb 2026 I shared a piece called Personal Brain OS arguing that the markdown file is a first-class substrate for agent state. SkillOpt is the optimizer-shaped version of that same argument: not "store memory in files" but "treat files as trainable parameters with proper optimization machinery around them." That's the move from static to measured. The fast/slow split they describe already lives implicitly in the digital-brain-skill repo: - voice-guide and tone-of-voice.md are slow-state (rarely touched) - posts.jsonl and bookmarks.jsonl are fast-state What SkillOpt adds that I didn't have is a protected section invariant, a structural guarantee that fast edits cannot overwrite slow lessons. Removing that mechanism cost them 22 points on SpreadsheetBench. Worth borrowing. If you're building agents, SkillOpt: Executive Strategy for Self-Evolving Agent Skills is a good paper to read: arxiv.org/pdf/2605.23904

English

153

1.8K

277.7K

Ben Doessel retweetledi

淘沙者(TheSandPicker)@Etudecn·1d

这段马斯克 2003 年在斯坦福的 45 分钟闭门演讲，真的有点东西。不是那种空话连篇的创业鸡汤。而是他亲自把“怎么从 0 搞出一家公司”一层层拆给你看。更狠的是，他没在讲概念，直接把自己当时手里那三家公司，到底怎么活下来的，整个过程都摊开了。这种内容放到今天看都不过时。很多人听完的反应都很统一：这 45 分钟，顶得上你翻十本创业书。那种真正能用的干货，往往就藏在这种老素材里，不吵不闹，但句句都是实战。

中文

171

2.9K

15.6K

993.3K

Ben Doessel retweetledi

Rahul@sairahul1·1d

Anthropic just paid millions to hire Andrej Karpathy. He gave you the same knowledge for $0 the same week. Co-founder of OpenAI. Former head of AI at Tesla. The man who coined vibe coding. No recruitment fee. No exclusive access. Just a link and 29 minutes. LLMs are ghosts not animals. Vibe coding is dead. Software 3.0 is here. Watch it. Then read this. Because Karpathy tells you what Software 3.0 is. This shows you how to build one - a software factory with Claude Code that ships features while you sleep. The full build guide is below.

Rahul@sairahul1

x.com/i/article/2058…

English

559

4.2K

1.1M

Ben Doessel retweetledi

The Startup Ideas Podcast (SIP) 🧃@startupideaspod·1d

The guy behind Google's Antigravity thought a one-person billion-dollar company was impossible. Then he changed his mind. His argument: if you're an ideas guy, and the next guy has the same AI, what's your alpha? Then he looked at games. The best games aren't the ones with the best graphics. Nintendo proves it. - It's the mechanics. - The story. A few things done perfectly. So the alpha was never the studio of 100 people. It's the idea. AI just removed everything between you and shipping it. One person. Billion dollars. Increasingly possible.

English

112

11.9K

Ben Doessel retweetledi

Vox@Voxyz_ai·1d

>dude runs a clipping page solo. one creator in, 28 posts a day across 4 platforms. >every morning hermes pulls a long video, hands it to vugola, grabs the clips, ships to postiz. >vugola burns captions in. postiz hits ig/tt/x/yt from one dashboard. you don't open a single app. >every night view counts feed back. wild clips hit 12k, calm clips die at 500. tomorrow you cut sharper. >28 posts ship daily. 4m views/month is the ceiling, not week 1. no filming, no face, no brand. 📺

Vadim@VadimStrizheus

x.com/i/article/2058…

English

122

17.4K

Ben Doessel retweetledi

Shiv@shivsakhuja·1d

Claude Code + @higgsfield_ai is insane.

Himanshu Bamoria@0xhbam

Most AI-generated brand videos still feel generic. So I tested if Claude Code/Codex + Higgsfield AI could create a brand-grounded content factory. Turns out, yes. Here's the workflow I used to create this video for @touchland : 1. Research the brand, products, ICP, positioning, and visual style 2. Turn it into a brand bible 3. Generate video concepts for different customer segments 4. Pull real product images from the website 5. Preprocess them with Nano Banana 2 6. Write a storyboard with consistent scenes/props 7 .Send assets + storyboard to Higgsfield 8. Have Claude self-QC frames and iterate The bigger opportunity: Brands can run this across hundreds/thousands of SKUs and generate product videos grounded in real product info, brand assets, and customer segments. I turned this into a reusable Skill. Let me know if you want access.

Português

29.4K

Keşfet

@Dimillian @higgsfield_ai @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA