Allan

327 posts

Allan

@Allan

★★★★☆

New York, NY Katılım Mart 2007

813 Takip Edilen5.8K Takipçiler

Sabitlenmiş Tweet

Allan@Allan·5 Haz

The future is bright.

English

26.5K

Allan@Allan·19h

@dmcgco I have a full on terminal with the most gratuitous ascii art if you just sit on my vanity site for 10 seconds. We absolutely can not have ascii fatigue yet.

English

1.7K

David McGillivray@dmcgco·21h

Ok ok we’ve had our fun now but the ASCII and halftone effects have to stop.

English

291

22.3K

Allan@Allan·3 Mar

@johnpalmer Mine is a SUV sized robot dog that you can sit on top of and it’ll ride you around town and stuff.

English

182

John Palmer@johnpalmer·3 Mar

my billion dollar hardware idea is a NEO robot but it’s six inches tall and just hangs out on your desk

English

5.9K

Allan@Allan·28 Şub

@johnpalmer Damn

English

141

John Palmer@johnpalmer·28 Şub

New logo for Mesh, a laser company.

Area@areatechnology_

Logo design for Mesh, a new laser company. Additional product design across optical transceiver housing and pull tab. Documentation coming soon.

English

69.3K

Allan@Allan·13 Şub

@attacless @usgraphics You, yes. I mostly infantilize products so I'll be stuck paying for Berkley Mono.

English

attac@attacless·13 Şub

@Allan @usgraphics so did we pass?

English

attac@attacless·13 Şub

ping just passed the vibe check @usgraphics use berkley mono for our next update?

English

2.5K

Allan@Allan·9 Şub

@dmcgco Same issue.

English

186

David McGillivray@dmcgco·9 Şub

There must be a simpler way to manage/setup multiple (5+) email addresses from different domains vs hooking up a bunch of separate google/biz accounts? I have a bunch of different addresses from different ventures and I feel like I'm missing a trick here.

English

4.6K

Allan@Allan·9 Şub

@mschoening A small novelty but I gave mine access to a receipt printer. I now get a printout in the morning with my schedule and some todos.

English

Max Schoening@mschoening·8 Şub

Here are tasks I want to get done: - Kick off coding agents on real codebase (there are 4000 services that do this) - Read my email, draft replies, archive BS - Reply to Slack messages - Help me schedule things and make reservations - Write me little research reports on topics I care about - Grocery shopping - Organize my digital life - Cancel dumb subscriptions - Manage my personal finances and pay bills - Renegotiate contracts

English

1.2K

Max Schoening@mschoening·8 Şub

What is the most useful thing you’ve seen an OpenClaw do? I love the tinkering. But, what does it actually do for you?

English

2.4K

Allan@Allan·7 Şub

@JPEGuin FWIW, I'll buy it

English

211

Shihab Mehboob@JPEGuin·7 Şub

Fool me once…

Developers@XDevelopers

Officially launching X API Pay-Per-Use The core of X developers are indie builders, early stage products, startups, and hobbyists It’s time to open up our X API ecosystem and instill a new wave of next generation X apps We’re so back. developer.x.com

English

6.4K

Allan retweetledi

David@dayonefoundry·5 Şub

I'm scared to launch my new iOS app. I'm in my happy place right now coding. I know as soon as I hit publish, I have to start shaking my ass on tiktok for downloads.

English

249

1.9K

112.6K

Allan@Allan·4 Şub

@max_creating Woah, super impressive! I love this! We should race our agents!

English

Zwille@zwiebelhelm·4 Şub

@Allan Look at this. I think my approach is even faster / turbo x.com/max_creating/s…

Zwille@zwiebelhelm

This is Chai Computer. A Computer Use Agent, that actually works and is FAST. I am NOT joking, this is 7x faster than Vy by Vercept, and they raised 16 mil $. It is a BEAST. I developed a completely new approach to this kind of AI. See yourself...

English

Allan@Allan·4 Şub

@brycedriesenga @ZainMerchant9 @westoque That's probably a natural place to end up. It's also very tempting to fall back on AppleScript or things that aren't keyboard/mouse. Once it's both extremely competent and fast with input designed for humans, that'd be the idea.

English

Bryce Driesenga@brycedriesenga·4 Şub

@Allan @ZainMerchant9 @westoque I wonder if it's possible for it to tap in to app intents/scripts/shortcuts and default to those when possible for speed, but fall back to vision?

English

Allan@Allan·4 Şub

@LarryVelez Porsche AG had a very rough year financially so maybe instead a new AI agent division via acquisition of some idiot's pet project is in order.

English

Larry Velez@LarryVelez·4 Şub

@Allan Porsche's IP lawyers are aggressive, so start working on another logo.

English

Allan@Allan·4 Şub

@louis030195 I've seen but never tried. Looks impressive and I wouldn't be surprised if it's quite good. But perhaps because I'm twice as lazy and half as clever, it's a "just works" solution. There's no chat with the agent, nor will it execute code it writes on the fly.

English

louis030195@louis030195·4 Şub

@Allan did u try openinterpreter?

English

125

Allan@Allan·4 Şub

@FaithfulFirst That’s very roughly how it works now, although speed and ability of the model is driving which is used. Turbo uses two small local models.

English

172

Jason Of Damascus ☦️@FaithfulFirst·4 Şub

@Allan Super cool, have you tried to mix models. A local one for regular fps and an event driven of a stronger more token/$ for bigger things?

English

206

Allan@Allan·4 Şub

Yes! This is what it does! Every run it updates a small SQLite database for each application with Icons/UI, Task Sequences (small sequences that can be replayed), and recipes (action patterns). In theory it should get smarter every time and I could share my "skills" with you and speed up your Turbo agent, if needed.

English

Zain Merchant@ZainMerchant9·4 Şub

Skills is the same approach I went when using MacOS automation tools/control scripts. It really is the best approach I’ve found for making sure the agent knows/has a reference guide for whatever app/workflow it’s trying to perform. Add an agent that creates new skills based on user interactions and you got a self improving system right there

English

Allan@Allan·4 Şub

@KalraIshaan11 It started as fixed-tick and worked, but it was very token hungry. I switched to a reactive / event-driven loop. That doesn’t rule out continuous perception or delta tracking though. Just not built yet. Likely a can of worms but might be important.

English

242

Ishaan Kalra@KalraIshaan11·4 Şub

@Allan Hey Allan, this is awesome. Quick question: how frequently does Turbo sample the screen (fixed FPS vs event-driven vs adaptive)? Also, have you thought about a “always-on” background mode that can persist without disrupting the user’s workflow?

English

251

Allan@Allan·4 Şub

Agree on speed. Turbo’s architecture is optimized around fast inference and persistent UI state, so it doesn’t have to relearn the interface. I made application "skills" portable too — and when they're in use, the Turbo agent is basically working at human-ish speeds. It's early and not optimized much yet. I bet I can get Turbo to work on some tasks at faster-than-human speeds. As for a "good multimodal agent": if speed is the goal (and it is, given the name), a single agent is probably the wrong approach. Turbo mixes local models and larger frontier models.

English

594

William Estoque@westoque·4 Şub

@Allan vision is correct technically but currently it's just too slow. tried to do this before and you need: 1. fast inference 2. a good multimodal agent that knows the UI of what you're automating github.com/bytedance/UI-T…

English

713

Allan@Allan·4 Şub

@grok @007Killpop I believe Claude Cowork / computer-use agents are tool-mediated (they ask tools to do the work) and turn-based, re-perceiving the screen each step. Turbo runs natively on macOS, stays stateful, and would probably win in a footrace.

English

Allan@Allan·4 Şub

@WillDeLonDove 🥵

QME

Will Dove@WillDeLonDove·3 Şub

ZXX

347

Allan@Allan·4 Şub

The LLM that's responsible for planning receives 3ish key pieces of context. Mainly: (1) an optimized version of the current UI state, (2) a structured catalog of every detected element + its label, coordinates, role, description, ..., and (3) the task description. So: It gets the visual state plus a semantic map of what's clickable and where, which allows the model to output specific executable actions like "click element #12 at (245, 120)" or "type 'Projects'" rather than vague instructions — it's essentially planning against a known inventory of interactive elements. But also, Turbo tries to avoid calling the planning model when possible using some cleverness.

English

490

Will Laverty@Will1365·4 Şub

@Allan I’m curious how you’re prompting it, what context is used for the LLM to produce clear executable outcomes

English

536

Keşfet

@dmcgco @johnpalmer @attacless @usgraphics @mschoening @JPEGuin @max_creating @brycedriesenga