Scratch

89 posts

Scratch

@scratchdotmd

The app for reviewing and publishing AI edits. Download it today at https://t.co/56IJVUgWy4

USA Katılım Nisan 2026

42 Takip Edilen2 Takipçiler

Scratch@scratchdotmd·18h

shift+tab -> /goal = ??

English

Scratch@scratchdotmd·23h

@xai Does this work with Scratch?

English

xAI@xai·14 May

An early beta of Grok Build, an agentic CLI for coding, building apps, and automating workflows is now available for SuperGrok Heavy subscribers. Through this early beta, we will improve the model and product based on your feedback. Try it at x.ai/cli

English

1.6K

1.5K

10.2K

56.5M

Scratch@scratchdotmd·23h

@TheStalwart This has more to do with AI than anything else.

English

Joe Weisenthal@TheStalwart·1d

Even if you take the strict view that economic sentiment surveys are just political polls in disguise, it's notable that the red line here has fallen to the lowest its been during any Trump admin years.

Petr Pinkhasov@pinkhasov

Its always been a medium to express the political bent.

English

119

23.4K

Scratch@scratchdotmd·1d

@mattpocockuk it seems to work best when the agent is unaware of the existence of tests

English

362

Matt Pocock@mattpocockuk·1d

Another layer of documentation I'm considering (along with CONTEXT.md and ADR's) is a list of all the agreed test seams in the app Agents simply cannot be trusted to make good decisions about what to test, and at what seam. For every small change, they extract out only what they've built into a testable function and test that. It leads to a patchwork nightmare of tests that break as soon as the implementation changes.

English

439

31.9K

Scratch@scratchdotmd·1d

@comfortfajugbag we run the workflows locally now, verify the outputs in our desktop app (scratch.md), then publish straight from there 1. AI agents with skills are more flexible than n8n/make automations 2. local-file-only access makes using the AI agents safer (they can't publish)

English

Comfort Fajugbagbe ⚡ Ops Manager | AI Creator@comfortfajugbag·2d

How do you monitor if your n8n AI workflow made the right decision? With normal n8n workflows, failure is usually obvious. A node fails. An API returns an error. The workflow stops. But with AI workflows, the workflow can “succeed” while the AI still makes a bad decision. For example: wrong summary bad classification wrong lead priority weak email draft missing context wrong CRM field update That feels harder to catch than a normal error. For people building AI workflows in n8n, how are you checking quality? Do you use logs, manual review, confidence scores, Slack alerts, test data, or something else?

English

124

Scratch@scratchdotmd·1d

friendly reminder to review the output of your ai before you ship it

Joe Weisenthal@TheStalwart

This was an amazing and incredibly damning experiment using Microsoft Copilot, by @adamjkucharski kucharski.substack.com/p/real-signals…

English

Scratch@scratchdotmd·1d

@OpenAI we're already to the emotional background music for a math solution video stage? the token economics must be worse than we thought

English

OpenAI@OpenAI·2d

Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.

English

3.8K

26.1K

12.9M

Scratch@scratchdotmd·1d

@CyrusShepard i remember when the ads had a distinct background color, now it's ai (google), ads where the results used to be (google), youtube (google), more ads where the results used to be (google), maybe maps (google) unless you search for a product, then just ads (google)

English

1.5K

Cyrus Maxx@CyrusShepard·1d

Today, Google released the May Core Update. This update will classify all non-Google websites as spam. Thank you for your attention to this important matter.

English

371

33.7K

Scratch@scratchdotmd·2d

brilliant concept

Nathan Baschez@nbaschez

Introducing Roughdraft! A new open source project designed to make collaboration with agents better. The idea is to bring commenting and suggested changes to markdown (e.g. plan docs) in a nice interface. Free, local, etc. 👉 roughdraft.md 👈

English

Scratch@scratchdotmd·2d

@charliermarsh build a real search engine

English

Charlie Marsh@charliermarsh·2d

What would you do with unlimited tokens

English

372

507

79.6K

Scratch@scratchdotmd·2d

the whole claude code vs codex debate makes it clear that neither is substantially better (higher limits != better tool)

English

Scratch@scratchdotmd·2d

the posthog ai agent is actually really good at creating sql reports

English

Scratch@scratchdotmd·2d

most of the serp is just ads anyway

Google Search Central@googlesearchc

Today we released the May 2026 core update. We'll update our ranking release history page when the rollout is complete: status.search.google.com/incidents/wdAX…

English

Scratch@scratchdotmd·2d

@jyangballin 1. Google maps navigation 2. MyFitnessPal 3. Webflow (the platform itself) 4. Airtable 5. Supabase

English

John Yang@jyangballin·2d

Thinking about what new tasks to put in programbench v2. What software programs (CLI tool/executables? Local apps? Websites?) would u wanna see models try building from scratch?

English

4.3K

Scratch@scratchdotmd·2d

To everyone who recently switched from Claude Code to Codex, thank you!!! Claude Code is running much faster with far higher limits now. 😁

English

Scratch@scratchdotmd·2d

@Ddddarren @yacineMTB what made you switch

English

Darren@Ddddarren·4d

Whew… switched over to Codex just in time. Thanks @yacineMTB

Andrej Karpathy@karpathy

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

English

Scratch@scratchdotmd·2d

@jumperz @skyzer4ever what made you switch

English

JUMPERZ@jumperz·4d

@skyzer4ever could be, but almost everyone I know switched to codex.. starting with me who was on 8 months streak.. thier limit rates been a joke so..

English

JUMPERZ@jumperz·4d

🚨 karpathy just joined anthropic. i mean the last 6 months felt like anthropic was bleeding momentum to openai and google. you don't pull a karpathy out of retirement for small stuff. you pull him when you're about to drop something huge. could it be claude comeback? idk

Andrej Karpathy@karpathy

English

2.7K

Scratch@scratchdotmd·2d

@stevemordue @TTrimoreau what made you switch

English

Steve Mordue@stevemordue·3d

@TTrimoreau Limits are not the reason I switched to Codex, and they won't be the reason I switch back when Claude becomes better

English

Thomas Trimoreau@TTrimoreau·4d

Would you still keep using codex if claude started having the same limits?

English

Scratch@scratchdotmd·2d

@TheAwaisManzoor @theo what made you switch

English

awais@TheAwaisManzoor·3d

@theo switched to codex a week ago.....

English

177

Theo - t3.gg@theo·3d

Honestly I'm still really impressed with the Codex app. It works reliably. It adds useful features consistently. It has taste. The mobile integration is awesome. The git integration is solid. If you haven't used it yet, I highly recommend it.

English

222

104

4.1K

811.9K

Scratch@scratchdotmd·2d

@TheNewPolicy @theo what made you switch

English

The New Policy 🌐@TheNewPolicy·3d

@theo Yup. Anthropic runs their company like a gypsy tivoli. Switched to Codex a month ago. Never been happier.

English

168

Keşfet

@xai @TheStalwart @mattpocockuk @comfortfajugbag @OpenAI @CyrusShepard @charliermarsh @elonmusk