Scratch

89 posts

Scratch banner
Scratch

Scratch

@scratchdotmd

The app for reviewing and publishing AI edits. Download it today at https://t.co/56IJVUgWy4

USA Katılım Nisan 2026
42 Takip Edilen2 Takipçiler
Scratch
Scratch@scratchdotmd·
shift+tab -> /goal = ??
English
0
0
0
2
Scratch
Scratch@scratchdotmd·
@xai Does this work with Scratch?
English
0
0
0
1
xAI
xAI@xai·
An early beta of Grok Build, an agentic CLI for coding, building apps, and automating workflows is now available for SuperGrok Heavy subscribers. Through this early beta, we will improve the model and product based on your feedback. Try it at x.ai/cli
xAI tweet media
English
1.6K
1.5K
10.2K
56.5M
Scratch
Scratch@scratchdotmd·
@TheStalwart This has more to do with AI than anything else.
English
0
0
0
38
Scratch
Scratch@scratchdotmd·
@mattpocockuk it seems to work best when the agent is unaware of the existence of tests
English
0
0
0
362
Matt Pocock
Matt Pocock@mattpocockuk·
Another layer of documentation I'm considering (along with CONTEXT.md and ADR's) is a list of all the agreed test seams in the app Agents simply cannot be trusted to make good decisions about what to test, and at what seam. For every small change, they extract out only what they've built into a testable function and test that. It leads to a patchwork nightmare of tests that break as soon as the implementation changes.
English
47
14
439
31.9K
Scratch
Scratch@scratchdotmd·
@comfortfajugbag we run the workflows locally now, verify the outputs in our desktop app (scratch.md), then publish straight from there 1. AI agents with skills are more flexible than n8n/make automations 2. local-file-only access makes using the AI agents safer (they can't publish)
English
0
0
0
3
Comfort Fajugbagbe ⚡ Ops Manager | AI Creator
How do you monitor if your n8n AI workflow made the right decision? With normal n8n workflows, failure is usually obvious. A node fails. An API returns an error. The workflow stops. But with AI workflows, the workflow can “succeed” while the AI still makes a bad decision. For example: wrong summary bad classification wrong lead priority weak email draft missing context wrong CRM field update That feels harder to catch than a normal error. For people building AI workflows in n8n, how are you checking quality? Do you use logs, manual review, confidence scores, Slack alerts, test data, or something else?
English
1
0
2
124
Scratch
Scratch@scratchdotmd·
@OpenAI we're already to the emotional background music for a math solution video stage? the token economics must be worse than we thought
English
0
0
0
12
OpenAI
OpenAI@OpenAI·
Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
English
1K
3.8K
26.1K
12.9M
Scratch
Scratch@scratchdotmd·
@CyrusShepard i remember when the ads had a distinct background color, now it's ai (google), ads where the results used to be (google), youtube (google), more ads where the results used to be (google), maybe maps (google) unless you search for a product, then just ads (google)
English
0
0
3
1.5K
Cyrus Maxx
Cyrus Maxx@CyrusShepard·
Today, Google released the May Core Update. This update will classify all non-Google websites as spam. Thank you for your attention to this important matter.
English
43
32
371
33.7K
Charlie Marsh
Charlie Marsh@charliermarsh·
What would you do with unlimited tokens
English
372
8
507
79.6K
Scratch
Scratch@scratchdotmd·
the whole claude code vs codex debate makes it clear that neither is substantially better (higher limits != better tool)
English
0
0
0
25
Scratch
Scratch@scratchdotmd·
the posthog ai agent is actually really good at creating sql reports
English
0
0
1
5
Scratch
Scratch@scratchdotmd·
@jyangballin 1. Google maps navigation 2. MyFitnessPal 3. Webflow (the platform itself) 4. Airtable 5. Supabase
English
0
0
0
70
John Yang
John Yang@jyangballin·
Thinking about what new tasks to put in programbench v2. What software programs (CLI tool/executables? Local apps? Websites?) would u wanna see models try building from scratch?
English
15
5
50
4.3K
Scratch
Scratch@scratchdotmd·
To everyone who recently switched from Claude Code to Codex, thank you!!! Claude Code is running much faster with far higher limits now. 😁
English
1
0
1
86
JUMPERZ
JUMPERZ@jumperz·
@skyzer4ever could be, but almost everyone I know switched to codex.. starting with me who was on 8 months streak.. thier limit rates been a joke so..
English
1
0
1
90
JUMPERZ
JUMPERZ@jumperz·
🚨 karpathy just joined anthropic. i mean the last 6 months felt like anthropic was bleeding momentum to openai and google. you don't pull a karpathy out of retirement for small stuff. you pull him when you're about to drop something huge. could it be claude comeback? idk
Andrej Karpathy@karpathy

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

English
9
0
42
2.7K
Steve Mordue
Steve Mordue@stevemordue·
@TTrimoreau Limits are not the reason I switched to Codex, and they won't be the reason I switch back when Claude becomes better
English
1
0
0
49
Thomas Trimoreau
Thomas Trimoreau@TTrimoreau·
Would you still keep using codex if claude started having the same limits?
English
81
1
79
6K
awais
awais@TheAwaisManzoor·
@theo switched to codex a week ago.....
English
2
0
0
177
Theo - t3.gg
Theo - t3.gg@theo·
Honestly I'm still really impressed with the Codex app. It works reliably. It adds useful features consistently. It has taste. The mobile integration is awesome. The git integration is solid. If you haven't used it yet, I highly recommend it.
English
222
104
4.1K
811.9K
The New Policy 🌐
The New Policy 🌐@TheNewPolicy·
@theo Yup. Anthropic runs their company like a gypsy tivoli. Switched to Codex a month ago. Never been happier.
English
1
0
0
168