Scratch
89 posts

Scratch
@scratchdotmd
The app for reviewing and publishing AI edits. Download it today at https://t.co/56IJVUgWy4
USA Katılım Nisan 2026
42 Takip Edilen2 Takipçiler

Even if you take the strict view that economic sentiment surveys are just political polls in disguise, it's notable that the red line here has fallen to the lowest its been during any Trump admin years.
Petr Pinkhasov@pinkhasov
Its always been a medium to express the political bent.
English

@mattpocockuk it seems to work best when the agent is unaware of the existence of tests
English

Another layer of documentation I'm considering (along with CONTEXT.md and ADR's) is a list of all the agreed test seams in the app
Agents simply cannot be trusted to make good decisions about what to test, and at what seam.
For every small change, they extract out only what they've built into a testable function and test that.
It leads to a patchwork nightmare of tests that break as soon as the implementation changes.
English

@comfortfajugbag we run the workflows locally now, verify the outputs in our desktop app (scratch.md), then publish straight from there
1. AI agents with skills are more flexible than n8n/make automations
2. local-file-only access makes using the AI agents safer (they can't publish)
English

How do you monitor if your n8n AI workflow made the right decision?
With normal n8n workflows, failure is usually obvious.
A node fails.
An API returns an error.
The workflow stops.
But with AI workflows, the workflow can “succeed” while the AI still makes a bad decision.
For example:
wrong summary
bad classification
wrong lead priority
weak email draft
missing context
wrong CRM field update
That feels harder to catch than a normal error.
For people building AI workflows in n8n, how are you checking quality?
Do you use logs, manual review, confidence scores, Slack alerts, test data, or something else?
English

Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946.
For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids.
An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better.
This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
English

@CyrusShepard i remember when the ads had a distinct background color, now it's ai (google), ads where the results used to be (google), youtube (google), more ads where the results used to be (google), maybe maps (google)
unless you search for a product, then just ads (google)
English

@jyangballin 1. Google maps navigation
2. MyFitnessPal
3. Webflow (the platform itself)
4. Airtable
5. Supabase
English

Whew… switched over to Codex just in time.
Thanks @yacineMTB
Andrej Karpathy@karpathy
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
English

@skyzer4ever could be, but almost everyone I know switched to codex.. starting with me who was on 8 months streak.. thier limit rates been a joke so..
English

🚨 karpathy just joined anthropic.
i mean the last 6 months felt like anthropic was bleeding momentum to openai and google.
you don't pull a karpathy out of retirement for small stuff. you pull him when you're about to drop something huge.
could it be claude comeback? idk
Andrej Karpathy@karpathy
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
English

@TTrimoreau Limits are not the reason I switched to Codex, and they won't be the reason I switch back when Claude becomes better
English

@theo Yup. Anthropic runs their company like a gypsy tivoli.
Switched to Codex a month ago. Never been happier.
English







