Base44
1.1K posts

Base44
@Base44
Create fully-functional apps in minutes. If you can describe it, you can build it.
Joined September 2024
12 Following, 29.9K Followers
Base44 retweeted

We’re introducing a new model benchmark.
And it’s a different kind of benchmark. (Basemark? Vibench?)
It’s different because it’s a living benchmark, constantly updated from millions of builders, not a closed set of tasks.
For a while now, public benchmarks have not been very useful. Many models score high on them yet have very low real-world usability.
So we’re introducing to the world a benchmark that we use internally and have found extremely useful.
Our benchmark is basically how satisfied millions of users are when using different models.
IMO it’s the closest measurement to how useful a model is in real world use cases.
This metric is also correlated with our own business metrics - conversion, retention, etc.
We called it the frustration meter.
It automatically analyses millions of messages daily.
It detects bug loops, repeated requests, etc.
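The post does not say how repeated requests are detected. As a minimal sketch of one possible signal, assuming a conversation is just an ordered list of builder messages, the snippet below counts messages that closely resemble the one right before them. The function names and the 0.85 similarity threshold are hypothetical choices for illustration, not Base44's method.

```python
from difflib import SequenceMatcher


def normalize(message: str) -> str:
    # Lowercase and collapse whitespace so trivial differences do not hide a repeat.
    return " ".join(message.lower().split())


def count_repeated_requests(messages: list[str], threshold: float = 0.85) -> int:
    """Count messages that are near-duplicates of the message just before them.

    The threshold is an assumed value, not something stated in the post.
    """
    repeats = 0
    for previous, current in zip(messages, messages[1:]):
        similarity = SequenceMatcher(None, normalize(previous), normalize(current)).ratio()
        if similarity >= threshold:
            repeats += 1
    return repeats


# Hypothetical conversation: the builder asks for the same fix three times.
conversation = [
    "Add a login page",
    "The login button does nothing",
    "the login button  does nothing",
    "The login button does nothing!",
]
print(count_repeated_requests(conversation))
```

A real system would presumably need something more robust than character-level similarity, since builders often rephrase the same complaint in different words.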
We use this to benchmark every model we consider shipping. Not by asking "did it generate correct code." By asking "how did the builder feel after using it."
It’s also a good benchmark for measuring model degradation. Over the past few weeks we haven’t found any.
Here's where the top models stand right now, ranked by average frustration score (scale 1 to 5, lower is better):
Opus 4.6 - 1.3
Sonnet 4.6 - 1.4
Opus 4.7 - 1.5
GPT 5.5 - 1.5
GPT 5.4 - 1.6
Gemini 3.1 - 2.2
For app building, Opus 4.6 seems better than 4.7 to a lot of builders. We ran Opus 4.7 50/50 against Opus 4.6 across more than 10,000 apps. Frustration rose by 43% and turns per request by 19%.
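The arithmetic behind a ranking like the one above, and behind a "rose by 43%" comparison, can be sketched as follows. The averages are the ones quoted in the post. The helper names are hypothetical, and the two arm totals at the end are made-up numbers chosen only to show how a relative change is computed; the post does not give the raw values behind the 43% figure.

```python
from statistics import mean


def average_score(scores: list[float]) -> float:
    """Average frustration score for one model (scale 1 to 5, lower is better)."""
    return mean(scores)


def rank_models(averages: dict[str, float]) -> list[tuple[str, float]]:
    """Sort models from least to most frustrating; ties keep their input order."""
    return sorted(averages.items(), key=lambda item: item[1])


def relative_change_percent(baseline: float, variant: float) -> float:
    """Percentage change of the variant arm relative to the baseline arm."""
    return (variant - baseline) * 100 / baseline


# Averages as quoted in the post.
published = {
    "Opus 4.6": 1.3,
    "Sonnet 4.6": 1.4,
    "Opus 4.7": 1.5,
    "GPT 5.5": 1.5,
    "GPT 5.4": 1.6,
    "Gemini 3.1": 2.2,
}

for model, score in rank_models(published):
    print(f"{model}: {score}")

# Hypothetical arm totals for a 50/50 split, for illustration only.
print(relative_change_percent(baseline=100, variant=143))
```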
Gemini 3.1 doesn’t perform well at the moment. I left it out of the graph because its rapid changes in this benchmark made the graph unclear.
Quick note: this is all aggregated data and does not involve reading individual or identifiable conversations.
We’ll keep tracking it and I’ll share it from time to time.

Introducing: Base44’s new platform migration feature.
If your needs have outgrown your current software, you can now bring your projects from other platforms into Base44 in a single click.
This includes your Salesforce pipeline, Shopify catalog or WordPress site. Also whatever you’ve built on Lovable, Bolt, or Replit.
To celebrate the launch, anyone who completes migration by May 5th, 12am ET gets 25 free credits added to their account!
Base44 retweeted

Just shipped: SEO & GEO for your @Base44 apps.
You can now run a full scan, get a scored breakdown, and fix everything with AI. No SEO background needed.
Two things get scored separately:
- SEO: how Google finds you - meta tags, crawlability, structured data, content quality
- GEO: how AI tools find you - ChatGPT, Gemini, Perplexity. Different logic than Google.
What ranks on one doesn't automatically work on the other.
There are millions of builders on Base44. Most of them ship something real and then it just sits there.
Someone searches for exactly the tool they built - on Google or inside ChatGPT - and gets nothing. That's what this solves.
One thing worth knowing specifically: we now generate llms.txt automatically. It's a file that tells AI search engines what your app does and how to reference it.
If a user asks ChatGPT for "best app to manage X" and you built it, you now have a shot at showing up.
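For a sense of what such a file can look like, here is a minimal sketch that builds one, loosely following the publicly proposed llms.txt layout (a Markdown title, a short blockquote summary, then sections of links). The post does not show the file Base44 generates, so the function name, the "Pages" section, the app details and the example.com URLs are all assumptions.

```python
def build_llms_txt(name: str, summary: str, links: list[tuple[str, str, str]]) -> str:
    """Build llms.txt content from an app name, a summary, and (title, url, description) links.

    The layout is an assumption based on the public llms.txt proposal,
    not the output of Base44's generator.
    """
    lines = [f"# {name}", "", f"> {summary}", "", "## Pages", ""]
    for title, url, description in links:
        lines.append(f"- [{title}]({url}): {description}")
    return "\n".join(lines) + "\n"


# Hypothetical app used only for illustration.
content = build_llms_txt(
    name="Task Tracker",
    summary="A simple app for managing team tasks and deadlines.",
    links=[
        ("Home", "https://example.com/", "Overview of features"),
        ("Pricing", "https://example.com/pricing", "Plans and prices"),
    ],
)
print(content)
```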
It scans your app and shows you exactly what's missing: a meta description on your pricing page, structured data that Google expects, an image for when someone shares the link.
You click Fix with AI, it writes them for you.
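The three gaps named above (a meta description, structured data, a sharing image) can be checked for with a few lines of standard-library Python. This is only a sketch under stated assumptions: it treats a JSON-LD script tag as "structured data" and an og:image meta tag as the "sharing image", and the class and function names are hypothetical. It says nothing about how Base44's scanner works or how it scores pages.

```python
from html.parser import HTMLParser

# Signals this sketch looks for, in reporting order (an assumed, simplified set).
CHECKS = ["meta description", "structured data (JSON-LD)", "share image (og:image)"]


class SeoSignalParser(HTMLParser):
    """Record which of the signals in CHECKS appear in a page."""

    def __init__(self) -> None:
        super().__init__()
        self.found: set[str] = set()

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if tag == "meta" and attributes.get("content"):
            if (attributes.get("name") or "").lower() == "description":
                self.found.add("meta description")
            if (attributes.get("property") or "").lower() == "og:image":
                self.found.add("share image (og:image)")
        elif tag == "script" and (attributes.get("type") or "").lower() == "application/ld+json":
            self.found.add("structured data (JSON-LD)")


def missing_signals(html: str) -> list[str]:
    """Return the signals from CHECKS that the page does not contain."""
    parser = SeoSignalParser()
    parser.feed(html)
    return [check for check in CHECKS if check not in parser.found]


# Hypothetical pricing page with a description but nothing else.
page = (
    "<html><head><title>Pricing</title>"
    '<meta name="description" content="Plans and prices">'
    "</head><body></body></html>"
)
print(missing_signals(page))
```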
Live for all builders now. Under the new Growth section.

@Norbert61854530 We’d love to see how we can help. Could you please DM us?

@rashidrealme We’d love to see how we can help. Could you please DM us your email address?

Hey @Base44 team, I applied for the partner program a week ago (second time applying) and still no email confirmation.
Just checking if my application made it through.
Anything I should do differently?

@CalebOl84463652 We’d love to see how we can help. Could you please DM us?