R. @RichDoesTech
2.9K posts

Christian | 1x Husband | 5x Dad | Serial bootstrapped Founder | Building private AI tools that work offline @TryYaps (Sign up to the https://t.co/4B2vBqXwEl waitlist 👋)

London · Joined October 2020
420 Following · 3.2K Followers

Pinned Tweet
R. @RichDoesTech
# Day 1

Recently I've been focused on building a privacy-first, UX-friendly AI tool that handles a full suite of AI voice tools (dictation, screen reading, and much more), slowly expanding into other domains. The hope is to address privacy, tool fatigue, latency, and several other things.

But behind the product, I'm challenging some assumptions that undergird the very root of how pre-LLM startups were built. I'll try to write more extensively on this. For now, enjoy the UI from my latest Claude Code refactor 😅

(P.S. Shoutout to @DannPetty on reframe)
R. tweet media
5 replies · 2 reposts · 28 likes · 2K views

R. @RichDoesTech
Big week ahead, bigger month ahead!
1 reply · 0 reposts · 3 likes · 36 views

R. @RichDoesTech
When some bob, others will weave. Some will compete on evals, others on price. Some will fight to be the best, others to be the most affordable.

Compute is almost certainly way more expensive than what's currently being charged, agreed. But there's a misconception that everyone is willing to pay full price. I don't think that's true - and I do think companies will invest more in open-source models or cheaper proprietary solutions over being at the mercy of ever-increasing prices.

Also, a rising tide lifts all boats. Models like Kimi facilitate the release of a Composer 2 - a model that runs at something like 1/20th the price of other frontier models, gets trained on real-time data with a 5-hour post-training cycle, and for coding (based on the evals) is likely "good enough" for most tasks.

These divides between the various companies/labs will only continue to grow imo, leading to stronger divides in their respective ICPs.
Andrew Curran @AndrewCurran_

Three weeks ago there were rumors that one of the labs had completed its largest ever successful training run, and that the model that emerged from it performed far above both internal expectations and what people assumed the scaling laws would predict. At the time these were only rumors, and no lab was attached to them. But in light of what we now know about Mythos, they look more credible, and the lab was probably Anthropic.

Around the same time there were also rumors that one of the frontier labs had made an architectural breakthrough. If you are in enough group chats, you hear claims like this constantly, and most turn out to be nothing. But if Anthropic found that training above a certain scale, or in a certain way at that scale, produces capabilities that sit far above the prior trendline, then that is an architectural breakthrough.

I think the leaked blog post was real, but still a draft. Mythos and Capybara were both candidate names for the new tier, though Mythos may now have enough mindshare that they end up keeping it. The specific rumor in early March was that the run produced a model roughly twice as performant as expected. That remains unconfirmed. What is confirmed is that Anthropic told Fortune the new model is a 'step change'; a sudden 2x would certainly fit the definition.

We will find out in April how much of this is true. My own view is that the broad shape of this is correct even if some of the numbers are wrong. And if it is substantially accurate, then it also casts OpenAI's recent restructuring in a new light. If very large training runs are about to become essential to staying in the game, then a lot of their recent decisions, like dropping Sora, make even more sense strategically.

For the public, this would mean the best models in the world are about to become much more expensive to serve, and therefore much more expensive to use. That will put pressure on rate limits, pricing, and subscription plans that are already subsidized to some unknown degree. Instead of becoming too cheap to meter, frontier intelligence may be about to become too expensive for most of humanity to afford.

Second-order effects: compute, memory, and energy are about to become much more important than they already are. In the blog they describe the new model as not just an improvement, but as having 'dramatically higher scores' than Opus 4.6 in coding and reasoning, and as being 'far ahead' of any other current models. If this is the new reality, then scale is about to become king in a whole new way. It would also mean, as usual, that Jensen wins again.

0 replies · 0 reposts · 3 likes · 72 views

R. @RichDoesTech
@Yuchenj_UW Been seeing everyone post these, honestly, I just think to myself "why"?
GIF
0 replies · 0 reposts · 0 likes · 13 views

pc @pcshipp
Literally, SEO is getting harder:
- 2 clicks
- 11.8 average position
- 327 total impressions
Maybe SEO is 100x harder now
pc tweet media
73 replies · 1 repost · 100 likes · 12.4K views

R. @RichDoesTech
Trying to publicly document the "messy middle":
- Ready to launch on Android, but Google Play approvals are being annoying.
- Went back to focusing on Mac & Windows, but had some latency regressions from feature creep (fixed now).
- Need to test payments, then good to go.
0 replies · 0 reposts · 6 likes · 87 views

R. @RichDoesTech
@Jacob660245 Bro is 16, well done!
1 reply · 0 reposts · 1 like · 13 views

Jacob Rhodes @Jacob660245
Wow. Late last night I made my first sale. It has been 13 months of working before and after school and during summer break. Thanks to all of you who supported me!
Jacob Rhodes tweet media
43 replies · 3 reposts · 73 likes · 1.7K views

Moonfarm 🇸🇪 @moonfarm_dev
WAIT, someone I DON'T KNOW just signed up 🔥🤯
Moonfarm 🇸🇪 tweet media
32 replies · 0 reposts · 57 likes · 1.9K views

R. @RichDoesTech
@CalebPanza This is looking really good. Congrats!
1 reply · 0 reposts · 1 like · 15 views

R. @RichDoesTech
Just wrapping up some assets for the mobile app store
R. tweet media
1 reply · 1 repost · 9 likes · 124 views

R. @RichDoesTech
@ericzakariasson Amazing. Let me know if you want to collaborate on that at all. Happy to help on the taste eval front.
0 replies · 0 reposts · 2 likes · 158 views

R. @RichDoesTech
Really trying to cut out noise today and lock in. I'm on the final stretch.
2 replies · 0 reposts · 11 likes · 91 views

R. @RichDoesTech
@irbaazkadri They use actual inference tokens from real traffic as training signals. There's a follow up article.
1 reply · 0 reposts · 1 like · 10 views

R. @RichDoesTech
@GergelyOrosz Na bro, we can't mentally go back 🤣 Jokes aside we'd use Codex
0 replies · 0 reposts · 0 likes · 80 views

Gergely Orosz @GergelyOrosz
Devs who can also code WITHOUT AI are looking to become 10x more valuable. They are the ones who won't panic or be idle when their Claude quota runs out… So much for all the advice on how learning to code is not worth it any more…
Thariq @trq212

To manage growing demand for Claude we're adjusting our 5 hour session limits for free/Pro/Max subs during peak hours. Your weekly limits remain unchanged. During weekdays between 5am–11am PT / 1pm–7pm GMT, you'll move through your 5-hour session limits faster than before.

153 replies · 141 reposts · 2K likes · 275K views

Sherry Jiang @SherryYanJiang
what's one ai tool that you think is criminally underrated??
83 replies · 2 reposts · 49 likes · 12.2K views