————-
547 posts

————- रीट्वीट किया

@datarade > Great fleas have little fleas upon their backs to bite 'em,
> And little fleas have lesser fleas, and so ad infinitum.
> And the great fleas themselves, in turn, have greater fleas to go on;
> While those again have greater still, and greater still, and so on.
English

@_adamwiggins_ @browsercompany Can you push to fix the irresponsible security policies
English

I'm on the @browsercompany team this month!
Two reasons I'm excited for this one:
• The web (meaning web apps, cloud, HTML/CSS/JS, etc) has become a sort of universal operating system. The browser is the main affordance and touchpoint most people have with this OS. Lots of tie-ins with my career's mission on making computers better.
• The team is full of star performers. Hope I can match the excellence of execution I'm seeing in everyone's work here ✨

English

@katherineroseyy @browsercompany CAN YOU FIX YOUR SECURITY BS FIRST? Before you build a new internet or computer or whatever bs you guys trynna do
Don’t lie about your privacy policy or play with your users security…liar
English


@joshm @browsercompany CAN YOU FIX YOUR SECURITY BS FIRST? Before you build a new internet or computer or whatever bs you guys trynna do
Don’t lie about your privacy policy or play with your users security…liar
English


@levelsio oh, and hop in discord for issues: discord.gg/itsalltruffles
English

mostly for devs here but a message from our sponsors.
I think on Tuesday, we are launching “SWE-bench Verified” a tool that evaluates software engineering practices, tools, and autonomous systems. They have been talking about it for a while. But most people won’t care - but, its good for developers.
English

rushed a little but will refine and add some more info I've been given if it bangs.
-project strawberry / qstar
ai explained has been close to this for a while so i'd watch them for a cleaner take if you want to dig in. this is what ilya saw. it's what has broken math benchmarks. it's more akin to rlhf than throwing compute at the problem. sus column r is a very very tiny open ai model using strawberry. strawberry in the larger models comes on thursday.
think of it as an llm fine-tuned to reason like a human. hence why sam liked the level two comment, and felt great about it. ilya did not. here we are.
-huge models, sora, voice, video and safety.
i'd referenced some model sizing based on meta and claude having small 8b, medium 72b and large 405b. this is a simple way to frame and means nothing. except that a much larger version of 4o is coming. when you try it, it will be the first noticeable jump that we saw when going from gpt 3 to 4. the jump from original 4 to sonnet 3.5 will seem insignificant in comparison. arrives next week with strawberry.
gpt next. etc.
so gpt next (internally called gpt x, you could call it gpt5) is also ready to go. lots here relies on safety and what google do next. it's difficult to say if competition will trump safety.
though red teaming is finished and post training is done. this model is such an enormous leap in capabilities it's becoming impossible to make the model safe. if you had this particular model unlocked, you could easily disrupt the world on an unprecedented scale. when you mix in voice, video, sora, agents, and the eye-watering capabilities, things hot up. they'll get the safety right and they'll roll it out I'm sure.
this is why we post don't die or vague post around how everything is about to change forever etc. it is. we've tried the models. it's insane. i'm not directly an agent, though i've had access to an early benchmark of five to take over an account and influence some big names in the field to carry out a few things for me. github was one such case of using the model to convince several to launch.
sora and voice rollout
it's expensive. especially sora. it's proving incredibly difficult to make safe. without guardrails for example you can with a simple prompt create a video of a world leader saying anything in their own style and voice, and effortlessly hack into large scale state secrets. if you haven't read situational awareness, it lays a lot of this out.
we will get a step change next week
it won't quite be gpt5. gpt5 / next / x / is more comparable to the jump made from gpt1-4. this is why sam feels great. ilya was right. you can scale your way to a digital god with or without strawberries. but strawberries + scale will cure world problems overnight.
sam. obviously not random chance you'll see i've been rocking with current / former openai employees and jimmy for a while. tldr. we are launching strawberry. we wanted to generate some hype. we did.
please burn after reading.
English
















