Domo

565 posts

Domo

Domo

@embossedly

Katılım Temmuz 2025
21 Takip Edilen5 Takipçiler
Domo
Domo@embossedly·
@argofowl now that codex won the race they're rugpulling us in realtime and pretending to fix it and now that claude is losing they're upping limits rapidly never get comfortable. keep these companies on their toes cuz they will rugpull u as soon as they're on top
English
0
0
2
158
Domo
Domo@embossedly·
@scaling01 i'm still struggling to imagine what the announcement of AGI looks like. or if it's a retroactive agreement instead? like is it really just oai coming out one random day saying "we achieved AGI based on our criteria!". feels unserious. AGI feels like a hindsight thing?
English
0
0
0
44
Domo
Domo@embossedly·
@Crashman_X that brown brick shitbox behind him is worth at most $100,000 aud btw tank prices back to 1990 asap
English
0
0
0
343
Crashman
Crashman@Crashman_X·
No bid auctions are becoming more and more popular. The sky really is falling.
English
113
40
536
196.5K
Domo
Domo@embossedly·
@thsottiaux yesss there's a lot of giga async stuff i do that has low priority that i will just check in the morning after running overnight
English
0
0
0
4
Tibo
Tibo@thsottiaux·
Should we bring batch compute to codex? Aka /slow mode
English
1K
54
4.4K
186.7K
Domo
Domo@embossedly·
@Chrisgpt unless web search is turned off, they haven't hallucinated since gpt-5 in august 2025. at worst they just have a slightly confused argument, not really a full out hallucination.
English
0
0
0
457
Chris
Chris@Chrisgpt·
the hallucinations seem to be getting more rare with OAI models, don’t know if they dedicate a lot of time to this as a research problem but it’s noticeably better. Sometimes it does, and it’s noticeable but rarely.
English
20
5
130
31.1K
Domo
Domo@embossedly·
@emollick maybe if they're capable of updating their worldview 99% of normies cannot change their mind and formed a permanent worldview upon contact with 3.5 turbo in 2023, 4o-mini in 2024, or gemini flash 2 on google search in 2025 and declared ai useless, forever.
English
0
0
0
103
Ethan Mollick
Ethan Mollick@emollick·
One of the easiest ways to show journalists and academics who have not used AI in awhile how far things have come is to throw something they know into Pro (or Extended Thinking) and ask for a fact check. Between the citations & actual depth of critique, it really changes minds.
English
18
14
181
12.3K
Ethan Mollick
Ethan Mollick@emollick·
GPT-5.5 Pro is a very solid fact checker. I can throw entire chapters at it and it will hunt down every key reference accurately. The only real annoyance is that it loves nuance, so returns a lot of “the general idea is right, but you are not taking into account tiny detail X”
English
83
52
1.1K
57.9K
Domo
Domo@embossedly·
@Chrisgpt the more surprising thing is that 3.5 can be used in the big may 2026
English
0
0
0
1.2K
Chris
Chris@Chrisgpt·
I got a little bored.. GPT 3.5 vs GPT 5.5 Pro extended "Explain why pizza tastes good in two sentences"
Chris tweet mediaChris tweet media
English
26
3
472
104.3K
Domo
Domo@embossedly·
@synthwavedd is 5.6 mid compared to mythos tho
English
0
0
1
235
Domo
Domo@embossedly·
@AravPhi @jarredsumner @rustaceans_rs the blog post has no tests to verify against and is built on human taste for other humans to read so actually yeah ai is still awhile from acing hard to verify stuff that humans care about
English
0
0
1
76
Arav
Arav@AravPhi·
@jarredsumner @rustaceans_rs cmon guys its defiantly harder to write a blog post about porting bun to rust than actually porting bun to rust
English
3
0
56
11.1K
Rust Bytes 🦀
Rust Bytes 🦀@rustaceans_rs·
what are you working on this weekend?
English
18
1
89
21.5K
Domo
Domo@embossedly·
@larsencc lol no backend is easy asf. any frontend dev can effortlessly vibe code a backend now because AI is literally strong at: controlled environment, easy to verify but not chaotic environment, hard to verify.
English
0
0
0
149
Domo
Domo@embossedly·
@benjamin_horne @theo @OpenAI you skipped over the fact that he's a massive prick to everyone & everything and is in general just a hateful, whining little cretin for no reason behavior which got normalized after elmo took over this platform
English
1
0
1
55
Ben Horne
Ben Horne@benjamin_horne·
The tragic thing about @theo is he's such a beta male theater kid spazz, that @OpenAI doesn't even *need* to pay him to get him to shill for them. By inviting Theo to their cute little dev events, and giving him early access to models, and having their handlers reach out to him to make him feel all warm and fuzzy and special, they get 100% of Theo's loyalty because he's such an easily manipulated loser. It's the equivalent of the dorky kid who always gets bullied by the popular kids, but then jumps at the chance to sit at their lunch table when he's finally offered the opportunity, and instantly becomes their loyal little puppy dog from that day on. If Anthropic had beaten OpenAI to the punch, and had given Theo all these little perks and ways of making him feel special, you can bet he'd be out here shilling for Anthropic today instead. It's insanely sad to watch, and so painfully clear what's going on. He thinks he's "in on it" rather than just being completely used and manipulated by their team. 🤣 Theo wants so badly for OAI engineers to notice him and accept him, because he is an uncool, neurotic, freakazoid YouTuber rather than a serious or legit engineer, and he just wants to feel "a part of it all" somehow, even though he has zero deep understanding of LLMs nor a background in AI research (again, the lunch table thing). He wants to keep getting invited to OpenAI events, get shipped OpenAI merch, keep getting internet likes and mentions from OpenAI's super duper cool *actual* engineers, etc. so he will continue to find every little opportunity he can to shill for OpenAI and/or bash Anthropic. He is such a loyal little OAI simp that he even tries to bash Anthropic's... web design (the color's are so bad! the art sucks!!). Hey Theo, could you be any more obvious? He pathetically calls himself a "CEO" in his bio, but his "company" is just a glorified way to toggle between using different models—wow, such a visionary, look out Silicon Valley! Theo is not a serious engineer, or person, or serious anything in the AI space. He is a fan.
English
4
0
6
1.1K
Domo
Domo@embossedly·
@levie you realize ai will be better than humans at judgment, taste, triage work etc in a few years max 😭 muh human judgment. it will fall just like math is falling *right now.* but i get why CEOs are so intent on deceiving the anxious cattle :-)
English
1
0
1
135
Aaron Levie
Aaron Levie@levie·
Here’s a key line in this mythos update. This is precisely an example of why engineers don’t go away, ever. We’ve made it far easier to create and find security issues, which means the new bottleneck is our ability to actually review, respond to, and fix the issues. Far from AI magically solving all of this, there still is major triage work and human judgment required to do the follow on work to actually protect systems. As a result, we’re about to enter a security engineer boom. Jevons paradox all over again.
Aaron Levie tweet media
Anthropic@AnthropicAI

Last month we launched Project Glasswing, our collaborative AI cybersecurity initiative. Since then, we and our partners have found more than ten thousand high- or critical-severity vulnerabilities in essential software.

English
70
62
574
119K
Domo
Domo@embossedly·
@Chrisgpt reminder mythos got 77.8% on this and openai then started ignoring this benchmark for 5.5 release lmfaoo
English
0
0
0
437
Chris
Chris@Chrisgpt·
The GPT 5 series progress over the past 10 months with SWE PRO (public) ! • GPT 5.1: 50.8% • GPT 5.2: 55.6% • GPT 5.3 Codex: 56.8% • GPT 5.4: 57.7% • GPT 5.5: 58.6% From 50.8% to 58.6% across the GPT 5 series is slow, steady, and very real progress toward coding agents that can already do useful ‘economically valuable’ work. Anthropic is going to look a bit more accelerated, however that model is not out yet
Chris tweet media
English
33
19
485
38.3K
Lisan al Gaib
Lisan al Gaib@scaling01·
The only other lab that could have a mythos class model right now is OpenAI and if they had a model as capable as Mythos why wouldn't they do a "PR campaign"? Why would they leave the stage to Anthropic? It's not an internal model when dozens of companies are using the model. You are just turbo coping
sdmat@sdmat123

@scaling01 Because it's an internal model. It's not relevant, you can't use it. The other labs have internal models too, they just don't do the PR campaign.

English
20
6
264
27.7K
Domo
Domo@embossedly·
@scaling01 mythos' aura is insane normies are so mad they can't access it cuz they aren't relevant or elite enough to be able to 🫵😂 instead they get oai's second-rate for-the-poors version
English
0
0
1
389
Peter Gostev
Peter Gostev@petergostev·
So average days between releases is 52 days, if we exclude the first long one, it is 40 days. So a couple of weeks at least is a reasonable bet. Could be a bit longer if it is a new pre-train and they need more time to adjust.
Peter Gostev tweet media
English
13
10
177
104.4K
Domo
Domo@embossedly·
@hd_nvim dino ahh devs-- as jarred said, this move is outside their overton window right now but it will be widely accepted in 6-12 months. so this move will look dumb in hindsight
English
8
1
43
9.2K
Herrington Darkholme
yt-dlp plans to drop support for Bun Reason: it is vibe-coded
Herrington Darkholme tweet media
English
106
191
4.8K
594.1K
Domo
Domo@embossedly·
@binarybits ed midtron will continue twisting himself into a self contradictory loop btw and he'll get more clownish as time goes on as reality completely disagrees with everything he says the downfall will be biblical
English
0
0
2
198
Timothy B. Lee
Timothy B. Lee@binarybits·
Ed Zitron is such a hack. The ellipses here replaced a paragraph that started "The company disclosed the figures to investors as part of an ongoing funding round." It's dishonest to cut that paragraph and then act like the timing is suspiciously coincidental.
Timothy B. Lee tweet media
English
10
21
467
27.7K
Domo
Domo@embossedly·
@scaling01 some random guy (yes he's random) declaring AGI doesn't make it so btw a committee agreeing will make it feel real and authoritative
English
0
0
1
527
Domo
Domo@embossedly·
@opinonhaver why are they proudly speedrunning the citrini scenario though like this isn't gonna end well in 2 years when every company only thinks of themselves and not broader societal impacts of the majority of white collar workers not having a job anymore
Domo tweet media
English
1
0
0
192
William B. Fuckley
William B. Fuckley@opinonhaver·
Businesses aren’t charities and there’s nothing inherently evil about having to let people go, stuff happens. But there should be more way stigma around doing this kind of public “Here’s why I made the brave, possibly heroic, decision to fire people” shit. Bad form.
Zeb Evans@DJ_CURFEW

Today we reduced headcount by 22%. The business is the strongest it's ever been. So I think it's important to be direct about what I'm seeing and why. First, I made this decision and I own it. I did it because the way to operate at the highest level of productivity is changing, and to win the future, ClickUp needs to change with it. Second, this wasn't about cutting costs. Most savings from this change will flow directly back into the people who stay. We'll be introducing million-dollar salary bands. If you create outsized impact using AI, you'll be paid outside of traditional bands. Most importantly, I have the deepest gratitude for those affected. We're doing this from a position of strength specifically so we can take care of people properly. Everyone affected receives a package aimed at honoring their contributions and easing the transition. I only see two options: wait for this to play out gradually in the market or be honest about what I'm seeing and act proactively. THE 100X ORGANIZATION The primary change is that we're restructuring around what I call 100x org. The goal is 100x output. The roles required to build at the highest level are fundamentally different than they were a year ago. Incremental improvements to existing systems won't get us there. We need new ones. That means creating enough disruption to rebuild rather than iterate on what's already broken. The common narrative is that AI makes everyone more productive. It doesn't. Many of the workflows of today, if left unchanged, create bottlenecks in AI systems. These roles will evolve. But waiting for that to happen naturally means falling behind now. The 100x org is actually heavily dependent on people - infinitely more than today. This is only possible with 10x people that have embraced and adopted new ways of working. THE BUILDERS, AGENT MANAGERS, AND FRONT-LINERS — THE BUILDERS: 10X ENGINEERS I don't think most companies have internalized what's actually happening with AI in engineering. The common narrative is that AI makes all engineers more productive. That may be true in isolation, but at an organization level - that is the farthest thing from reality. Here's what we've validated recently at ClickUp: the great engineers, the ones who can orchestrate, architect, and review, are becoming 100x engineers. They're not writing code. They're directing agents that write code. The skill is judgment. AI makes the best engineers wildly more productive, and everyone else using AI slows these engineers down. Think about it - the bottlenecks are (1) orchestration - telling AI what to do, and (2) reviewing - what AI did. Everything is leapfrogged and no longer needed. So who do you want orchestrating and reviewing code? And how do you want your best engineers to spend their time? If your best engineers are spending time reviewing other people's code, then this is inherently an inefficient bottleneck. These engineers can review their agent's code much faster than reviewing human code. The new world is about enabling your 10x engineers to become 100x. The wrong strategy is to push every engineer to use infinite tokens. Companies doing this are celebrating 500% more pull requests. But customer outcomes don't match the volume of code being generated. I call this the great reckoning of AI coding, and every company will face this soon if not already. More code is just another bottleneck to the best engineers, and ultimately to your company's impact as well. — THE BUILDERS: 10X PRODUCT MANAGERS Product management and design roles are merging. Designers that have customer focus, become more like product managers. And product managers that have intuition for UX become more like designers. The bottleneck of user research is gone. It takes us just one mention of an agent to kickoff research and analyze results. The bottleneck of product <> design iteration is also gone. The product builder iterates on their own, along with agents and skills that ensure alignment with quality and strategy. Also controversial today - I believe that the wrong strategy is to have your PMs shipping code - that just introduces another bottleneck that the best engineers will waste their time on. To be clear, PMs should be coding but they should do this in a playground to iterate, validate, and scope. That code should not go to production. Everything outside of managing systems, orchestrating AI, and reviewing output becomes a bottleneck. That's why the other roles that are critical along with these are the systems managers (to reduce bottlenecks) along with a bottleneck you can't replace - customer meeting time. — THE SYSTEM MANAGERS Ironically, the people that automate their jobs with AI will always have a job. They become owners of the AI systems - agent managers. We have many examples of these people at ClickUp. The underlying systems in which we operate are absolutely critical to get right. I think most companies are delusional to think they can iterate on existing systems and compete in this new world. You must create enough disruption so that old systems are deprecated entirely. If there's any definition for 'AI native' that's what it is. — THE FRONT-LINERS In a world that will become saturated with AI communication, the human touch will matter more than anything to customers. This is a bottleneck that you shouldn't replace - even when agents are high enough quality to do video meetings. One-on-one meeting time with customers is something that shouldn't be automated. The systems around the meetings should be - so that front-liners spend nearly 100% of their time with customers. REWARDING 100X IMPACT In a world where companies are able to do so much more with less, where does that excess money go? In our case, much of the savings in this new operating model will flow directly back to those that enabled it. We must reward people that create productivity accordingly. This aligns incentives on both sides. Plus, in a world where your best people create 100x impact, you can't afford to lose them. You should aim to retain these employees for decades. The context they have and their ability to efficiently orchestrate and review will be nearly impossible to replace. Compensation bands of today should be thrown out the door. We're introducing $1 million cash/year salary bands with a path available to nearly everyone in the company if they produce 100x impact by creating or managing AI systems. THE FUTURE Nearly every company will make changes like these. The ones that do it proactively will define what comes next. The future is not fewer people. It's different work, new roles, and better rewards for those who embrace it. We're already seeing entirely new roles emerge, like Agent Managers, that didn't exist a year ago. ClickUp is positioning to lead this shift, not just internally, but for our customers too. I've never been more certain about where we're headed.

English
8
34
791
33.5K