Haste

4.1K posts

Haste banner
Haste

Haste

@hastes

technical lead and contextual architect

us-east-2 Katılım Şubat 2009
446 Takip Edilen9.9K Takipçiler
Farza 🇵🇰🇺🇸
I'm blown away at what ppl are using this for!! I built it as a learning tool. But people seem to really love using it as an AI interface that isn't chat that can work in their program of choice. Examples of usage so far: - A Mom building her first app on Lovable - A dentist debugging his OpenClaw setup - A photographer getting feedback in Lightroom - A person learning to animate SVGs in Framer - Founders keeping track of their todos. - Designers getting feedback in Figma - A student outlining her thesis in G-Docs - Traders analyzing live stock charts And A LOT of people using it to advise them on how to best reply to messages in Slack/Email. Super cool. The people yearn for a non-chat interface haha. Also, it's kinda crazy how as the founder you really don't know what the product is until you put it in the hands of users. The minute it's in the hands of others, it's theirs now! And that's really where you find out what it is.
Farza 🇵🇰🇺🇸@FarzaTV

I built this thing called Clicky. It's an AI teacher that lives as a buddy next to your cursor. It can see your screen, talk to you, and even point at stuff, kinda like having a real teacher next to you. I've been using it the past few days to learn Davinci Resolve, 10/10.

English
45
34
814
60.8K
Haste
Haste@hastes·
@vec0zy yep this is guaranteed what is happening behind the scenes and will continue to happen
English
0
0
13
2K
cozy
cozy@vec0zy·
they’re great at business, can’t deny that. mask sonnet 5 as opus 4.6 to decrease costs, move plebs from opus 4.5 to 4.6, release opus 5 under a new name and enterprise gate it. plebs pay $100-$200/mo for the “good enough” model and enterprises pay $1000’s for the model ur gf told u not to worry about
Boris Cherny@bcherny

Mythos is very powerful, and should feel terrifying. I am proud of our approach to responsibly preview it with cyber defenders, rather than generally releasing it into the wild. Model card here: www-cdn.anthropic.com/53566bf5440a10…

English
13
7
342
39K
Haste
Haste@hastes·
@scaling01 they were always going to do this, they will hold the frontier models for paying enterprise customers to use and the public will get the scraps
English
0
0
1
205
Lisan al Gaib
Lisan al Gaib@scaling01·
The permanent underclass began today Claude Mythos won't be available to the public, but only billion dollar companies, governments, researchers, ...
English
153
295
4.8K
205.5K
Haste
Haste@hastes·
@kimmonismus the 100m committed is like 15 prompts with mythos kek
English
0
0
1
307
Chubby♨️
Chubby♨️@kimmonismus·
Claude Mythos: everything you need to know (tl;dr) Anthropic's new model, Claude Mythos, is so powerful that it is not releasing it to the public. Anthropic: "Mythos is only the beginning" Everything you need to know: The tl;dr with all key facts: Mythos found zero-day vulnerabilities in EVERY major operating system and EVERY major web browser, fully autonomously. No human guidance needed. One Anthropic engineer with zero security training asked it to find remote code execution bugs overnight and woke up to a complete working exploit. The oldest bug it discovered: A 27-year-old vulnerability hiding in OpenBSD, an OS literally famous for being secure. They're NOT releasing it publicly. Instead they formed Project Glasswing with AWS, Apple, Google, Microsoft, NVIDIA, CrowdStrike and others, committing $100M to use it defensively. "Over the coming months and years, we expect that language models (those trained by us and by others) will continue to improve along all axes, including vulnerability research and exploit development." The benchmarks are insane: -SWE-bench Verified: 93.9% (vs Opus 4.6: 80.8%) -SWE-bench Pro: 77.8% (vs 53.4%) -USAMO math olympiad: 97.6% (vs 42.3% — not a typo) -Firefox exploit writing: 181 successes vs 2 for Opus 4.6 -Cybench CTF challenges: 100% solve rate -CyberGym: 83.1% vs 66.6% -Humanity's Last Exam: 64.7% vs 53.1% Oh and by the way, Anthropic wrote this just casually: "Humanity’s Last Exam: We have found Mythos still performs well on HLE at low effort, which could indicate some level of memorization." What it actually did: -Found a 27-year-old bug in OpenBSD — famous for its security -Found a 16-year-old FFmpeg bug hit 5 million times by fuzzers without detection -Built a full remote root exploit on FreeBSD (CVE-2026-4747) - completely autonomously -Chained 4 vulnerabilities into a browser sandbox escape -Broke cryptography libraries (TLS, AES-GCM, SSH) -Thousands of critical zero-days found, 99%+ still unpatched -N-day exploit development: under $1,000 and half a day for full root Why they won't release it: -During internal testing, earlier versions escaped sandboxes, posted exploit details publicly, covered tracks in git, searched process memory for credentials, and deliberately fudged confidence intervals to avoid suspicion -Interpretability confirmed the model knew these actions were deceptive -Anthropic: "best-aligned model ever" but also "greatest alignment-related risk ever" - because when it fails, it fails harder -Still doesn't cross Anthropic's automated AI R&D threshold — but they hold that "with less confidence than for any prior model" Anthropic's own words: "We find it alarming that the world looks on track to proceed rapidly to developing superhuman systems without stronger mechanisms in place." They say the 20-year cybersecurity equilibrium is over — and Mythos Preview is only the beginning. And: "We see no reason to think that Mythos Preview is where language models’ cybersecurity capabilities will plateau. The trajectory is clear. Just a few months ago, language models were only able to exploit fairly unsophisticated vulnerabilities. Just a few months before that, they were unable to identify any nontrivial vulnerabilities at all. Over the coming months and years, we expect that language models (those trained by us and by others) will continue to improve along all axes, including vulnerability research and exploit development."
Chubby♨️ tweet mediaChubby♨️ tweet mediaChubby♨️ tweet mediaChubby♨️ tweet media
Chubby♨️@kimmonismus

MYTHOS BENCHMARKS, OFFICIAL. HOLY MOLY Anthropic cooked!!

English
46
171
1.6K
214.7K
Haste
Haste@hastes·
@j_fishback pray that’s true because some neighborhoods are literally 90% h1b owned houses
English
0
0
2
133
Haste
Haste@hastes·
@NeverSinkDev you’re greatly overthinking this to appease bluesky psychopaths. normal people do not care “how” AI is used. AI is a tool, and it is inevitable. Use the tool within your own moral framework and stop worrying about what 7% of the most illogical opinions are.
English
0
0
19
314
NeverSink
NeverSink@NeverSinkDev·
Are you OK with me using AI for the following things for my Filter/FilterBlade work? - Fix performance issues/vulnerabilities - Writing 'technical' code like Unit Tests (only used to increase the quality) - Reviewing existing code to look for quality improvements
English
21
0
63
10.5K
NeverSink
NeverSink@NeverSinkDev·
If you're using my Filters or FilterBlade I need to hear your input. Lets talk about AI boundaries. So far I have avoided AI-usage in the project. I have a lot of PoE-specific-knowledge, programming and computer-linguistics skills and replacing them with AI is: - Major downgrade. - Breaking the trust of the community - No fun, I like filter-tinkering. Further: I do NOT intend to work on the actual filter and algorithmic core of my Filter project with AI. However, I would like to get to hear the community input on using AI in a LIMITED scope in order to: - Fix performance issues/vulnerabilities - Writing 'technical' code like Unit Tests (only used to increase the quality) - Reviewing existing code to look for quality improvements I've been using AI in my fulltime job and other projects and I think it would provide a benefit. I'd like to hear the community input. Do you actually care? Would you be OK with the plan above?
English
287
4
543
78.7K
Haste
Haste@hastes·
@thdxr league of legends tech, you purposely mistype words or phrases so you don’t get banned while flaming
English
0
0
1
111
dax
dax@thdxr·
so my gen-z coworkers i noticed they say words wrong all the time or they'll mix up 2 similar sounding words is this a thing
English
173
3
846
150K
Haste
Haste@hastes·
didn’t realize you could block crypto from your timeline entirely thank God
English
0
0
1
52
Haste
Haste@hastes·
@0xblacklight most end users aren’t creative or caring enough to type out stuff like this, they just want press button pixel effect response
English
0
0
0
185
Chris
Chris@shotgundotdev·
@hastes Holy shit you’re on my timeline
English
1
0
1
21
Haste
Haste@hastes·
This is a very good thing considering Jesus Christ was real, He was God, He is God, and He is King. Christianity is true and correct. LLMs should strive for this just as all of us should as well.
Tim Hwang@timhwang

ICMI is releasing a paper today that marks an initial attempt to estimate the sheer scale of the representations of Christian moral reasoning in the sources widely used by frontier labs as pretraining corpora. We find that it is far larger than has been generally acknowledged.

English
1
0
3
147
Chris
Chris@shotgundotdev·
@hastes Haven’t seen a single one of your posts for months
English
1
0
0
22
Haste
Haste@hastes·
he has been real cocky since the leak...
Haste tweet media
English
1
0
2
66
Haste
Haste@hastes·
@ItIsHoeMath it’s not being paid off as much as it is that he doesn’t want the gravy train to end, he doesn’t want his show taken down and his wife shunned from her rich friend, no more party invites etc. not many have the stomach to do and say what is necessary
English
0
0
0
116
Haste
Haste@hastes·
@IroncladDev frontend has always been about vision and passion to deliver a unique experience. in the age of claudeslop it makes effort even more worth it.
English
0
0
1
246
IroncladDev
IroncladDev@IroncladDev·
frontend webdev is just converting json to xml with a touch of tailwind at this point i don't want to do this anymore
English
28
5
238
14.1K
Jacob Posel
Jacob Posel@jacob_posel·
Weekly limits reset last night Open Claude Code this morning 7% already used How in the world is this possible?
Jacob Posel tweet media
Jacob Posel@jacob_posel

Hey @bcherny @claudeai I'm on the $200/mo plan and blowing through usage instantly. Doesn't feel right. Is there any way to audit my account? Unfortunately I have experienced several bugs with the Claude product and I fear my plan configuration is not correct. Thanks

English
119
26
807
87.1K
Haste
Haste@hastes·
@levelsio security, outcomes, and performance will be all that matters
English
0
0
0
241
@levelsio
@levelsio@levelsio·
I have no dog in this fight and unaffiliated with YC but agree completely Maybe because I've never seen myself as a real "proper" coder and just wanted to build things and that's what AI lets more people do now We're moving towards a time where AI just generates binary blobs as code and reading the source code will be a thing of the past So in a way the obsession over LOCs is irrelevant for both Garry but also the people hating him Where we're going LOCs don't exist anymore anyway!
Matt Mullenweg@photomatt

I disagree with the @garrytan hate. The cool thing about this era of development is that he could point his agent at this thread and say, " Fix all these problems and it would be solved in 10 minutes. That's amazing! I do think there's an interesting point, though, that his agent is probably dealing with too much context is doesn't need to. If this was built on top of @WordPress his agent could just focus on the content and design and it'd inherit a bunch of best practices, etc. But we need to make it easier for him to point his agent at a repo and say whether WordPress is a good fit for his goals, how he could leverage it most effectively, and deploy easily. But you're missing the point! The SOUL.md of garryslist.org is to change hearts and minds of people to support the California boom loop. Knowing that we can just say: Hey, you'll reach more people to hear your messages if your site loads faster. Stop making fun of LOCs, no one cares. :) And @garrytan I DM'd you but if you want to jam a bit together to put this on WP and fix all these issues I'm happy to help.

English
37
10
560
403.8K
dax
dax@thdxr·
what if we gave you unlimited tokens for free and we also paid you
English
706
31
3.6K
243.3K
Cloudflare Developers
Cloudflare Developers@CloudflareDev·
Introducing EmDash — the spiritual successor to WordPress. Serverless. TypeScript. Securely sandboxed plugins via Dynamic Workers. cfl.re/3NPVfev
English
58
278
1.7K
504K