
RishDog
4.2K posts


@PlatinumKey13 Maybe we can actually have DK answer some tough questions for managing two shit games in a row instead of giving him softball questions.

@StevenCheah This is all very new for Pirates fans, Steven. There is room to jump on the wagon and enjoy the battling Bucs all season.

@Nati_Sports We will finish last in the Central.
Rotation is dog shit, we have no hope.

@aztecs858 He needs to throw a 3rd pitch more frequently or this will happen a lot more.

@Nati_Sports Why is McLain chasing out of the zone like that when the guy just walked 3 people in a row...

@__Murphy88 I really like Duce Gourson as a prospect. Feel he can be solid.

It’s been the swing decisions that have stood out to me.
He’s taken a whole other step forward this year.
The power has always been legit. But he’s rounding out as a hitter
Super impressive
Jim Rosati 🏴‍☠️@northsiden0tch
Esmerlyn Valdez just homered again for Indianapolis, his 3rd on the year. Pirates aggressively pushed him to Triple-A and he’s picked up right where he left off in the Arizona Fall League. Just absolutely destroying baseballs.

i don't know how to say this, but i was wrong about mythos.
software engineering won't disappear overnight, but things are about to change a lot.
ben (is hiring engineers)@benhylak
every engineer at anthropic has been using mythos for ~1.5 months. meanwhile, their uptime is horrendous, claude code still has rendering bugs, etc. one could conclude that it won't be the end of software engineering.

@ChaseBrowe32432 @mav3ri3k Were the questions in SWE Bench that would take a model beyond 80% pretty flawed?

@mav3ri3k oh yeah all I meant was people thought of it as "saturated" (at low 80s) and a model still jumped this far up. mythos is an absolute monster by the look of it

@NinaDSchick SWE Bench is one of the worst, most flawed benchmarks at this point.

Claude Mythos.
Ten trillion parameters: the first model in this weight class. Estimated training cost: ten billion dollars.
On the hardest coding test in the industry (SWE bench) it scores 94%.
It found a security flaw in a system that had been running for 27 years, one that every human engineer and every automated check had missed. It found another bug that had survived five million test runs over 16 years. (It did so overnight.)
It is so capable in cybersecurity that Anthropic will not release it to the public; instead, it is launching Project Glasswing along with $100M in compute credits to help secure software.
Only twelve partners currently have access: Amazon, Cisco, Apple, Google, Microsoft, NVIDIA, JPMorgan Chase, CrowdStrike, Palo Alto, AWS, The Linux Foundation, Broadcom. (I'm sure the Pentagon is on the line?)
This is not a product launch: it is a controlled deployment of a system too powerful to distribute freely.
Tell me this isn't (very expensive) AGI?
Anthropic@AnthropicAI
Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

#NetsWorld Do the Wizards have 2 rookies (Tre Johnson & Will Riley) better than all 5 of our first round picks? #ForTheDistrict

@JaredLankes @donboscomd Most hits on the team, second in avg, 5th in OPS, 3rd in WAR. The Pirates are a better team with his bat in the lineup. His average or below-average defense still makes him a net positive. If you're looking for problems on the team so far, look at Ozuna and Horwitz, not Nicky G.

@JomboyMedia Manager looks so soft when he's towered over by the umpire.

@hunnargenderson @BradishMuse Get absolutely fucked and I hope you have a terrible night.

@Smooth_Runnings @BradishMuse You tweet about ai shut the fuck up please 😹