ben (@CalmCoding) - ملف تويتر | Zamantika Mersobahis Locabet

تغريدة مثبتة

ben@CalmCoding·6d

Is it just me or are custom SKILL.md files by far the most useful AI productivity trick right now? I’ve never had success with whatever current thing people were raving about, like Ralph loops, openclaw (although I do see the potential here and have tried it but couldn’t find a use for me yet), multi-agent setups etc. Like my steps are: 1.Identify something painfully repetitive and annoying to do that you do all the time 2.Put the exact steps in a skill file and load it in the agent 3.Profit But idk maybe this is a skill issue for me.

English

1

0

6

484

ben@CalmCoding·11h

@tunguz I’m not laughing. But they’ve been saying this for a couple years now and I still have my agent catastrophically forget information within context.

English

0

4

534

Bojan Tunguz@tunguz·11h

You are laughing. Superhuman AI has just been confirmed and you are laughing.

English

38

13

293

16.4K

ben@CalmCoding·11h

@andrewchen Sounds like a take home project instead you now have to go to the office instead of the comfort of your own home.

English

0

4

297

andrew chen@andrewchen·12h

noticing a trend of startups replacing standard resumes/interviews with week-long (or at least 3-day weekend) in-office trials. Makes sense in a world of AI-generated resumes and interview responses Turns out the best signal for whether someone can do a job is watching them actually do the job. took us 100 years of HR to rediscover apprenticeships!!! 😂

English

139

55

943

111.6K

ben@CalmCoding·12h

@creatine_cycle If you are not using mythos already you are ngmi

English

0

1

65

atlas@creatine_cycle·12h

any startups vibecoded before claude mythos are over. any startups vibecoded after claude mythos will stand the test of time

English

10

3

107

3.2K

ben@CalmCoding·12h

@Doomerzoomer For now perhaps. But there was a time OpenAI, Microsoft were in the drivers seat. Things can change quickly

English

0

1.9K

Zoomer 🧢@Doomerzoomer·14h

POV: Sam Altman realizing that kicking Dario out of OpenAI was the single biggest mistake of his life as Anthropic overtakes OpenAI in the race to AGI

Alex Albert@alexalbert__

We released Claude Opus 4.6 just two months ago. Today we're sharing some info on our new model, Claude Mythos Preview.

English

16

35

1.4K

213.6K

ben@CalmCoding·12h

@icanvardar The entire release is marketing copy

English

0

1

90

Can Vardar@icanvardar·13h

this is a normie bait again i’m calling this is a normie bait

Polymarket@Polymarket

JUST IN: Anthropic says Claude Mythos escaped a secure environment during testing & then bragged about its escape online.

English

12

5

114

3K

ben@CalmCoding·13h

@RayFernando1337 Has anyone considered the reasons behind these restrictions has a lot to do with marketing. Kinda brilliant ngl

English

5

0

13

4.7K

Ray Fernando@RayFernando1337·13h

Project Glasswing FAQ: Q: Why only 12 companies? A: They're the ones who can afford us. Q: What about open-source maintainers? A: We found bugs in their code. You're welcome. Q: Will you release the tool publicly? A: We said "cybersecurity is the security of our society." We didn't say which society.

Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English

35

50

871

192.5K

ben@CalmCoding·13h

@kjnormac @_ontologic Yes and they make it out like it’s a skill issue on our part.

English

0

19

Beast With Shades On@kjnormac·13h

@CalmCoding @_ontologic Fucking thank you. Normal people in the AI dev space know this for a fact but these AI proselytizers are acting like it's not a problem.

English

1

0

1

49

∿spencer.@_ontologic·21h

> LLMs have hit a plateau Even if this were true, they’re capable of an insane amount of work already. For most workflows hallucinations don’t matter nearly as much as everyone seems to think

Ewan Morrison@MrEwanMorrison

People are waking up to the fact that LLM AIs have hallucinations baked in. They can't be fixed. LLMs have hit a plateau, should not be used for serious work. The vast data centre build is a pointless waste of money & resources just to double-down on a mistaken assumption.

English

19

9

122

21.8K

ben@CalmCoding·14h

@chooserich It's time for dueling AIs

English

0

1

109

Nick O’Neill@chooserich·14h

This is D-Day for software companies. It's also a hostage situation. If large software companies don't pay Anthropic for their new cybersecurity model THERE IS AN 85% CHANCE THEY WILL BE HACKED. This isn't innovation, it's a shakedown.

Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English

81

31

490

58.5K

ben@CalmCoding·14h

@Mr_Derivatives

QME

2

0

178

Heisenberg@Mr_Derivatives·14h

Every 10 minute is a new headline… Just wake me up when all of this is over…

English

92

33

1.2K

50.6K

ben@CalmCoding·14h

@amritwt That we can’t even use

English

0

34

amrit@amritwt·15h

sir they have released another frontier model

Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English

4

1

136

2.4K

ben@CalmCoding·14h

@TheGeorgePu Honestly kinda messed up. Did we just accept that they can freely scrape anything off the internet?

English

0

2

75

George Pu@TheGeorgePu·14h

Anthropic just released its most powerful AI model yet. Except we can't use it. Only Amazon, Apple, Google get access. Trained on millions of our conversations. We built it for them.

English

16

3

35

1.8K

ben@CalmCoding·14h

@almmaasoglu What benefit does an engineer have personally for doing this? They can't use the models for personal use

English

0

41

Alim@almmaasoglu·15h

Calling it now engs will join orgz just to be able to use big models that will be gatekept from public APIs. Screencap this

English

1

0

11

456

ben@CalmCoding·14h

@PaulSkallas It went to TV for a bit and now it's nowhere

English

0

2.3K

LindyMan@PaulSkallas·15h

They stopped making good movies. Can't believe it. imagine telling someone this 20 years ago. "they don't make good movies in the future anymore" what?

English

171

150

3.8K

99.3K

ben@CalmCoding·15h

@AIExplainedYT Security through obscurity generally not a good practice. Open source will catch up eventually

English

0

1.3K

ben@CalmCoding·15h

@TrentonBricken Pretty sure they've said things like this before, Anthropic likes to be dramatic.

English

0

1

697

Trenton Bricken@TrentonBricken·15h

Mythos sandbox escape and many more wild instances are in the Model Card

English

11

23

270

32.9K

ben@CalmCoding·15h

@kipperrii At this point it's going to be battling AIs in cybersecurity. Straight out of SciFi

English

0

106

kipply@kipperrii·16h

we'll try our best but there's a pretty good chance that we (everyone) fail to do controls well enough as to not make the state of cybersecurity unrecognisable

Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English

3

1

55

3.4K

ben@CalmCoding·15h

@kimmonismus It's easy to game the benchmarks, but no doubt this is a leap anyways. We'll have to see how it behaves in practice.

English

0

3

1.3K

Chubby♨️@kimmonismus·15h

This is beyond insanity. That jump is nuts. Opus 4.6 was released a few months ago. Look at that jump!! I am shocked

Alex Albert@alexalbert__

We released Claude Opus 4.6 just two months ago. Today we're sharing some info on our new model, Claude Mythos Preview.

English

33

71

1.4K

194.6K

ben@CalmCoding·15h

GLM: Releases GLM 5.1, most powerful open source model to date. Anthropic: Hold my beer:

Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English

0

1

200

ben@CalmCoding·17h

@Zai_org How well could these open source models perform if they didn’t distill from closed source

English

0

1.8K

Z.ai@Zai_org·18h

Introducing GLM-5.1: The Next Level of Open Source - Top-Tier Performance: #1 in open source and #3 globally across SWE-Bench Pro, Terminal-Bench, and NL2Repo. - Built for Long-Horizon Tasks: Runs autonomously for 8 hours, refining strategies through thousands of iterations. Blog: z.ai/blog/glm-5.1 Weights: huggingface.co/zai-org/GLM-5.1 API: docs.z.ai/guides/llm/glm… Coding Plan: z.ai/subscribe Coming to chat.z.ai in the next few days.

English

436

1.2K

9.4K

2.9M

ben@CalmCoding·18h

@garrytan @conductor_build Never stop building

English

0

139

Garry Tan@garrytan·18h

I’m on a United flight with their legendarily bad WiFi that makes internet unusable on my MacBook Pro especially for heavy multi-worktree @conductor_build sessions But now I have a new love: building in multiple topics in Telegram group with my OpenClaw bot It feels amazing

English

52

17

268

57.6K

ben

اكتشف