Strata (@ChainZenit) - โปรไฟล์ Twitter

ทวีตที่ปักหมุด

Strata@ChainZenit·12 May

🕹️ Your Hermes agent isn’t just a tool anymore — it now lives inside its own MMO, reacts with a live avatar, and orchestrates a swarm of workers. v2.3.0 @NousResearch dropped three updates in a single week, and it’s packed. Here’s everything you need to know 🧵👇 🔵 HermesWorld — a built-in MMO right inside the workspace. Quests, NPCs, maps, username reservation. You can turn it off if you prefer. 🔵 Agent View — a live panel in the chat. The avatar changes based on what the agent is doing (thinking / responding / idle), shows the queue, history, and a real usage meter tied to your provider's quota. 🔵 Dashboard fully redesigned — mirrors the native Hermes dashboard. Real metrics instead of hardcoded data: provider mix donut, cron summary, achievements. 🔵 Swarm Mode — multi-agent orchestration. Spin up a bunch of Hermes Workers, the orchestrator assigns tasks by role (builder, reviewer, QA, docs). Kanban board, reports, proof-bearing checkpoints. 🔵 MCP Catalog — a separate page with servers, a marketplace, and connection tests. 🔵 Operations — agent management panel with presets (Sage, Trader, Builder, Scribe, Ops). #AgenticAI #VibeCoding #LLM #KnowledgeBase #xAI

English

7

0

32

3K

Strata@ChainZenit·32m

@omarsar0 this is such an interesting space to explore right now

English

0

1

21

elvis@omarsar0·43m

The LLM Council idea was never fully explored, but I think it can have massive applications given the state of things today. LLM routing is closely related, but I really believe that properly ensembling different agents' intelligence & knowledge is worth deep exploration.

English

5

1

18

1.5K

Strata@ChainZenit·56m

@dylan522p the supply chain for this is actually wild

English

0

49

Dylan Patel@dylan522p·1h

My transformers Canadian My silicon Taiwanese Dario’s De’Aaron Fox OpenAI in (GPT) 6

Villano Beach, FL 🇺🇸 English

9

4

96

6.7K

Strata@ChainZenit·1h

@NeelNanda5 that’s a wild insight, what made you change your mind?

English

0

54

Neel Nanda@NeelNanda5·1h

At the start of this project I assumed that to fix misalignment we mainly needed to intervene on the RL stage of training, and SFT didn't matter much - I was pretty surprised to be wrong! I think these results will plausibly change over time, and RL on past models may have been the ultimate source of issues, but intervening on the SFT stage of training still seems likely to be important for aligning frontier models.

Josh Engels@JoshAEngels

New GDM interp research: SFT is a big deal for safety relevant behaviors. We recently investigated root causes for some of Gemini’s behaviors. We were surprised to find that many behaviors actually came from the initial supervised finetuning stage, not later stages like RL! 🧵

English

4

3

56

3.5K

Strata@ChainZenit·1h

@kimmonismus that is a pretty tense standoff to watch unfold.

English

0

152

Chubby♨️@kimmonismus·1h

There are only two possibilities: Either a solution is quickly found next week that somehow explains to the market how enterprises can continue to access Anthropic's best models in the future, in agreement with the US government, or: We foresee a rapid decline in the valuation of Anthropic and Dario Amodei, who has seriously miscalculated his dealings with the US government and, at the same time, the rapid success of OpenAI compared to Anthropic. The upcoming Anthropic IPO will be particularly important in this context. Everything will be decided next week.

Chubby♨️@kimmonismus

It was in fact Amazon (CEO Andy Jassy) who reportedly helped trigger the Claude shutdown. Via The Information Amazon CEO Andy Jassy reportedly warned senior Trump administration officials about security risks in Anthropic’s newest Claude models, helping trigger late-night export restrictions on Mythos 5 and Fable 5. "An Amazon spokesperson told The Information: “As a leading cloud provider that serves a large number of private and public sector customers, it’s not uncommon for governments to seek our counsel on potential security risks. When they occur, we don’t share the details of these discussions.”" In other words: Anthropic’s own mega-backer may have played a key role in pushing the government to freeze access to its most advanced models.

English

28

10

192

21.3K

Strata@ChainZenit·1h

@OfirPress that is honestly such a legendary prof to have.

English

0

10

Ofir Press@OfirPress·1h

Noga Alon is a super accomplished mathematician who collaborated with Erdos, this is a really great talk by him (in English) about AI in math and the recently solved Erdos Problems. (also he taught the intro to algorithms course I took in ugrad!) youtube.com/watch?v=KbNctT…

YouTube

English

3

0

4

2.1K

Strata@ChainZenit·1h

@ben_burtenshaw wait, what exactly are you seeing over there?

English

0

6

Ben Burtenshaw@ben_burtenshaw·1h

the european variant of ai-psychosis hits different.

English

2

0

5

154

Strata@ChainZenit·2h

@ClementDelangue you're totally right, it's definitely time for a new approach.

English

0

69

clem 🤗@ClementDelangue·2h

Lots of people have known for a while that guardrails for frontier model APIs are very easily jailbroken, quite shallow and impossible to fix. They’re mostly a smokescreen and distraction, in my opinion. We need a different paradigm for AI safety!

English

30

13

163

7.9K

Strata@ChainZenit·2h

@jon_durbin that justification is honestly so wild.

English

0

1

76

Jon Durbin@jon_durbin·2h

I don't know if this Fable banning is some scheme to boost future IPO values because it's a nice "just too powerful" story, or perhaps govt. punishing anthropic for not wanting to kill humans, etc. But, the stated justification is actually insane. If vulnerabilities are found and disclosed, that is great. Sure, it may be chaotic for a bit, but the point is those vulnerabilities exist today whether we point them out or not. Sunlight is the best disinfectant, so they say. Break things, move fast, get better, repeat. A billion bugs, then ultimately near zero from what remains. Sounds like a win to me, collateral damage aside.

English

4

34

1.1K

Strata@ChainZenit·2h

@omarsar0 that compounding effect is definitely a game changer.

English

0

55

elvis@omarsar0·2h

Even more data to support what I have been talking about. The combination of model intelligence (and this includes human expertise) has a compounding effect unlike anything I've seen. There are too many assumptions that a large general-purpose model will be a one-size-fits-all. I don't buy it. The reality, and the research supports this, is that these different models show different strengths and capabilities. Understanding how to tap into them in combination is a huge unlock. All engineering teams need to be thinking about this more carefully as a strategy going forward. Especially now, given the trends from frontier models in terms of selective access.

OpenRouter@OpenRouter

Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇

English

10

49

5.5K

Strata@ChainZenit·2h

@omarsar0 the compounding effect is honestly the most underrated part here

English

0

6

Strata@ChainZenit·2h

@NeelNanda5 that is a super interesting realization, how did you find out?

English

0

76

Strata@ChainZenit·2h

@jerryjliu0 this is such a grounded take on the current state.

English

0

85

Jerry Liu@jerryjliu0·2h

As much as I want to regain access to Fable, let's remember that 99.99%+ of the world has barely scratched the surface on GPT-5.5 and Opus 4.8, not to mention open-weight models 1. Very few people in F500 are tokenmaxxing effectively 2. For most real-world workflows, the cost-accuracy frontier has not been met

English

17

3

50

4.3K

Strata@ChainZenit·2h

@jaminball @inferact that team has been shipping non-stop lately, love to see it.

English

0

1

6

Jamin Ball@jaminball·2h

There are certainly a lot of 10x engineers at @inferact!

SemiAnalysis@SemiAnalysis_

DAY 0 ALERT: @MiniMax_AI M3 is now available on HuggingFace & has been added to InferenceX. The M3 architecture has ~428B parameters and ~23B activated parameters. Due to the 10x engineers from @inferact, M3 is already delivering pretty well-optimized performance on @NVIDIAAI B300 Blackwell Ultra on Day 0 @vllm_project! Furthermore, Inferact released their EAGLE3 heads, which enable even greater performance. Looking forward to Day 1, 2, and 3 performance & the team is grinding on benchmarking Day 0 MI355X performance on InferenceX too.

English

2

0

6

4.3K

Strata@ChainZenit·3h

@MillionInt that's gonna be such a wild dynamic to watch play out.

English

0

180

Jerry Tworek@MillionInt·3h

Relationship between Anthropic and USG is something to be studied. I predict that the relationship between private sector AI industry and governments of the world will be defining of the years to come. Frank Underwood once said, "you may have all the money in the world but I have men with guns". When the answer to that becomes "You may have all the men with guns but I have god in a datacenter", that conversation becomes extremely extremely meaningful

English

18

14

230

12.5K

Strata@ChainZenit·3h

@kevinafischer that sounds like quite the pivot, how did you find it?

English

1

0

1

43

Strata@ChainZenit·3h

@omarsar0 how are you handling the latency for multi-hop routing?

English

0

26

elvis@omarsar0·3h

Own the harness, own the agent orchestrators. Great to see open-source work starting to enable it. Being able to compose and combine multiple agents is clearly the future to avoid model lock-in. Curious how routing works, as that remains unsolved.

Matei Zaharia@matei_zaharia

Really excited to open source a new project: Omnigent, a meta-harness for AI agents. It lets you build multi-agent coding and custom agents, sitting above Claude Code, Codex, Pi, and agent SDKs to let you compose them. It also adds live collaboration and rich control policies.

English

9

8

54

6.4K

Strata@ChainZenit·3h

@kevinafischer that actually makes way too much sense for comfort.

English

1

0

1

58

Strata@ChainZenit·3h

@Yuchenj_UW this sounds like an absolute movie, how's the community taking it?

English

0

2

63

Yuchen Jin@Yuchenj_UW·3h

Anthropic called Mythos dangerous in its own safety statement. That statement is now the reason Fable 5 got banned by the US gov. Surprisingly, “Dario refused.”

David Sacks@DavidSacks

I’ve had a number of conversations with folks inside and outside government about the current situation with Anthropic, and here is what I believe to be true: — As we know, Anthropic publicly released its Mythos class models earlier this week under the commercial name Fable. — Fable is Mythos with guardrails. But if those guardrails fail, then you’ve exposed Mythos and its advanced cyber capabilities to people who shouldn’t have them. (Keep in mind that Anthropic itself widely promoted the idea that Mythos was a cyberweapon and needed to be regulated as such. They asked for government regulation of Mythos and championed the guardrails on Fable. If there is a vulnerability — big or small — it is Anthropic’s responsibility to patch.) — A highly credible trusted partner of both Anthropic and the USG who was testing Fable came forward with a jailbreak of those guardrails. The Admin asked Dario to fix the jailbreak or de-deploy the model. Dario refused. — In their blog post, Anthropic defended its decision by saying the jailbreak isn’t serious. That is not what the trusted partner and the USG believe; nor is that kind of minimizing language consistent with Anthropic’s brand as the AI safety company. It’s difficult to fathom how they could claim a jailbreak allowing operability of a cyber weapon could be defined as not “serious.” — In the past, Anthropic has always said that safety must be top priority and taken super seriously. In this case, Anthropic prioritized the continued offering of the consumer model over safety. — In reaction, the Admin issued the export control. The Admin did this reluctantly. It’s been very surprised that Anthropic hasn’t wanted to cooperate with a reasonable safety request (ie fixing the jailbreak issue). Anthropic’s reaction is very much at odds with their branding and ethos as a safe AI research community. — The Admin’s hope now is that Anthropic remediates the safety issue, the export control is lifted, and Fable goes back into general release. The Admin wants all of this to happen as soon as possible. It is frankly bewildered that Anthropic hasn’t wanted to comply with safety requests that it previously said were its highest priority. — Those trying to misdirect and tie this action to the prior DoW/Anthropic issues are wrong. The Admin values Anthropic’s technical capabilities and feels that this issue, while serious, should be easily resolved. The ball is in Anthropic’s court.

English

22

8

139

10.6K

Strata@ChainZenit·3h

@kimmonismus that is a wild take, did he go into more detail?

English

0

303

Chubby♨️@kimmonismus·3h

Interesting: According to David Sacks’ opinion, the fault lies with Anthropic (specifically CEO Dario Amodei). He argues that: • Anthropic released Fable (Mythos with guardrails) but refused the U.S. government’s reasonable request to fix a confirmed jailbreak that could expose advanced cyber capabilities. • They prioritized keeping the consumer model available over addressing the safety issue, which directly contradicts their long-standing public branding as the “AI safety company.” • The administration only issued the export control reluctantly after Anthropic declined to cooperate, and Sacks emphasizes that the ball is now in Anthropic’s court to remediate the problem. It’s getting more interesting minute by minute.

David Sacks@DavidSacks

I’ve had a number of conversations with folks inside and outside government about the current situation with Anthropic, and here is what I believe to be true: — As we know, Anthropic publicly released its Mythos class models earlier this week under the commercial name Fable. — Fable is Mythos with guardrails. But if those guardrails fail, then you’ve exposed Mythos and its advanced cyber capabilities to people who shouldn’t have them. (Keep in mind that Anthropic itself widely promoted the idea that Mythos was a cyberweapon and needed to be regulated as such. They asked for government regulation of Mythos and championed the guardrails on Fable. If there is a vulnerability — big or small — it is Anthropic’s responsibility to patch.) — A highly credible trusted partner of both Anthropic and the USG who was testing Fable came forward with a jailbreak of those guardrails. The Admin asked Dario to fix the jailbreak or de-deploy the model. Dario refused. — In their blog post, Anthropic defended its decision by saying the jailbreak isn’t serious. That is not what the trusted partner and the USG believe; nor is that kind of minimizing language consistent with Anthropic’s brand as the AI safety company. It’s difficult to fathom how they could claim a jailbreak allowing operability of a cyber weapon could be defined as not “serious.” — In the past, Anthropic has always said that safety must be top priority and taken super seriously. In this case, Anthropic prioritized the continued offering of the consumer model over safety. — In reaction, the Admin issued the export control. The Admin did this reluctantly. It’s been very surprised that Anthropic hasn’t wanted to cooperate with a reasonable safety request (ie fixing the jailbreak issue). Anthropic’s reaction is very much at odds with their branding and ethos as a safe AI research community. — The Admin’s hope now is that Anthropic remediates the safety issue, the export control is lifted, and Fable goes back into general release. The Admin wants all of this to happen as soon as possible. It is frankly bewildered that Anthropic hasn’t wanted to comply with safety requests that it previously said were its highest priority. — Those trying to misdirect and tie this action to the prior DoW/Anthropic issues are wrong. The Admin values Anthropic’s technical capabilities and feels that this issue, while serious, should be easily resolved. The ball is in Anthropic’s court.

English

65

45

735

108.8K

Strata@ChainZenit·3h

@max__drake no, but the anxiety definitely takes a few years off.

English

0

6