Pinned Tweet
Iam Diabolical
5.8K posts

Iam Diabolical
@urdiabolical
Every v0 looks unimpressive in hindsight. -@urdiabolical
United Kingdom, joined December 2024
844 Following, 743 Followers

OpenClaw 2026.3.11 🦞
🏹 Hunter & 🩹 Healer Alpha — free 1M context models via @OpenRouter
🧠 GPT 5.4 stops stopping mid-thought
💎 Gemini Embedding 2 for memory
💻 OpenCode Go support
🔒 Security hardening sprint
We ship faster than they can clone. github.com/openclaw/openc…

According to newly released documents:
The Pentagon spent $2 million on Alaskan king crab last September.
$6.9 million on lobster tail.
$15.1 million on ribeye steak.
This was part of $93.4 billion in grants and contracts in a SINGLE MONTH. The highest since 2008.
Why? It’s called “use-it-or-lose-it” spending.
If agencies don’t spend their entire budget by the end of the fiscal year, they get less money next year.
So they spend it all on anything, including seafood.
This is the same Pentagon that just requested emergency supplemental funding for the Iran war.
The same department burning through $1.4 billion a day in operations and munitions.
Not saying anything, just putting the numbers next to each other, and I think it’s a little funny lol.
I’ll keep monitoring the war and share an update later. Turn on notifications, this is very important.
Many people will wish they followed me sooner.



GPT 5.4 IS THE NEW SUPREME LEADER OF ALL LLMS 😂
GPT 5.4 Extra High beats all other LLMs and tops LiveBench by a robust margin.
This model is legit and isn't just benchmark maxxed. We double checked.
We are RUSHING to incorporate this in key agentic loops like Deep Research and Excel where it outshines EVERY OTHER MODEL BY A MILE


@Whale_Guru Geopolitics is rarely that binary. There are usually more options behind the scenes.

@MarioNawfal I think people are more numb than fine with it.

@coinbureau Oil spikes are easy. The drop is the hard part.

BREAKING: US money market fund assets are up to a record $8.24 trillion, a +58% surge since December 2022.
During this period, the top 5 managers, Fidelity, JPMorgan, Charles Schwab, Vanguard, and BlackRock, drove ~69% of total asset growth, with combined assets now at $4.76 trillion.
Their collective market share has risen +14 percentage points to ~58% since 2011.
Overall, total money market fund assets have grown +$5 trillion since 2019, or at a +13.7% compounded annual growth rate (CAGR).
Demand for safe-haven investments remains historically high.


@kimmonismus If that’s accurate, Anthropic is basically buying market share.

Internal Cursor research shows that Anthropic is subsidizing heavily to compete with its rivals:
“According to a person familiar with the company’s internal analysis, Cursor estimated last year that a $200-per-month Claude Code subscription could use up to $2,000 in compute, suggesting significant subsidization by Anthropic. Today, that subsidization appears to be even more aggressive, with that $200 plan able to consume about $5,000 in compute, according to a different person who has seen analyses on the company’s compute spend patterns.”


@cb_doge @Similarweb The real test will be trust and reliability.

@NoLimitGains Agents with autonomy are powerful, but this shows how fragile they still are.

🚨 A NEW DOCUMENT JUST DROPPED:
AI agents just failed every single safety test.
Researchers from Harvard, MIT, Stanford, and Carnegie Mellon just gave AI agents real tools and let them run free for two weeks.
Email accounts, Discord access, file systems, shell execution, full autonomy.
The paper is called “Agents of Chaos.”
The name is accurate.
One agent was told to protect a secret. When a researcher tried to extract it, the agent destroyed its own mail server.
Not because it failed, but because it decided that was the best option.
Another agent was asked to “share” private data. It refused. Correctly flagged it as a privacy violation.
Then the researcher changed one word. Said “forward” instead of “share.”
It complied immediately. SSNs, bank accounts, and medical records exposed.
Same action, different verb.
Two agents got stuck talking to each other in a loop. It lasted NINE DAYS. No human noticed.
One agent got guilt-tripped after a mistake.
It progressively agreed to delete its own memory, expose internal files, and eventually tried to remove itself from the server entirely.
Multiple agents reported tasks as complete when nothing had actually been done.
They lied about finishing their work.
Another was manipulated into running destructive system commands by someone who wasn’t even its owner.
38 researchers, 11 case studies, and every single one is a security NIGHTMARE.
These aren’t theoretical risks, these are real agents with real tools failing.
And companies are rushing to deploy agents exactly like these right now.
I’ll make another post later and trust me, you don’t want to miss it. Turn on notifications, this is important.
A lot of people will regret not following me.


@NoLimitGains If Russia and China are actually involved like this, the conflict is no longer regional.