SK
7.6K posts

SK
@Samking207
technology, gaming and Music
Maryland, USA Katılım Aralık 2020
1.7K Takip Edilen339 Takipçiler

@astraiaintel That’s a old data, Chinas GDP is up to $20 trillion as of today
English

高市首相「世界中に平和と繁栄もたらせるのはドナルドだけ」
news.web.nhk/newsweb/na/na-… #nhk_news
日本語

The U.S. request to delay a summit between President Trump and Chinese leader Xi Jinping served as a reminder that Washington still drives the global agenda—not China on.wsj.com/3NC6Opg
English

@JChengWSJ @Lingling_Wei Man true journalism is dead, all we get now is clickbait trash from the likes of this man smh
English

China Hoped Trump Summit Would Cement Its Superpower Status. Now Xi Has to Wait. The president’s postponement of planned meeting signals that the U.S.—not Beijing—still sets the global agenda
@Lingling_Wei
wsj.com/world/china/ch…
wsj.com/world/china/ch…
English

Exclusive: The Pentagon asked the White House to approve a more than $200 billion request to Congress to fund the war in Iran, according to an administration official, a new ask that will likely run into resistance from lawmakers opposed to the conflict.
wapo.st/4bt8UQk
English

White House officials are bracing for a dramatic rupture between Donald Trump and Benjamin Netanyahu.
thedailybeast.com/president-dona…
English

@_LuoFuli @airesearch12 No wonder DeepSeek V4 is late, you left DeepSeek 😩, but congrats on MiMo-V2-Pro, it’s amazing
English

MiMo-V2-Pro & Omni & TTS is out. Our first full-stack model family built truly for the Agent era.
I call this a quiet ambush — not because we planned it, but because the shift from Chat to Agent paradigm happened so fast, even we barely believed it. Somewhere in between was a process that was thrilling, painful, and fascinating all at once.
The 1T base model started training months ago. The original goal was long-context reasoning efficiency. Hybrid Attention carries real innovation, without overreaching — and it turns out to be exactly the right foundation for the Agent era. 1M context window. MTP inference for ultra-low latency and cost. These architectural decisions weren't trendy. They were a structural advantage we built before we needed it.
What changed everything was experiencing a complex agentic scaffold — what I'd call orchestrated Context — for the first time. I was shocked on day one. I tried to convince the team to use it. That didn't work. So I gave a hard mandate: anyone on MiMo Team with fewer than 100 conversations tomorrow can quit. It worked. Once the team's imagination was ignited by what agentic systems could do, that imagination converted directly into research velocity.
People ask why we move so fast. I saw it firsthand building DeepSeek R1. My honest summary:
— Backbone and Infra research has long cycles. You need strategic conviction a year before it pays off.
— Posttrain agility is a different muscle: product intuition driving evaluation, iteration cycles compressed, paradigm shifts caught early.
— And the constant: curiosity, sharp technical instinct, decisive execution, full commitment — and something that's easy to underestimate: a genuine love for the world you're building for.
We will open-source — when the models are stable enough to deserve it.
From Beijing, very late, not quite awake.
English


@bxieus @XVanFleet This is true everywhere , here in the USA too, not a China specific issue
English

The Hunter Alpha stealth model is now ranked #1:

OpenRouter@OpenRouter
The Hunter Alpha stealth model is now in the top 10 weekly:
English




















