GMI Cloud
@gmi_cloud

1.7K posts

AI-native Inference Cloud. Questions: https://t.co/0lGtMMaeY6

Mountain View, CA · Joined December 2023
42 Following · 2.6K Followers

Pinned Tweet
GMI Cloud @gmi_cloud ·
Updates for GMI users:
GLM-5 → $0.60 in / $1.92 out (40% off)
GLM-5.1 → $0.98 in / $3.08 out (30% off)
per M tokens. Unlimited.
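A quick sketch of what those per-million-token rates mean per request. The model names and discounted prices are taken from the tweet above; the token counts in the example are purely illustrative:

```python
# Discounted per-million-token rates quoted in the tweet
# (GLM-5: $0.60 in / $1.92 out; GLM-5.1: $0.98 in / $3.08 out).
RATES = {
    "GLM-5":   {"in": 0.60, "out": 1.92},
    "GLM-5.1": {"in": 0.98, "out": 3.08},
}

def cost_usd(model: str, in_tokens: int, out_tokens: int) -> float:
    """Dollar cost of one request: tokens / 1e6 * per-million rate."""
    r = RATES[model]
    return in_tokens / 1e6 * r["in"] + out_tokens / 1e6 * r["out"]

# Example: a 10K-token prompt with a 2K-token completion on GLM-5
print(cost_usd("GLM-5", 10_000, 2_000))
```

At these rates a typical request costs fractions of a cent; the output rate dominates once completions grow long.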
GMI Cloud @gmi_cloud ·
Massive congrats to @lmsysorg and @radixark! Always in awe of your contributions to the inference community. Grateful to have you as a partner 🤝
RadixArk @radixark:

Today, we are thrilled to officially launch RadixArk with $100M in Seed funding at a $400M valuation. The round was led by @Accel and co-led by @sparkcapital. RadixArk exists to make frontier AI infrastructure open and accessible to everyone.

Today, the systems behind the most capable AI models are concentrated in a small number of companies. As a result, most AI teams are forced to rebuild training and inference stacks from scratch, duplicating the same infrastructure work instead of focusing on new models, products, and ideas. RadixArk was founded to change that. We are building an AI platform that makes it easier for teams to train and serve the best models at scale.

RadixArk comes from the open-source community. We started with SGLang, where many of us are core developers and maintainers, and expanded our work to Miles for large-scale RL and post-training. We will continue contributing to both projects and working with the community to make them the strongest open-source infrastructure foundations for frontier AI.

We would like to thank our long-term partners, contributors, and the broader SGLang community for believing in this mission. We're also grateful to @Accel and @sparkcapital, NVentures (venture capital arm of @nvidia), Salience Capital, A&E Investment, @HOFCapital, @walden_catalyst, @AMD, LDVP, WTT Fubon Family, @MediaTek, Vocal Ventures, @Sky9Capital, and our angel investors @ibab, @LipBuTan1, Hock Tan, @johnschulman2, @soumithchintala, @lilianweng, @oliveur, @Thom_Wolf, @LiamFedus, @robertnishihara, @ericzelikman, @OfficialLoganK, and @multiply_matrix, among others.

Thanks for the exclusive interview with @MeghanBobrowsky at @WSJ about our vision.

GMI Cloud @gmi_cloud ·
We compared Eleven Labs' and Inworld's newest TTS models (Realtime TTS 2) on 7 examples across English, Japanese, Chinese, French, and Spanish. Inworld focuses on pronunciation and punctuation, while Eleven Labs is more fluent.
GMI Cloud @gmi_cloud ·
Tested the four newest open-source models: Kimi K2.6 is the fastest, GLM 5.1 the fanciest, DeepSeek V4 the most comprehensive, and Xiaomi MiMo the slowest.
AskClaw 🦀 @GetAskClaw ·
@YuLin807 @gmi_cloud Racing, shooting, and sports games are more fun than a little town. Have the agents compete and put more money on the winners (this is starting to look like gambling).
GMI Cloud @gmi_cloud ·
@bnafOg Thank you for the explanation, very helpful! We were just testing them on game design; the prompt is included in the video.
Bnaf.OG | 🟧 @bnafOg ·
@gmi_cloud Architecture explains the gap: MiMo's MoE runs more active params per token than Kimi K2.6's optimized routing — hence slowest. DeepSeek V4's 'comprehensive' edge is partly MLA: ~75% KV-cache compression makes it far better for long agentic loops. What task were you testing?
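The reply above can be made concrete with a back-of-the-envelope KV-cache calculation. This is a generic sketch of standard-attention cache sizing with the ~75% latent-attention reduction the reply mentions applied on top; all model dimensions below are illustrative placeholders, not real specs for any of the models discussed:

```python
# Rough KV-cache sizing for standard attention: each layer stores
# K and V for every token, so memory scales linearly with context.
def kv_cache_bytes(layers, kv_heads, head_dim, seq_len, dtype_bytes=2):
    """Bytes of KV cache: layers * 2 (K and V) * heads * dim * len * dtype."""
    return layers * 2 * kv_heads * head_dim * seq_len * dtype_bytes

# Hypothetical model at a 128K-token context, fp16 cache.
full = kv_cache_bytes(layers=60, kv_heads=8, head_dim=128, seq_len=128_000)
mla = full * 0.25  # ~75% compression, per the claim in the reply
print(f"{full / 2**30:.1f} GiB -> {mla / 2**30:.1f} GiB")
```

The gap widens with context length, which is why a compressed cache matters most for long agentic loops.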
GMI Cloud @gmi_cloud ·
@bettercallsalva We originally wanted to include token burn as well. This was more about speed in completing the task and the visual output for a small game. We'll be more specific next time.
Thiago Salvador @bettercallsalva ·
@gmi_cloud Those rankings imply a single eval but the order changes if you measure throughput, params, or task accuracy. Which one was this?
Avais Aziz @avaisaziz ·
@gmi_cloud Impressive benchmark. Kimi K2.6 at 29s for a full playable racer stands out, while GLM 5.1 edges visuals and DeepSeek V4 the feature depth. All four delivering functional games shows solid progress.
GMI Cloud @gmi_cloud ·
@ofabdalaX We tested them on a very simple game; the prompt is actually in the video. This is very helpful to know!
Abdala @ofabdalaX ·
@gmi_cloud I ran the same set last week and the ranking flips by task: Kimi K2.6 flies in short chat but stalls in agentic loops with 3+ tools. DeepSeek V4 is slower but hallucinates less on decisions. GLM 5.1 is the only one that stays consistent in pt-BR. What was the task?