big goose


Introducing Composer 2.5, our most powerful model yet. It's more intelligent, better at sustained work on long-running tasks, and more reliable at following complex instructions. For the next week, we’re doubling the included usage of the model.

Composer 2.5 is built on the same open-source base as Composer 2, Moonshot’s Kimi K2.5.












ERNIE 5.1 is here 🚀

ERNIE 5.1 significantly reduces pretraining cost while compressing total parameters to ~1/3 and activated parameters to ~1/2, using only ~6% of the pretraining cost compared to models at similar scale, while achieving leading performance in its class.

💡 Key highlights:

1/ Strong agentic performance approaching leading frontier models. ERNIE 5.1 surpasses DeepSeek-V4-Pro on both τ3-bench and SpreadsheetBench-Verified.

2/ Strong world knowledge and creative writing capabilities, with GPQA and MMLU-Pro performance approaching leading closed-source models, and creative writing ability nearing Gemini 3.1 Pro.

3/ Frontier-level reasoning performance. ERNIE 5.1 scores 99.6 on the challenging AIME26 benchmark with tools, second only to Gemini 3.1 Pro.

4/ Deep search capability. On May 9, ERNIE 5.1 ranked #4 globally and #1 among Chinese models on the Arena Search leaderboard with a score of 1223.

ERNIE 5.1 is now available on ERNIE and the Baidu AI Studio Model Playground:
👉 ernie.baidu.com
👉 aistudio.baidu.com
👉 ernie.baidu.com/blog





DeepSeek Reportedly Seeking to Raise Over RMB 50 Billion ($7.35 Billion), Accelerating Its Commercialization and Monetization Strategy

According to two people familiar with the matter, DeepSeek founder and CEO Liang Wenfeng plans to contribute the maximum allowable amount in the company’s first funding round. DeepSeek is targeting a fundraising size of up to RMB 50 billion, or approximately $7.35 billion, in this round. If completed, it could mark the largest single fundraising round in the history of Chinese AI companies.

The financing is also prompting DeepSeek to accelerate the implementation of its revenue-generation plans and push forward with commercialization and profitability. The people familiar with the matter said DeepSeek has recently told some investors that it plans to speed up the iteration and release cadence of its large language models to align with mainstream industry practices. One of the people said the company plans to launch V4.1, an updated version of its V4 model, in June.


gemini 3.1 flash-lite is here

it's our most cost-efficient model, optimized for high-volume agentic tasks, translation, and simple data processing



