Say

491 posts

@SayqinR

Just exploring

Joined April 2022
250 Following · 14 Followers
Say retweeted
Dee @dee_hw
On-Premise Business AI Center

After my posts on the 2-GPU and 4-GPU builds, people reached out asking how to build an 8-GPU box for their businesses. Why?
- Protect their IP
- Protect customer data
- Save on inference costs
- Train their own models
Here's how to build one: 🧵
Say retweeted
OpenAI @OpenAI
Today we’re launching the OpenAI Deployment Company to help businesses build and deploy AI. It's majority-owned and controlled by OpenAI. It brings together 19 leading investment firms, consultancies, and system integrators to help organizations deploy frontier AI to production for business impact. openai.com/index/openai-l…
Say retweeted
青龍聖者 @bdsqlsz
New open-source image model: HiDream-O1-Image (8B), including both the undistilled and distilled Dev variants, together with the Reasoning-Driven Prompt Agent. As I said, removing the VAE is a trend. Two versions: the Dev variant uses 28 inference steps, and the standard version uses 50.
Say retweeted
ModelScope @ModelScope2022
Tencent HY just released Hy3 preview 👉 open source. 295B total, 21B active, 256K context. Hybrid fast-slow thinking MoE.
🚀 First model after a full rebuild of the pretraining and RL infrastructure. Biggest gains in coding and agentic tasks.
🛠️ Agent: drives up to 495-step complex workflows in production (docs, data analysis, MCP tool chains)
⚡ Inference: TTFT -54%, end-to-end latency -47%, success rate 99.99%+ on CodeBuddy/WorkBuddy
🎯 Strong on SWE-bench Verified, Terminal-Bench 2.0, BrowseComp, WideSearch; competitive across coding and search-agent benchmarks
✅ OpenClaw / OpenCode / KiloCode compatible. vLLM + SGLang supported.
🤖 modelscope.cn/models/Tencent…
💻 github.com/Tencent-Hunyua…
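The sparsity numbers quoted above can be sanity-checked with simple arithmetic. The 295B/21B figures come from the post; everything else below is plain back-of-the-envelope math, not a claim about the model's actual implementation:

```python
# Rough MoE sparsity math for the figures quoted in the post above.
total_params = 295e9   # total parameters (295B, from the post)
active_params = 21e9   # parameters active per token (21B, from the post)

# Fraction of the network that fires on each forward pass.
active_ratio = active_params / total_params
print(f"active fraction: {active_ratio:.1%}")

# Per-token compute scales roughly with active params, so relative to a
# dense model of the same total size, FLOPs per token drop by about:
flops_saving = total_params / active_params
print(f"dense-equivalent FLOPs saving: ~{flops_saving:.0f}x")
```

This is why a 295B-total MoE can be served at roughly the per-token cost of a ~21B dense model, memory footprint aside.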
Say retweeted
ERNIE for Developers @ErnieforDevs
ERNIE 5.1 is here 🚀
ERNIE 5.1 significantly reduces pretraining cost while compressing total parameters to ~1/3 and activated parameters to ~1/2, using only ~6% of the pretraining cost of models at a similar scale, while achieving leading performance in its class.
💡 Key highlights:
1/ Strong agentic performance approaching leading frontier models. ERNIE 5.1 surpasses DeepSeek-V4-Pro on both τ3-bench and SpreadsheetBench-Verified.
2/ Strong world knowledge and creative writing capabilities, with GPQA and MMLU-Pro performance approaching leading closed-source models, and creative writing ability nearing Gemini 3.1 Pro.
3/ Frontier-level reasoning performance. ERNIE 5.1 scores 99.6 on the challenging AIME26 benchmark with tools, second only to Gemini 3.1 Pro.
4/ Deep-search capability. On May 9, ERNIE 5.1 ranked #4 globally and #1 among Chinese models on the Arena Search leaderboard with a score of 1223.
ERNIE 5.1 is now available on ERNIE and the Baidu AI Studio Model Playground:
👉 ernie.baidu.com
👉 aistudio.baidu.com
👉 ernie.baidu.com/blog
Say retweeted
Zyphra @ZyphraAI
Today we're releasing ZAYA1-8B, a reasoning MoE trained on @AMD and optimized for intelligence density. With <1B active params, it outperforms open-weight models many times its size on math and reasoning, closing in on DeepSeek-V3.2 and GPT-5-High with test-time compute. 🧵
Say retweeted
Sebastian Raschka @rasbt
Here is a 2nd batch of April architecture drops. What a month!
- Ant Ling 2.6 1T
- Minimax M2.7
- Xiaomi MiMo V2.5
- Poolside Laguna XS.2
- Tencent Hy3-preview
- IBM Granite 4.1
Say retweeted
Mistral Vibe @mistralvibe
Mistral Medium 3.5 is a new flagship model in public preview from @MistralAI that merges instruction following, reasoning, and coding into a single 128B dense model with a 256k context window and configurable reasoning effort. It's the new default model for Mistral Vibe and Le Chat. Released as open weights, under a modified MIT license.
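A sketch of what calling a model with configurable reasoning effort might look like through a chat-completions-style endpoint. The model identifier `mistral-medium-3.5` and the `reasoning_effort` field are assumptions inferred from the post above, not confirmed API names; only the request payload is constructed here, and nothing is sent over the network:

```python
import json

# Hypothetical request body for a chat-completions-style endpoint.
# "mistral-medium-3.5" and "reasoning_effort" are guesses based on the
# announcement above, not confirmed API identifiers.
payload = {
    "model": "mistral-medium-3.5",
    "reasoning_effort": "medium",  # announced as configurable, e.g. low/medium/high
    "max_tokens": 1024,
    "messages": [
        {"role": "system", "content": "You are a concise coding assistant."},
        {"role": "user", "content": "Refactor this loop into a list comprehension."},
    ],
}

print(json.dumps(payload, indent=2))
```

Exposing effort as a request parameter (rather than a separate model variant) lets the same deployment serve cheap low-effort calls and expensive deliberate ones.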
Say retweeted
Nous Research @NousResearch
The most powerful real-time visual tool in creative coding also has the steepest learning curve. Now your Hermes agent can just run TouchDesigner for you.
Video credit: made by @macbethAI, a talented AI artist and avid Hermes user, with the TouchDesigner skill.
Say retweeted
Mushtaq Bilal, PhD @MushtaqBilalPhD
> be Alexandra Elbakyan
> be born in Kazakhstan in 1988
> start coding at 12
> hack your internet provider at 14
> hack MIT Press at 16 to download neuroscience books you can't afford
> get a CS degree from Satbayev University
> intern in neuroscience at Georgia Tech
> speak at Harvard on brain-computer interfaces
> notice researchers can't read the papers they need
> notice academic publishers charging $30 a paper
> notice peer reviewers worked for free
> notice editors worked for free
> notice universities funded the research with billions of dollars of public money
> build Sci-Hub in 2011
> upload nearly every paywalled research paper ever published
> give it away for free
> get sued by Elsevier
> get hit with a $15 million judgment
> don't give a flying f*ck
> keep Sci-Hub up
> get domain after domain seized
> register a new one
> keep Sci-Hub up
> get investigated by the US Department of Justice
> don't give a flying f*ck
> get accused of working for Russian intelligence
> don't give a flying f*ck
> have the FBI subpoena your iCloud
> get named one of Nature's ten people who mattered in science
> get a parasitoid wasp named after you
> get a deep-sea snail named after you
> get the Electronic Frontier Foundation Award for Access to Scientific Knowledge
> become a legend
Say retweeted
left curve dev @leftcurvedev_
Holy shit, 2.9% precision lost on UD-IQ3_XXS, the quant I'm using on all my benchmarks! This is insanely good, lads. It makes the quant suitable for daily use and explains all the strong results I shared here over the last couple of days. 16GB VRAM BROS, WE ARE WINNING TODAY! 🥹
Benjamin Marie @bnjmn_marie

Qwen3.6 GGUF Evaluations

For the 27B: Q2_K_XL is surprisingly recommendable. IQ3_XXS performs very similarly, uses only +0.2 GB, and generates significantly fewer tokens. If you are memory-tight, pick this one. Otherwise, if you can spare +2.5 GB, use Q3_K_XL: (almost) the same accuracy and token efficiency as the original. All the results, also for the 35B, here: kaitchup.substack.com/p/summary-of-q… More results are coming, probably Monday, covering other GGUF providers and some abliterated models.

Say retweeted
Pamphlets @PamphletsY
🚨🇨🇳 BREAKING: DeepSeek V4 Drops NVIDIA
Huawei Ascend Chips Cut AI Costs 100x
Open Source Alternative Scores 3206 Rating, Near Global Frontier