DaE73

108.9K posts

DaE73

@e73_da

Medical Student 🩺🥼⚕️For the Working Class.

Illinois, USA 가입일 Şubat 2020

640 팔로잉497 팔로워

고정된 트윗

DaE73@e73_da·4d

K2.6 + hermes = 4 hr session setting up qwen 3.6 training regime on dgx spark with the current autnomous session currently lasting 70+ min without any prompting. Kimi with hermes is next level. @NousResearch @Teknium

Nous Research@NousResearch

Kimi K2.6 is now available in Hermes Agent. Simply run `hermes update` and use `hermes model` to select a compatible provider hosting the model!

English

315

41K

DaE73 리트윗함

China pulse 🇨🇳@Eng_china5·17h

China begins gradually phasing out Western aircraft. China Southern Airlines is offering its entire fleet of Boeing 787-8 aircraft — 10 planes and two engines — for sale at auction in Shanghai for $550 million. All the aircraft were manufactured between 2014 and 2015, and they will be replaced with domestically produced Chinese aircraft.

English

184

919

4.2K

326.1K

DaE73 리트윗함

New Direction AFRICA@Its_ereko·15h

🚨🇲🇽🇧🇷 BREAKING: MEXICO AND BRAZIL ARE BUILDING THEIR OWN ENERGY ALLIANCE. Pemex and Petrobras. No Washington. No empire. Just Latin America deciding its own future. Sheinbaum is not picking sides. She is building her own table. That is sovereignty. That is power. Watch closely.

English

846

4.3K

48.4K

DaE73 리트윗함

Fiorella Isabel@FiorellaIsabelM·20h

This is all fake news and supposition. The elections were a disaster but there’s 0 evidence it was done by the left in Peru which has 0 power in Lima, where this fraud happened. They have few in congress—remember Pedro Castillo is in jail for bogus charges because he threatened to change the constitution that’s handicapped Peru’s congress’s with too much power, so no President can ever govern. He also wanted to nationalize the lithium. Ten presidents in ten years. Keiko runs the country already, something I’ve also been told by the people of this country—all with different political opinions but all in agreement this is the case. The fraud likely came from within Keiko mafia based on who has the power in these institutions (her people) which an overwhelming majority of Peruvians I spoke to reiterate. This to secure a US puppet in the next elections a Milei or neoliberal figure under Washington’s control. The government just overruled the current president, to accept F-16s for billions. At this rate Peru will be a neo-colony of Washington and next in line to the “taking on the narco cartels” playbook that Venezuela and Ecuador went through. Washington aims to destabilize the remaining LATAM countries, take their resources especially lithium and create a Zionist technate ruled by supremacy and surveillance.

Global News 24@PKN2023

🇵🇪 Escándalo en Perú Piero Corvetto detenido por mega fraude electoral. Cargos de sabotaje y conspiración lo llevarían 30 años a prisión. Impidió el voto de más de 1 millón de limeños para favorecer al candidato de izquierdas @RobertoSanchP. Game Over.

English

104

2.4K

DaE73 리트윗함

Nick Cruse 🥋@SocialistMMA·13h

U.S. Department of Defense press release ADMITS that they went to war with Iran on behalf of Israel Liberals are the biggest dipshits in the world for thinking this was about “human rights” or “Iranian democracy”

English

109

283

3.9K

DaE73 리트윗함

Li Zexin 李泽欣@XH_Lee23·1d

A Chinese visitor shared what he saw at the Australian auto show: "Every booth of Chinese car is packed with visitors." "Ford and Volkswagen, the traditionals, saw sparse crowds." Chinese EVs are now the global favorites.

English

225

1.4K

42.8K

DaE73 리트윗함

Daily News Iran@DailyNewsIran·1d

More and More Videos Are Coming of Weather Engineering They Spray Chemical While We Are Sleeping

English

152

4.3K

14.3K

190K

DaE73 리트윗함

Brian Berletic@BrianJBerletic·8h

🇺🇸🇨🇳 Top US military officials admit energy dominance is central to US geopolitical strategy including using "geography" (straits) to "impose costs" (blockade) "competitors" (China). The full hearing is here: armed-services.senate.gov/hearings/to-re…

English

364

777

18.3K

DaE73 리트윗함

CG@cgtwts·22h

> be chinese ai labs > while claude and openai are in cold war > kimi dropped k2.6 using deepseek's v3 architecture > the same week deepseek drops v4 using kimi's muon optimizer > 1.6 trillion parameters & 1M context > both match or beat closed models on benchmarks while being 8x cheaper > both build on each other's breakthroughs > keep shipping frontier LLMs with far less or nerfed NVIDA GPUs > and keep them 100% open sourced the real battle is not between models, it's open source vs closed.

DeepSeek@deepseek_ai

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length. 🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models. 🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice. Try it now at chat.deepseek.com via Expert Mode / Instant Mode. API is updated & available today! 📄 Tech Report: huggingface.co/deepseek-ai/De… 🤗 Open Weights: huggingface.co/collections/de… 1/n

English

289

3.6K

192.4K

DaE73 리트윗함

Carolina Lion@CarolinaLion2·15h

Oil to $300.

Disclose.tv@disclosetv

JUST IN - Trump admin places economic sanctions on a major China based oil refinery and approx 40 companies and tankers involved with shipping Iranian oil — AP

English

127

3.4K

DaE73 리트윗함

Seyed Mohammad Marandi@s_m_marandi·16h

The Islamic Republic of Iran knows what the Epstein Coalition plans to do and it is ready for it.

English

251

2.3K

11.4K

165.4K

DaE73 리트윗함

Chubby♨️@kimmonismus·1d

Even DeepSeek is now making fun of Anthropic's Claude.

English

123

1.4K

54.4K

DaE73 리트윗함

Daily News Iran@DailyNewsIran·1d

World Has Start To Realise That “These Plates Are For Weather Engineering “ Weather in Iran And Iraq Is Biggest Example. Follow Us For Updates.

English

2.9K

9.7K

108.1K

DaE73 리트윗함

Zephyr@zephyr_z9·1d

Whale is always extremely good at cracking the economics

GDP@bookwormengr

DeepSeek V4 hits it out of the park and addresses HBM shortage: DeepSeek proves why it is such a fundamental research lab. In addition to exceeding Opus 4.6 on Terminal Bench and virtually matching on other performance metrics, the most notable advancement is this statement: "In the 1M-token context setting, DeepSeek-V4-Pro requires only 27% of single-token inference FLOPs and 10% of KV cache compared with DeepSeek-V3.2" To understand significance of this point, consider below diagram that shows memory layout for Prefill and Decode nodes. If you implement Decode with Data and Expert parallelism (DEP16) with 16 GPUs on GB200 or GB300 NVL72 rack with DeepSeek v3.2, you are left with 104GB or 176 GB HBRAM per GPU respectively. Here we are assuming MoE parameters are in NVFP4. The remaining HBRAM per GPU dictates how large batch size you can have for inference, which determines how many concurrent request you can serve. Consider GB300 with 176GB left: 1. For 128K context, you need 4.45 GB HBRam for KV Cache, and you can serve only 36 concurrent requests. 2. For 256K context, you need 8.90 GB HBRam for KV Cache, and you can serve only 18 concurrent requests. 3. For 512K context, you need 17.80 GB HBRam for KV Cache, and you can serve only 9 concurrent requests. 4. For 1M context, you need 35.60 GB HBRam for KV Cache, and you can serve only 4 concurrent requests. You see the point. Now you imagine, you actually required 10 times less KV cache somehow at 1M! It basically enables you to server 10 times more requests with same resources. Recall Decode is memory bound and not compute bound, unlike Prefill. This is probably the most important contribution of DeepSeek V4. @teortaxesTex @jukan05 @zephyr_z9

English

183

18.1K

DaE73 리트윗함

Sarah@DDGSarah·18h

It’s far past time to cancel our blue checks and stop giving money to this shitty platform. I apologize for my own lateness.

English

441

7.9K

DaE73 리트윗함

Jen Zhu@jenzhuscott·12h

Kimi k2.6 used DeepSeek’s v3 architecture. DeepSeek v4 used kimi's muon optimiser. 1.6 trillion parameters & 1M context - both match or beat closed models on benchmarks while being 8x cheaper. Both build on each other's breakthroughs. Both keep shipping frontier LLMs w far less & nerfed NVIDA GPUs. China’s full domestic stack/ecosystem chip-model-cloud is now on full speed. Debate on distillation all you want. The dragon 🐉 is fully awake & good luck stopping it.

English

109

801

43.9K

DaE73 리트윗함

DD Geopolitics@DD_Geopolitics·17h

If you've noticed a lot less posts from us, we've decided that given the new algorithm changes, posting on this site has become a very fruitless endeavor. Hopefully this changes, but until then we will go touch grass in what's left of this beautiful world before the Trump regime decided to destroy it.

English

240

1.8K

50.1K

DaE73 리트윗함

Arnaud Bertrand@RnaudBertrand·1d

Wow, this is huge, after months of speculation and the U.S. running a massive pre-emptive discreditation campaign (x.com/RnaudBertrand/…), DeepSeek-V4 is finally out! I haven't studied it in depth but here are the most striking aspects as far as I can tell: - Fully open sourced with open weights (available for download on huggingface: huggingface.co/deepseek-ai) - Zero CUDA dependency anywhere in its stack, which is probably the biggest deal of all. For those who don't know, CUDA is Nvidia's software layer - the foundation nearly every frontier AI model in the world is built on. Except, as of today, DeepSeek V4, which can run entirely on Huawei Ascend chips via Huawei's CANN framework (finance.yahoo.com/sectors/techno…). Very concretely it means that China now not only has its own frontier AI models, but its own domestic AI stack, top to bottom. - The prices are insanely low. V4-Pro is roughly 3x cheaper than GPT-5.5 on input and 8.6x cheaper on output. And V4-Flash is an order of magnitude cheaper still, at $0.14/$0.28 per million tokens vs OpenAI's $5/$30 - so 30-100x cheaper than GPT-5.5 (!). And remember, these are the prices DeepSeek charges on its own API - anyone can download the weights and run them for "free" on their own server. - It is at or near the frontier on most benchmarks that matter. V4-Pro-Max matches or beats GPT-5.4 and Claude Opus 4.6 on competitive programming (Codeforces rating 3206), coding (LiveCodeBench 93.5), and math (HMMT 95.2, IMO AnswerBench 89.8). It trails the very newest GPT-5.5 and Opus 4.7 on a handful of the hardest agentic and knowledge benchmarks, but it's in the same league. In effect the value proposition is: "Same league as frontier US AI, at a fraction of the price, open-source and freely modifiable, and hardware-agnostic - you can run it on whatever infrastructure you choose." Which is insanely good. I now understand the need for a preemptive discreditation campaign: they had every reason to be worried. For the vast majority of use cases, you'd have to be a literal idiot to keep paying OpenAI or Anthropic's prices when this exists.

DeepSeek@deepseek_ai

English

781

3.2K

257.1K

DaE73 리트윗함

Dr Singularity@Dr_Singularity·23h

Liang Wenfeng, DeepSeek CEO "The whole world should use a 1.6T model for free." AGI will be open source and almost free

Arjun@arjunkocher

“The whole world should use a 1.6T model for free.” — Liang Wenfeng

English

753

22.8K

DaE73 리트윗함

Nick Cruse 🥋@SocialistMMA·21h

Spain may be suspended from NATO following their objection towards the war against Iran This is the organization that all “progressive Democrats” support NATO exists to project United States power on the world stage and defend their imperialism and wars

English

304

3.2K

DaE73 리트윗함

Zephyr@zephyr_z9·1d

This tells me that V4 Pro require more post training work They will stretch its capability to Mythos++ in the coming months in 1 or 2 iterations Buckle up

NomoreID@Hangsiin

Codeforces 3052 SWE-bench Pro 52.6 HLE 45.1 long-context > Gemini 3.1 Pro The API pricing for this model is: input: (Cache hit) $0.028 / (Cache miss) $0.14 output: $0.28. It supports up to 1M tokens, with no additional charges based on context length. Unless they have told some horrific benchmarking lies, this is insane.

English

346

38.6K

탐색

@elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA @nikifrancismediavine @katyperry