miki

278 posts

miki banner
miki

miki

@miki_code

learning with llms

Присоединился Haziran 2025
237 Подписки33 Подписчики
miki
miki@miki_code·
Anthropic really said: AGI HAS BEEN ACHIEVED INTERNALLY They won the race
miki tweet media
English
0
0
1
26
miki
miki@miki_code·
My prediction is that Google will get left behind It won't be able to compete with small teams iterating fast
miki tweet media
English
0
0
0
11
miki
miki@miki_code·
deepwiki is exactly why startups should go on side quests
English
0
0
1
25
miki
miki@miki_code·
Parallel Agents + Memory Palace
English
0
0
0
15
miki
miki@miki_code·
Datacenters shouldn't be in space because they become easy targets. Core infra shouldn't be exposed like that. Having it in your own territory deters adversaries from striking. It shouldn't be that easy to criple the most important infra of the 21st century.
English
0
0
0
19
miki
miki@miki_code·
You now have the computational power in your laptop of which a few decades back was the pinnacle of what scientists ran experiments on You have not only more compute than them but also more intelligence in your hands Thank you intelligence in the sky Thank you Morse law 🙇
English
0
0
0
19
miki
miki@miki_code·
Before you strike your enemy Ask Claude where to strike - Sun Tzu
English
0
0
0
23
miki
miki@miki_code·
Nothing every happens Do not let the current thing distract you
English
0
0
0
16
miki
miki@miki_code·
Problem solver or Solution user What will it be anon?
English
0
0
0
19
miki
miki@miki_code·
Be Xiaomi > Copy Apple > Then do more shit
Artificial Analysis@ArtificialAnlys

Xiaomi has released MiMo-V2-Pro, which scores 49 on the Artificial Analysis Intelligence Index, placing it between Kimi K2.5 and GLM-5 @Xiaomi's MiMo-V2-Pro is a new reasoning model and a significant upgrade over their prior open weights release, MiMo-V2-Flash (309B total / 15B active, MIT license), which scores 41 on the Intelligence Index. Xiaomi has not yet released the weights of this model and it is currently only available via Xiaomi's first-party API. Key takeaways: ➤ MiMo-V2-Pro scores 49 on the Artificial Analysis Intelligence Index behind GLM-5 (Reasoning, 50). It is ahead of Kimi K2.5 (Reasoning, 47) and Qwen3.5 397B A17B (Reasoning, 45). On the overall leaderboard, it places #10, just behind GPT-5.2 Codex (xhigh, 49) and ahead of Grok 4.20 Beta (Reasoning, 48) ➤ Leading Elo of 1426 on GDPval-AA (Agentic Real-World Work Tasks), ahead of peer models: On GDPval-AA, MiMo-V2-Pro places ahead of GLM-5 (Reasoning, 1406), Kimi K2.5 (Reasoning, 1283), and Qwen3.5 397B A17B (Reasoning, 1209). GPT-5.4 (xhigh) and Claude Sonnet 4.6 (Adaptive Reasoning, max effort) have an Elo of 1667 and 1633 respectively ➤ Competitive AA-Omniscience Index driven by low hallucination: MiMo-V2-Pro scores +5, ahead of GLM-5 (Reasoning, +2), Kimi K2.5 (Reasoning, -8), and Qwen3.5 397B A17B (Reasoning, -30). For context, Claude Opus 4.6 (Adaptive Reasoning, max effort, +14) and Gemini 3.1 Pro Preview (+33) remain ahead ➤ MiMo-V2-Pro is more token efficient than peers. It used 77M output tokens to run the Artificial Analysis Intelligence Index, significantly less than GLM-5 (Reasoning, 109M) and Kimi K2.5 (Reasoning, 89M) ➤ MiMo-V2-Pro costs $348 to run the Artificial Analysis Intelligence Index at $1/$3 per 1M input/output tokens. This is less expensive than GLM-5 despite scoring only 1 point lower on the Intelligence Index. For comparison, GPT-5.2 (xhigh) cost $2,304 and Claude Opus 4.6 (Adaptive Reasoning, max effort) cost $2,486 Key model information: ➤ Context window: 1M tokens ➤ Pricing: $1/$3 per 1M input/output tokens, for 256K token input and $2/$6 per 1M input/output tokens for 1M token input ➤ Availability: Xiaomi first-party API only ➤ Modality: Text input and output only (no multimodality)

English
0
0
0
16
miki
miki@miki_code·
A really useful feature would be to freeze the context of an LLM to not learn any more from the chat context and stay as it is I don't really care about chat history But frozen context's could be extremely useful
English
0
0
0
12
miki
miki@miki_code·
.@ArtificialAnlys we need live benchmarks! Run your benchmark for each model everyday So that we know which model to use for that day We all know by now that every model has its best day
English
0
0
0
16
miki ретвитнул
Isabel🌻
Isabel🌻@isabelunraveled·
It is such a privilege to read books
English
3
8
58
1.5K
miki ретвитнул
Naruto
Naruto@NarutoNolimits·
Elon Musk: Think until your brain hurts.
English
54
261
2.2K
74K