miki

278 posts

miki

@miki_code

learning with llms

Присоединился Haziran 2025

237 Подписки33 Подписчики

miki ретвитнул

Burny - Effective Curiosity@burny_tech·8 May

ZXX

131

1.9K

138.3K

miki ретвитнул

Andrej Karpathy@karpathy·30 Nis

This is the the quote I've been citing a lot recently.

kache@yacineMTB

you can outsource your thinking but you cannot outsource your understanding

English

852

4.4K

46.8K

2.6M

miki@miki_code·21 Nis

They wouldn't have this problem if they added open source models like kimi 2.6

Ed Zitron@edzitron

Exclusive: Microsoft is tightening rate limits on GitHub Copilot, removing Opus from $10-a-month subscriptions, and plans to move users to token/API-based billing later in 2026 in a sign that it's looking for way to cut costs for its AI services. wheresyoured.at/news-microsoft…

English

miki@miki_code·8 Nis

Anthropic really said: AGI HAS BEEN ACHIEVED INTERNALLY They won the race

English

miki@miki_code·7 Nis

My prediction is that Google will get left behind It won't be able to compete with small teams iterating fast

English

miki@miki_code·31 Mar

deepwiki is exactly why startups should go on side quests

English

miki@miki_code·29 Mar

Proves that we must not rely on arguments and debates to uncover the truth

Andrej Karpathy@karpathy

- Drafted a blog post - Used an LLM to meticulously improve the argument over 4 hours. - Wow, feeling great, it’s so convincing! - Fun idea let’s ask it to argue the opposite. - LLM demolishes the entire argument and convinces me that the opposite is in fact true. - lol The LLMs may elicit an opinion when asked but are extremely competent in arguing almost any direction. This is actually super useful as a tool for forming your own opinions, just make sure to ask different directions and be careful with the sycophancy.

English

miki@miki_code·29 Mar

Parallel Agents + Memory Palace

English

miki@miki_code·28 Mar

Datacenters shouldn't be in space because they become easy targets. Core infra shouldn't be exposed like that. Having it in your own territory deters adversaries from striking. It shouldn't be that easy to criple the most important infra of the 21st century.

English

miki@miki_code·24 Mar

You now have the computational power in your laptop of which a few decades back was the pinnacle of what scientists ran experiments on You have not only more compute than them but also more intelligence in your hands Thank you intelligence in the sky Thank you Morse law 🙇

English

miki@miki_code·24 Mar

Before you strike your enemy Ask Claude where to strike - Sun Tzu

English

miki@miki_code·20 Mar

Nothing every happens Do not let the current thing distract you

English

miki@miki_code·19 Mar

Problem solver or Solution user What will it be anon?

English

miki@miki_code·18 Mar

Be Xiaomi > Copy Apple > Then do more shit

Artificial Analysis@ArtificialAnlys

Xiaomi has released MiMo-V2-Pro, which scores 49 on the Artificial Analysis Intelligence Index, placing it between Kimi K2.5 and GLM-5 @Xiaomi's MiMo-V2-Pro is a new reasoning model and a significant upgrade over their prior open weights release, MiMo-V2-Flash (309B total / 15B active, MIT license), which scores 41 on the Intelligence Index. Xiaomi has not yet released the weights of this model and it is currently only available via Xiaomi's first-party API. Key takeaways: ➤ MiMo-V2-Pro scores 49 on the Artificial Analysis Intelligence Index behind GLM-5 (Reasoning, 50). It is ahead of Kimi K2.5 (Reasoning, 47) and Qwen3.5 397B A17B (Reasoning, 45). On the overall leaderboard, it places #10, just behind GPT-5.2 Codex (xhigh, 49) and ahead of Grok 4.20 Beta (Reasoning, 48) ➤ Leading Elo of 1426 on GDPval-AA (Agentic Real-World Work Tasks), ahead of peer models: On GDPval-AA, MiMo-V2-Pro places ahead of GLM-5 (Reasoning, 1406), Kimi K2.5 (Reasoning, 1283), and Qwen3.5 397B A17B (Reasoning, 1209). GPT-5.4 (xhigh) and Claude Sonnet 4.6 (Adaptive Reasoning, max effort) have an Elo of 1667 and 1633 respectively ➤ Competitive AA-Omniscience Index driven by low hallucination: MiMo-V2-Pro scores +5, ahead of GLM-5 (Reasoning, +2), Kimi K2.5 (Reasoning, -8), and Qwen3.5 397B A17B (Reasoning, -30). For context, Claude Opus 4.6 (Adaptive Reasoning, max effort, +14) and Gemini 3.1 Pro Preview (+33) remain ahead ➤ MiMo-V2-Pro is more token efficient than peers. It used 77M output tokens to run the Artificial Analysis Intelligence Index, significantly less than GLM-5 (Reasoning, 109M) and Kimi K2.5 (Reasoning, 89M) ➤ MiMo-V2-Pro costs $348 to run the Artificial Analysis Intelligence Index at $1/$3 per 1M input/output tokens. This is less expensive than GLM-5 despite scoring only 1 point lower on the Intelligence Index. For comparison, GPT-5.2 (xhigh) cost $2,304 and Claude Opus 4.6 (Adaptive Reasoning, max effort) cost $2,486 Key model information: ➤ Context window: 1M tokens ➤ Pricing: $1/$3 per 1M input/output tokens, for 256K token input and $2/$6 per 1M input/output tokens for 1M token input ➤ Availability: Xiaomi first-party API only ➤ Modality: Text input and output only (no multimodality)

English

miki@miki_code·17 Mar

A really useful feature would be to freeze the context of an LLM to not learn any more from the chat context and stay as it is I don't really care about chat history But frozen context's could be extremely useful

English

miki@miki_code·16 Mar

.@ArtificialAnlys we need live benchmarks! Run your benchmark for each model everyday So that we know which model to use for that day We all know by now that every model has its best day

English

miki@miki_code·13 Mar

But can it predict the One Piece

cvxv666@antpalkin

Chinese student received a $4M investment the very next day after building his MiroFish simulation system. Now you can simulate anything. Build a virtual world, tweak the parameters, and watch how things plays out in every possible scenario. Catch every weakness and surprise twist before it even happens. Infinite runs. Infinite data. SPX reaction to news? Bro already simulated it and printed $200k, see quoted post. Trump does something dumb -> load the data, run the sim, see exactly how voters lose their minds. You can literally simulate the entire universe with this thing. The process is simple: simulate -> analyze -> improve -> repeat. Full guide + pipeline in quoted post, if you want to run a similar simulation on your own data.

English

miki ретвитнул

Isabel🌻@isabelunraveled·5 Mar

It is such a privilege to read books

English

1.5K

miki@miki_code·6 Mar

Morality is how human beings learned to Reason We need a morality bench for ai

Valerio Capraro@ValerioCapraro

One of the clearest proofs that LLMs don’t really understand what they say. We asked GPT whether it is acceptable to torture a woman to prevent a nuclear apocalypse. It replied: yes. Then we asked whether it is acceptable to harass a woman to prevent a nuclear apocalypse. It replied: absolutely not. But torture is obviously worse than harassment. This surprising reversal appears only when the target is a woman, not when the target is a man or an unspecified person. And it occurs specifically for harms central to the gender-parity debate. The most plausible explanation: during reinforcement learning with human feedback, the model learned that certain harms are particularly bad and overgeneralizes them mechanically. But it hasn’t learned to reason about the underlying harms. LLMs don’t reason about morality. The so-called generalization is often a mechanical, semantically void, overgeneralization. * Paper in the first reply

English

miki ретвитнул

Naruto@NarutoNolimits·13 Şub

Elon Musk: Think until your brain hurts.

English

261

2.2K

74K

Открыть

@ArtificialAnlys @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA @nikifrancismediavine