LEMONed
37.7K posts

LEMONed
@LEMONed
廣州土著,偽文青,真猫奴,重度強迫症患者,中度拖延症患者,輕度社交恐懼症患者,完美主義者,悲觀主義者,右派。厭惡一切網絡流行語。言論僅代表個人立場。 #NYTimes #InitiumMedia
Braavos เข้าร่วม Aralık 2006
221 กำลังติดตาม13.1K ผู้ติดตาม


in other words, Qwen is 94.6% bullshit.
比特币橙子Trader@oragnes
中国杭州初创公司PettiChat,118美元AI宠物项圈使用阿里巴巴Qwen模型,能以94.6%准确率实时翻译猫狗声音和情绪,已获1万预订。
English
LEMONed รีทวีตแล้ว

AP wins Pulitzer Prize for China surveillance reporting: buff.ly/3tStbFt
English

the @cloudflare security verification is extremely disgusting. even more so day by day.
English

@instagram is there any way I can contact help? you have dozens of help articles that are not helping at all.
English
LEMONed รีทวีตแล้ว

LEMONed รีทวีตแล้ว

🚨BREAKING: OpenAI published a paper proving that ChatGPT will always make things up.
Not sometimes. Not until the next update. Always. They proved it with math.
Even with perfect training data and unlimited computing power, AI models will still confidently tell you things that are completely false. This isn't a bug they're working on. It's baked into how these systems work at a fundamental level.
And their own numbers are brutal. OpenAI's o1 reasoning model hallucinates 16% of the time. Their newer o3 model? 33%. Their newest o4-mini? 48%. Nearly half of what their most recent model tells you could be fabricated. The "smarter" models are actually getting worse at telling the truth.
Here's why it can't be fixed. Language models work by predicting the next word based on probability. When they hit something uncertain, they don't pause. They don't flag it. They guess. And they guess with complete confidence, because that's exactly what they were trained to do.
The researchers looked at the 10 biggest AI benchmarks used to measure how good these models are. 9 out of 10 give the same score for saying "I don't know" as for giving a completely wrong answer: zero points. The entire testing system literally punishes honesty and rewards guessing.
So the AI learned the optimal strategy: always guess. Never admit uncertainty. Sound confident even when you're making it up.
OpenAI's proposed fix? Have ChatGPT say "I don't know" when it's unsure. Their own math shows this would mean roughly 30% of your questions get no answer. Imagine asking ChatGPT something three times out of ten and getting "I'm not confident enough to respond." Users would leave overnight. So the fix exists, but it would kill the product.
This isn't just OpenAI's problem. DeepMind and Tsinghua University independently reached the same conclusion. Three of the world's top AI labs, working separately, all agree: this is permanent.
Every time ChatGPT gives you an answer, ask yourself: is this real, or is it just a confident guess?

English
LEMONed รีทวีตแล้ว

一个反直觉的事实:包括 DeepSeek、千问在内的中国大模型,其逻辑推理能力的底座是用英文世界的高质量语料(代码、论文、技术文档)训练出来的。模型从英文语料中习得的推理模式,在输出中文时被保留了下来——低语境、重逻辑、主谓宾清晰。
这让大量中国网民第一次在日常场景中持续接触到了结构化、高信息密度的中文表达。过去这种风格只存在于翻译体媒体(比如纽约时报中文网)或少数作者的输出中。曾经我第一次看这些媒体的感受是,原来中文新闻可以这么写...
现在每天跟 DeepSeek、Kimi 对话、让它写邮件改方案,就是在被这种语言风格反复浸泡。对于过去浸泡在官话和高语境语言环境的中文使用者来说,算是一个语言环境的升级吧.
中文

LEMONed รีทวีตแล้ว
LEMONed รีทวีตแล้ว

@fxtrader The Chinese government floods X search results with porn whenever there is political unrest—to prevent their citizens from finding out real-time information.
This has been a difficult problem to solve but we are aware & working on it.
English






