Dango233

153 posts

@dango233max

Baking open AI system Garnishing open weights https://t.co/Y5yWy3Hn2K

Shenzhen · Joined November 2011
318 Following · 945 Followers
Dango233 @dango233max
@karminski3 All the text I use for recall testing comes from Sultan's Game...
Chinese
0
0
0
809
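karminski-牙医 @karminski3
Does LLM recall even need testing anymore? The author of Fiction.LiveBench just posted the latest test results on X, and the wave of large models released around Chinese New Year all do quite well on long-context recall. At 120K context length, the best is claude-opus-4.6 at 93.8%, followed by GLM-5 at 85.7%, Kimi-K2.5 at 78.1%, and Qwen3.5-plus at 76.2%. MiniMax-M2.5, however, scores 40.6%, and it already drops below 60% at 8K. It is not yet clear what the problem is. The new leaderboard for my own Hogwarts test has almost no reference value: every model's training corpus contains a great deal of the original Harry Potter text, and single-needle insertion recall now looks good across the board. Only complex recall tests like Fiction.LiveBench can still reveal model capability. #召回 #长上下文 #大模型测试 #KCORES大模型竞技场
Chinese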
14
4
70
16.6K
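Dango233 retweeted
virushuo @virushuo
We still believe multi-agent systems are essential, even though many companies think they are too hard to implement. I admit it has been somewhat harder than expected, but this is probably the most "different" multi-agent framework out there right now. In this video every node is an agent and there is no workflow: the agents are self-organizing, and being born, cooperating, attacking each other, and dying are all autonomous behaviors.
Intelligent Internet @ii_posts

Unstructured intelligence = chaos Most agent frameworks ship without a nervous system: deadlocks, context loss, vacuum hallucinations. We built Common Ground to fix this, agents coordinate on a shared protocol.

Chinese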
5
13
88
19.4K
Dango233 retweeted
Intelligent Internet @ii_posts
Unstructured intelligence = chaos Most agent frameworks ship without a nervous system: deadlocks, context loss, vacuum hallucinations. We built Common Ground to fix this, agents coordinate on a shared protocol.
English
25
48
447
535.4K
Dango233 retweeted
Tanishq Mathew Abraham, Ph.D. @iScienceLuvr
Chinese New Year is rapidly becoming the AI researcher's favorite holiday
English
41
59
1.4K
140.7K
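Dango233 retweeted
virushuo @virushuo
I took part in the Chinese translation. I hope it brings the discussion of economics and governance in the AI era to more Chinese readers, and suggestions on the translation or terminology are welcome. Although AI can already handle most translation tasks, the process still involves a great deal of human alignment work, especially where a concept differs widely between Chinese and English and the original author's tone and manner of expression must also be preserved. Overall it was a very interesting experience.
Intelligent Internet @ii_posts

Hello, friends in China! The Chinese edition of The Last Economy is now online and free to read on our website. “The Last Economy” by @EMostaque is now available in Chinese What language should we do next?

Chinese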
19
72
396
62.1K
Dango233 retweeted
Emad @EMostaque
Our state of the art open source general purpose agent hits V1. Feature equivalent to Replit / Manus / Genspark etc, to make websites to presentations and more connected to all your other tools. Readying open repo update in a week or two, give it a try and give feedback!
Intelligent Internet @ii_posts

II-Agent V1 is here. The AI agent built for real work is finally out of beta. Faster, smarter, and production-ready. It’s time to change how you build. 👇 Let’s see what’s new.

English
35
38
328
30.5K
Dango233 retweeted
Intelligent Internet @ii_posts
II-Agent V1 is here. The AI agent built for real work is finally out of beta. Faster, smarter, and production-ready. It’s time to change how you build. 👇 Let’s see what’s new.
English
22
49
216
131.4K
Dango233 @dango233max
@bboczeng Using "manifold" in an AI paper hardly counts as showing off, hahaha
Chinese
0
0
1
368
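勃勃OC @bboczeng
@dango233max Not really. DeepSeek's work probably doesn't care that much about citations anyway: the staff can't go abroad, can't become Distinguished Young Scholars or academicians, and money isn't an issue either. My blind guess is that the title was requested by DeepSeek CEO Liang Wenfeng himself, with only one purpose: showing off.
Chinese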
1
0
1
576
Dango233 retweeted
Intelligent Internet @ii_posts
Find research faster with II-Commons Search arXiv + PubMed (web app + Agent demo, API/MCP/A2A) Beta now live for II-Accounts
English
7
14
55
501.9K
Dango233 retweeted
Intelligent Internet @ii_posts
While building II’s open stack, we put together a Gemini-CLI → MCP+OpenAI bridge to access the tools + model in our tests. This lets any MCP-savvy agent tap Gemini + its tools via your gemini-cli instance!
English
6
21
80
32.4K
Dango233 retweeted
Intelligent Internet @ii_posts
II-Medical-8B-1706 is our latest state of the art open medical model 💡 Outperforms the latest @Google MedGemma 27b model with 70% fewer parameters 🤏 Quantised GGUF weights, works on <8 GB RAM 🚀 One more step to the universal health knowledge access that everyone deserves ⚕️
English
21
116
577
249.8K
Dango233 @dango233max
ZXX
0
0
3
423
Dango233 @dango233max
Veo3 is so much fun: “In the name of calories, I will destroy you!”
Chinese
2
4
28
6K
Dango233 retweeted
Intelligent Internet @ii_posts
II-Commons Infrastructure for shared knowledge. Transparent. Distributed. Open source. The foundation for trustworthy AI.
English
1
21
74
31.8K
Teknium (e/λ) @Teknium
I switched off claude in cursor to gemini btw
English
112
13
876
72.2K