Bo An

2.9K posts

Bo An

Bo An

@bo__an

PhD@Yale; STS; history of computing, economics, AI, media & China; also make academic apps for fun.

Cambridge, MA Katılım Nisan 2012
2.8K Takip Edilen1.1K Takipçiler
Sabitlenmiş Tweet
Bo An
Bo An@bo__an·
I also made a simple landing page to list most of the client-side tools I've made and will be making: dh-tools.com Client-side means tools that are ready to use in the browser, mostly using WASM libraries, to save the user's privacy and my server hosting fees🥰
English
0
0
9
403
Bo An
Bo An@bo__an·
I try not to be disparaging cause I share a lot of passion for automation but this is little more than "this could have been 7 markdowns" in 2026. And it follows the numerous earlier attempts e.g. auto-deep-research by the HKU team and more recently Kosmos in its attempt to be domain-agnostic. The only difference is that it's published on Nature and there is less templating and let the lead agent steer by itself. But I think it's a deadend from the start. The only promising path is to be domain-centric, and deeply domian-centric. So ironically, less harnessing is actually harming its potential.
Sakana AI@SakanaAILabs

The AI Scientist: Towards Fully Automated AI Research, Now Published in Nature Nature: nature.com/articles/s4158… Blog: sakana.ai/ai-scientist-n… When we first introduced The AI Scientist, we shared an ambitious vision of an agent powered by foundation models capable of executing the entire machine learning research lifecycle. From inventing ideas and writing code to executing experiments and drafting the manuscript, the system demonstrated that end-to-end automation of the scientific process is possible. Soon after, we shared a historic update: the improved AI Scientist-v2 produced the first fully AI-generated paper to pass a rigorous human peer-review process. Today, we are happy to announce that “The AI Scientist: Towards Fully Automated AI Research,” our paper describing all of this work, along with fresh new insights, has been published in @Nature! This Nature publication consolidates these milestones and details the underlying foundation model orchestration. It also introduces our Automated Reviewer, which matches human review judgments and actually exceeds standard inter-human agreement. Crucially, by using this reviewer to grade papers generated by different foundation models, we discovered a clear scaling law of science. As the underlying foundation models improve, the quality of the generated scientific papers increases correspondingly. This implies that as compute costs decrease and model capabilities continue to exponentially increase, future versions of The AI Scientist will be substantially more capable. Building upon our previous open-source releases (github.com/SakanaAI/AI-Sc…), this open-access Nature publication comprehensively details our system's architecture, outlines several new scaling results, and discusses the promise and challenges of AI-generated science. This substantial milestone is the result of a close and fruitful collaboration between researchers at Sakana AI, the University of British Columbia (UBC) and the Vector Institute, and the University of Oxford. Congrats to the team! @_chris_lu_ @cong_ml @RobertTLange @_yutaroyamada @shengranhu @j_foerst @hardmaru @jeffclune

English
0
0
1
159
Bo An
Bo An@bo__an·
@jerryjliu0 does it output word-level bounding boxes which is needed for word-by-word correction for things like manuscripts?
English
1
0
0
75
Jerry Liu
Jerry Liu@jerryjliu0·
One of the biggest requirements for document OCR is visual grounding, and frontier models (gemini, opus, gpt-5.4) suck at it by default. In other words they don't have a great sense of the positions of things on a page. We've made massive strides in making sure our models are able to segment and detect every granular element in the most complex docs. This allows you to build AI agents that can surface extremely precise citations in the source documents: ✅ newspapers ✅ infographics ✅ handwritten notes ✅ product catalogs ✅ research presentations and much more Come check it out in LlamaParse! cloud.llamaindex.ai/?utm_source=xj…
Jerry Liu tweet mediaJerry Liu tweet media
LlamaIndex 🦙@llama_index

LlamaParse Agentic Plus mode now delivers precise visual grounding with bounding boxes for the most challenging document elements. Our latest update brings major improvements to how we handle complex visual content: 📐 Complex LaTex formulas - accurately parse mathematical expressions with precise positioning ✍️ Handwriting recognition - extract handwritten text with location coordinates 📊 Complex layouts - navigate multi-column documents and intricate formatting 📈 Infographics and charts - identify and extract data visualizations with spatial context This means you can now build applications that not only extract text from documents but also understand exactly where that content appears on the page - perfect for creating more intelligent document analysis workflows. Try LlamaParse Agentic Plus mode and see how visual grounding transforms your document parsing capabilities: cloud.llamaindex.ai/?utm_source=so…

English
16
28
198
20.9K
Bo An
Bo An@bo__an·
NPD在智能体时代将如鱼得水
中文
0
0
2
67
Bo An
Bo An@bo__an·
so what was Anthropic's thinking when it issued a cease & desist order to (then) OpenClawd to force a name change instead of hiring the guy and name it OpenClaude?
English
0
0
0
119
Bo An
Bo An@bo__an·
@DIYgod 5.4. 人狠话不多
中文
0
0
0
1.8K
DIŸgöd ☀️
DIŸgöd ☀️@DIYgod·
所以现在 codex 应该用 gpt-5.3-codex 还是 gpt-5.4
中文
63
0
62
47.8K
Bo An retweetledi
Junyang Lin
Junyang Lin@JustinLin610·
me stepping down. bye my beloved qwen.
English
1.7K
738
13.6K
6.5M
Bo An retweetledi
songkeys 🐿️🦋@song.work
🤯 美团花 20亿 做的 AI 浏览器,今天大买特买媒体稿。 然而其中沉浸式翻译功能是原封不动搬了 @mengxi_ream 的 Read Frog 开源代码(GPL协议)。 但没给赞助、没给 credits、没按协议开源代码。 有认识美团这个研发团队的朋友帮忙联系看看是故意的还是不小心吗?
songkeys 🐿️🦋@song.work tweet media
梦溪睡了吗@mengxi_ream

美团这么大公司,抄我的代码给自家 AI 浏览器用,还不给我赞助啊😂 几千亿市值的公司,白嫖嫖到我头上了,shame on you 左:美团 ai 浏览器,右:我的产品 根据我的 GPL 协议,任何使用我代码的产品都要开源,请问美团你的这个新 ai 产品开源了吗?法律意识呢? 更多截图在评论。

中文
42
41
447
105K
Bo An retweetledi
Chinese Text Project
Chinese Text Project@chinesetextproj·
AI-generated translations have been added to ctext.org - including the complete pre-Qin and Han corpus, the 25 dynastic histories, and hundreds of other works. Translations will be added for other texts on an ongoing basis. ctext.org/instructions/t…
Chinese Text Project tweet mediaChinese Text Project tweet media
English
14
54
303
74K
Bo An retweetledi
PsiACE
PsiACE@repsiace·
一篇新的文章,希望可以帮大家了解从 coding 到生活,从 coding agent 到 openclaw 或者 bub ,背后的问题、范式和模型的一些变化 链接在评论区
PsiACE tweet media
中文
8
18
115
14.1K
Bo An retweetledi
Arnaud Bertrand
Arnaud Bertrand@RnaudBertrand·
I have a website about Traditional Chinese Medicine that I spent literal years building. When I asked questions to Claude about the topic, it parroted almost word-for-word what I myself wrote. So please spare us the gaslighting about training AI on others' work...
Anthropic@AnthropicAI

We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax. These labs created over 24,000 fraudulent accounts and generated over 16 million exchanges with Claude, extracting its capabilities to train and improve their own models.

English
172
2.7K
28.1K
736.8K
Bo An
Bo An@bo__an·
过去两个月我慢慢摸索出来一种比较稳定的基于tmux的coding cli的协作方式,非常vanilla,也非常稳定,也不用任何半吊子的“总控制台”,不需要特别适配,Claude Code, Codex, OpenCode都能用,天然支持ssh。我经常用这个系统开着5个以上的团队(至少13个agent进程,不算cli内部的subagents)同时协作,一切都井然有序(当然要配合worktree等等)。准备过两周开源一下。
中文
1
0
3
158