Arthur Lin (@ArthurLinAI) - Twitter profile

Arthur Lin retweeted

David Ondrej@DavidOndrej1·11h

permanent underclass is here

English

11

3

79

5.1K

Arthur Lin@ArthurLinAI·11h

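@MaxForAI @karpathy I don't quite get it. How is the US government supposed to police this? Which models internal employees use, what they access, what research they do? ... Madness eh

Chinese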

2

0

4

2.3K

Max For AI@MaxForAI·14h

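Just realized that @karpathy, who joined Anthropic only a couple of days ago, isn't eligible to access their most powerful model, Fable 5, either 😅 because he is not a US citizen. He was born in Slovakia (formerly Czechoslovakia), emigrated to Canada at 15, is currently a Canadian citizen, and also holds a US EB-1 extraordinary-ability green card. The irony, hahaha 🤣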

Anthropic@AnthropicAI

The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…

Chinese

23

7

201

56.6K

Arthur Lin retweeted

Anthropic@AnthropicAI·20h

The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…

English

11.6K

24.5K

82.4K

76.8M

Arthur Lin@ArthurLinAI·1d

@Paperrolltin @argofowl So funny

Japanese

0

4

GaryJohn@Paperrolltin·1d

@argofowl If you post fake news again, I'll turn you into fries

Chinese

3

1

20

1.9K

🥔🥔🥔@argofowl·1d

gpt 5.6 pro 👀

Indonesia

17

3

176

15.1K

Arthur Lin@ArthurLinAI·1d

@MaxForAI Do you mean that founders are all night owls, or something else? 🤔

Chinese

1

0

1

52

Max For AI@MaxForAI·1d

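Please, everyone, stop hosting Founder's breakfasts, because AI founders aren't in the habit of eating breakfast (they can't get out of bed at all). Maybe consider a Founder's afternoon tea instead :)

Chinese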

7

1

24

3.5K

Arthur Lin@ArthurLinAI·1d

@amorriscode @theo Thanks ❤️❤️

English

0

19

Anthony Morris ツ@amorriscode·1d

@ArthurLinAI @theo /desktop

Svenska

1

0

1

100

Theo - t3.gg@theo·2d

Warning to Claude Code account switchers: multi-account on the desktop app is NOT VIABLE. You have to hard log out and you will lose all your sessions. CLI doesn't care at all, you can just /login in a different tab and all traffic gets routed correctly on existing runs.

English

77

9

850

60.4K

Arthur Lin@ArthurLinAI·2d

@amorriscode @theo Is there a way to resume a session I started in the CLI from the desktop app? Currently that doesn't appear to be possible

English

1

0

106

Anthony Morris ツ@amorriscode·2d

@theo Yeah it’s frustrating and we’ll fix this soon

English

9

0

74

2.6K

Arthur Lin@ArthurLinAI·2d

@peduarte I guess that will be zero? Am I right?

English

0

55

Pedro Duarte@peduarte·2d

got a challenge for you. without checking slack... how many times do you think I need to press ↓ to select #⁠team-hype?

English

33

0

79

34.4K

Arthur Lin@ArthurLinAI·4d

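He is probably referring to pi.dev. Its core idea is that tools like Claude Code or Codex are agent harnesses built by other companies, whereas pi is an agent harness you customize yourself. It is a lot like neovim: everything can be customized. So if you understand how you want to customize your AI agent, rather than only using the settings Claude Code or Codex ship with, building your own AI agent should be fairly easy. Out of the box, pi gives you only fairly basic settings, pretty minimal: a kickstart, a foundation. Go play with it and build your agent harness according to your own understanding. You can log in with your Codex OAuth and use GPT models. Start by downloading it and asking pi what its core idea is and how to customize it (this is covered in its system prompt, so pi can look up the docs and tell you how it can help you build itself). Overall it is fairly advanced; how far you get depends on how well you understand AI agents. Hope this helps~

Chinese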

0

52

TAN@tanzhengmc97·5d

@zzzzzldpc Pi, yet another new concept, haha. I'll make a note of it for now

Chinese

1

0

1

267

TAN@tanzhengmc97·5d

Sincere question: learning agents from zero, what is the minimum necessary knowledge and practice I need to master? I only have 10 days!

Chinese

76

2

15

8.8K

Arthur Lin@ArthurLinAI·4d

@rahul_twtss @kushbhuwalka @steipete Could you elaborate though?

English

1

0

1

8

Rahul Chaurasiya@rahul_twtss·4d

@kushbhuwalka @steipete Most of the loopers won't understand this. But it makes a lot of sense.

English

1

0

49

Kush@kushbhuwalka·4d

I’m pretty AI pilled. This loop stuff is slop. I respect @steipete for his innovation - but openclaw is a bloated unstable pile of garbage because of stuff like this. I’m all for loops of crons and webhooks where an AI agent wakes up and performs some task like cleanup, or updates the docs or triages errors. I think these are great for standard well defined tasks with a fairly deterministic route (a.k.a. workflows). I think what these guys are talking about now is jumping the gun. The models need to be guided, and you want to at least skim their output so you don’t end up with slop. Humans are far better planners and architects than models. You absolutely shouldn’t delegate away prompting and reviews in my opinion. This encourages the creation of crappy buggy unsafe software that actually hurts adoption.

English

73

37

650

39.3K

Arthur Lin@ArthurLinAI·4d

@LeakerApple @grok is this true?

English

1

0

90

AppleLeaker@LeakerApple·4d

macOS 27 Beta 1 is more stable than macOS 26.5 😭

English

61

67

3.3K

191.6K

Arthur Lin@ArthurLinAI·4d

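@tanzhengmc97 Are we using the same AI? 🤔 I think tools like Claude Code or Codex are already pretty impressive and are starting to show real promise. Or does that still count as small stuff? 🤔

Chinese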

1

0

2

74

TAN@tanzhengmc97·4d

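I keep thinking: if the AI era is just what we have now, then it really is a big bubble and a big letdown. What do we have now? Nothing more than content generation, text-to-image, video generation, coding, process optimization... all small stuff. I haven't seen any major industry being reshaped. Does anyone feel their living environment has truly been reshaped by AI? Is there any big difference from before GPT was released? No. Apart from workers getting pay cuts and layoffs, I haven't seen any substantive change so far. But I am also firmly bullish on AI, so I think AI is still early, very early...

Chinese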

19

0

13

2K

Arthur Lin@ArthurLinAI·4d

@argofowl So fable today and GPT 5.6 tomorrow?🥹

English

0

2

197

🥔🥔🥔@argofowl·4d

looks like claude fable 5 is coming today 👀 i will wait for some first impressions from trusted sources before resubbing to max. big day

English

8

0

88

4.5K

Arthur Lin@ArthurLinAI·4d

@bcherny Could you fix all the render bugs instead of continuing to ship broken products? 🤔

English

0

198

Boris Cherny@bcherny·4d

Just landed nested subagent support in Claude Code. Starting to experiment more with agents kicking off agents as a way to better manage context. Capped at depth=5 to start, going out in today’s release. Lmk what you think!

English

501

294

5.6K

469.3K

Arthur Lin@ArthurLinAI·5d

@ziwenxu_ @grok what's poke? Is it some kind of AI platform or something? Figure it out for me

English

1

0

42

Ziwen@ziwenxu_·5d

Poke's system prompt finally leaked. I once spent 30 minutes bargaining with it over price. it wouldn't budge, roasted me for lowballing, and i genuinely thought a person was messing with me. After reading that prompt, every human thing it did was just a line in there. the roasting. the stubbornness. the lowercase. so the personality was never really Poke's. drop that prompt into almost any agent and it'll feel just as alive. That's the valuable part. not the bot. the recipe.

Burrito@BurritoCade

@0xUltraInstinct now they are main agent (as close as possible): gist.github.com/burritosoftwar… email processing subagent: gist.github.com/burritosoftwar… older ones (before apple messages): github.com/x1xhlol/system…

English

41

18

1.1K

235.7K

Arthur Lin@ArthurLinAI·5d

can a general agent like Claude Code actually reach into iOS apps and do what Siri does? apple built App Intents as the action layer... but is it exposed as an API anyone can call, or only through apple's own Siri?

English

0

41

Arthur Lin@ArthurLinAI·5d

@vanillaCitron Indeed, AI auto-grouping

Japanese

0

359

citron🍢🍋 🔜WWDC26@vanillaCitron·5d

Looks like Safari has been hiring from Arc/Dia 🤣 #wwdc

Chinese

2

0

39

6.7K

Arthur Lin@ArthurLinAI·5d

@teach_fireworks Great article, nice ❤️

Chinese

0

1

60

烟花老师@teach_fireworks·5d

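Five traps of agent harnesses
1. Self-evaluation is a trap. Use an adversarial evaluator.
Many teams do this: the agent generates an answer → the agent scores itself → the agent says "95 points". Then it falls over after launch. The reason is simple: a model has a strong agreement bias toward its own output.
2. Compaction doesn't cure coherence drift. Structured handoffs do.
This is the biggest misconception in many agent frameworks. On long tasks the context keeps growing: 10k tokens → 50k tokens → 200k tokens. Many people assume that summarizing, compacting, and carrying on solves the problem. In practice, compaction is not fidelity: the compaction process itself loses information. The end result is goal drift, lost constraints, and distorted requirements. For example, with Codex: at the start, "refactor the payment module"; 20 rounds later, it starts modifying the login logic. It has already drifted off course. What actually works is a structured handoff with a fixed format at every handoff: Goal, Current State, Completed, Pending, Constraints, Known Risks, Next Action. Anthropic's Claude Code, OpenAI Codex Cloud, and Cursor Background Agent are all evolving in this direction.
3. Make subjective quality gradable with rubrics the model can apply.
Many requirements are inherently vague: Is the article well written? Is the UI pretty? Is the code elegant? The model has no idea what "good", "pretty", or "elegant" means, so these must be turned into a precise scoring sheet.
4. Read the traces. They're your primary debugging loop.
Many people debug an agent by looking only at input → output, that is, only at the result. But an agent's biggest problems happen in the middle. For example, in Planner → Search → RAG → Tool → Reasoning → Action, some step has already gone wrong. So an agent's logs matter more than its results. Examples: OpenAI Agent SDK: Trace/Span/Event; LangSmith: Execution Trace; Langfuse: Agent Trace; Arize Phoenix: LLM Observability. At bottom these are all observability. The golden rule of agent development: spend 80% of the time reading traces and 20% changing prompts. Many teams do exactly the opposite.
5. Delete scaffolding when the model catches up. The frontier moves.
This is a death trap for many agent products. Standard practice in 2024: Prompt + Planner + Reflection + Self Critic + Chain of Thought + Tree Search. In 2026, GPT-5 and Claude Sonnet 5 may already have these capabilities built in. Yet teams still keep 10 layers of agents and 20 layers of workflows. The system gets more complex, latency gets higher, cost goes up, and results do not improve. A typical case: many early RAG pipelines ran Query Rewrite → Multi Query → Rerank → Summary → Answer. Now many models go straight Search → Answer, and that is already good enough. So a more complex harness is not a better one. Good agent engineers have a habit: with every model upgrade, add one layer of capability and delete two layers of scaffolding, constantly subtracting. A harness is therefore a moving target, updated continuously as models evolve.

Chinese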

1

4

26

1.5K
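Editor's sketch for point 2 of the post above: a minimal Python illustration of a fixed-format structured handoff. Only the seven field labels (Goal, Current State, Completed, Pending, Constraints, Known Risks, Next Action) come from the post; the `Handoff` class, its `render`/`parse` methods, the `"; "` list separator, and the example values are hypothetical and do not reflect how Claude Code, Codex, or Cursor actually implement handoffs.

```python
from dataclasses import dataclass, field

# Labels come from the post; the mapping itself is an illustrative assumption.
LABELS = {
    "goal": "Goal",
    "current_state": "Current State",
    "completed": "Completed",
    "pending": "Pending",
    "constraints": "Constraints",
    "known_risks": "Known Risks",
    "next_action": "Next Action",
}
LIST_FIELDS = {"completed", "pending", "constraints", "known_risks"}


@dataclass
class Handoff:
    """Hypothetical container for one agent-to-agent handoff."""
    goal: str
    current_state: str
    next_action: str
    completed: list[str] = field(default_factory=list)
    pending: list[str] = field(default_factory=list)
    constraints: list[str] = field(default_factory=list)
    known_risks: list[str] = field(default_factory=list)

    def render(self) -> str:
        """Serialize in a fixed label order so every handoff looks the same."""
        lines = []
        for key, label in LABELS.items():
            value = getattr(self, key)
            text = "; ".join(value) if key in LIST_FIELDS else value
            lines.append(f"{label}: {text}")
        return "\n".join(lines)

    @classmethod
    def parse(cls, text: str) -> "Handoff":
        """Rebuild a Handoff; raise if any of the seven fields is missing."""
        by_label = {}
        for line in text.splitlines():
            label, sep, value = line.partition(":")
            if sep:
                by_label[label.strip()] = value.strip()
        kwargs = {}
        for key, label in LABELS.items():
            if label not in by_label:
                raise ValueError(f"handoff is missing required field: {label}")
            raw = by_label[label]
            if key in LIST_FIELDS:
                kwargs[key] = [item for item in raw.split("; ") if item]
            else:
                kwargs[key] = raw
        return cls(**kwargs)


# Example values are made up, loosely following the post's payment/login scenario.
handoff = Handoff(
    goal="Refactor the payment module",
    current_state="Charge flow extracted into its own class",
    next_action="Port the refund flow",
    completed=["Extract charge flow"],
    pending=["Port refund flow", "Update tests"],
    constraints=["Do not touch login logic"],
)
print(handoff.render())
```

The point of the sketch is that the goal and constraints are carried verbatim from one round to the next instead of being re-summarized, and that a handoff missing a field is rejected rather than silently accepted. List items containing `"; "` would not round-trip in this simplified format.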

Arthur Lin@ArthurLinAI·5d

@peduarte Why do you think teaming up with Google is better for Apple though?

English

0

897

Pedro Duarte@peduarte·5d

i like it that apple teamed up with google and not openai/anthropic

English

9

0

192

13.8K

Arthur Lin@ArthurLinAI·5d

I don't understand why, after switching from iPhone to Pixel, the swiping and scrolling physics feel awful. It either swipes too little (kinda sticky) or overscrolls. It doesn't seem to be a finger detection issue; it's more about the physics. It just feels weird. Someone help, because the Pixel is a great phone minus the scrolling physics 🥹

English

0

26
