Yuanchang

201 posts

@yuanchang_org

Ecom seller; AI builder

Germany · Joined December 2021

257 Following · 48 Followers

Yuanchang @yuanchang_org · 6d

@grapeot This post doesn't seem to have been synced to the share index.

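鸭哥 @grapeot · 6d

By August 17, content recorded in Jira/Confluence will be used by Atlassian for AI training by default. Metadata collection is mandatory and cannot be turned off on Free/Standard/Premium; only Enterprise (starting at 801 users) can opt out. The most notable thing here is not the privacy issue. It signals that the SaaS industry is turning data protection from a compliance obligation into a pricing feature: keeping your data out of training is becoming, like SSO, something you have to pay for. From Zoom (2023) to Slack (2024) to Atlassian/GitHub (2026), three generations of policy trial and error reveal the full picture of this shift. yage.ai/share/saas-ai-…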

Yuanchang @yuanchang_org · Apr 28

@fm100 @turingou Is there a recording of the livestream?

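Bob Fu 傅丰元 @fm100 · Apr 27

Since we're holding an event in Tokyo, we were lucky to have @turingou (Guo Yu) join our roundtable. tuwa.ai, which he has been building lately, is also a very imaginative voice AI product. The one-line guest title he sent over was "retired programmer" (退休程序员). When translating it into English, I took the liberty of giving it two layers of meaning: Retired & Retreated, a programmer who is both retired and in seclusion. The event is tomorrow, Tuesday morning. Friends in Tokyo, come join us: luma.com/kq31a21p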

RTE Dev Community @rtedevcommunity

AI Hardware: A simple "tool" or an emotional "companion"? 🤖❤️ Next Tue (Apr 28), the Physical AI Event Series lands in Tokyo! 🇯🇵 Join us alongside @RokidGlobal, @AgoraIO, @RiseLink_X & @TechTabi_ to explore the future of AI hardware. 👇

Yuanchang @yuanchang_org · Apr 25

@turingou Also, I only noticed from this screenshot that the window UI doesn't seem to be level. Is that intentional?

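Yuanchang @yuanchang_org · Apr 25

@turingou The task I ran last night died on its own (probably because codex dropped), and when I went to log in again today, creating a sandbox no longer let me log in.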

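郭宇 guoyu.eth @turingou · Apr 25

Today I'm very happy to officially introduce and open-source my 14th vibe product, wanman.ai. Its idea is simple: let anyone in the world, with the help of a team of AI agents, found or take over any organization from scratch and, centered on the user's core intent, keep running a one-person company automatically.

To put this idea into practice, wanman has to be as simple as possible: no deployment, no need to buy a Mac mini, no complicated permissions to manage. Just open the website, type in an idea, and it runs.

wanman has two working modes. The first runs autonomously from a story goal: it analyzes the goal, plans tasks, invites AI employees, and holds meetings automatically to align on the goal. At the end of each virtual workday, wanman proactively brainstorms new ideas so that the task keeps running. The second is takeover, which currently supports any GitHub repository: it analyzes the repository, automatically optimizes and tests around the goal, and, while running continuously, keeps committing code to the remote repository. In either mode, wanman runs until the goal is achieved.

Like many popular harness products, it supports agent-to-agent messaging, self-evolving skills, sandboxed execution, and a multi-model architecture. What sets it apart is its design philosophy: let human users step back and act only as observers of the AI team. Its core architecture and components are open source on GitHub: github.com/chekusu/wanman

Starting now, any user can log in to wanman.ai and try it for free. It currently supports authorizing codex, and will gradually support multi-model authorization and automatic scheduling. This is the product I have spent the longest developing this year, and one of the most important. To build wanman I made infrastructure such as sandbank cloud, chatben, and tuwa, and the vibe experience from those products all ended up as part of wanman's code.

The name wanman comes from the romanization of the Japanese "one man" (ワンマン). In the Japanese countryside you often see ワンマン電車 (one-man trains) leisurely making their way through the fields. I believe that in the near future many people will use wanman to create their own ワンマン会社 (one-man company). I hope everyone can use wanman.ai to say goodbye to the headaches of starting a company and enjoy the fun of creating!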

Yuanchang @yuanchang_org · Apr 24

@cky011 @dotey It's actually 400k, and the API can take 1M.

ck y @cky011 · Apr 23

@dotey Not sure why the context is only 258k.

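宝玉 @dotey · Apr 23

OpenAI has released GPT-5.5. It is now available to ChatGPT Plus, Pro, Business, and Enterprise users and launches in Codex at the same time. The API will follow "very soon."

GPT-5.5's core selling point is "smarter without getting slower." OpenAI claims its per-token latency in real-world serving matches GPT-5.4, while it uses fewer tokens to complete the same Codex tasks. In other words, efficiency is rising along with capability.

On benchmarks, GPT-5.5 set new records on several. On Terminal-Bench 2.0 (a test of complex command-line workflows) it scored 82.7%, versus 75.1% for GPT-5.4 and 69.4% for Claude Opus 4.7. On OSWorld (where the model operates a real computer environment on its own) it reached 78.7%, close to Claude Opus 4.7's 78.0%. On GDPval (which measures output quality against industry experts across 44 occupations) it scored 84.9%, also a step ahead. On SWE-Bench Pro, however, Claude Opus 4.7's 64.3% is still higher than GPT-5.5's 58.6%, and OpenAI itself notes in the table that this benchmark has a memorization problem.

The gains in math and scientific research are more pronounced. FrontierMath Tier 4 jumped from GPT-5.4's 27.1% to 35.4%, while Claude Opus 4.7 is at only 22.9%. OpenAI also published a new proof about Ramsey numbers discovered with help from an internal version of GPT-5.5; it has been verified in Lean. On the newly introduced GeneBench, which tests multi-stage genetics data analysis, GPT-5.5 scored 25.0% versus 19.0% for GPT-5.4.

The composite intelligence index from third-party firm Artificial Analysis shows an interesting efficiency comparison: at the same intelligence level, GPT-5.5 consumes roughly half the total tokens of competing frontier coding models. For API users that is a real cost advantage.

On pricing, the API costs $5 per million input tokens and $30 per million output tokens, with a 1-million-token context window. GPT-5.5 Pro is priced higher, at $30 input and $180 output. Although the unit price is higher than GPT-5.4's, OpenAI stresses that the gain in token efficiency can offset the price difference.

On safety, OpenAI classifies GPT-5.5's cybersecurity and biological/chemical capabilities as "High" under the Preparedness Framework (not reaching Critical). Its CyberGym score is 81.8%, higher than Claude Opus 4.7's 73.1%. OpenAI also launched the Trusted Access for Cyber program for security researchers, which relaxes safety restrictions for compliant users.

Internally, more than 85% of OpenAI employees already use Codex every week, across functions including engineering, finance, marketing, and data science. One case mentioned in the article: the finance team used Codex to review 24,771 K-1 tax forms totaling 71,637 pages, finishing two weeks earlier than last year.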
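To make the pricing in the post above concrete, here is a minimal sketch of the per-request arithmetic. The prices are the ones quoted in the post; the workload sizes and the "old price" passed to `breakeven_token_ratio` are hypothetical, since the post does not state GPT-5.4's price.

```python
# Prices quoted in the post (USD per million tokens).
GPT_5_5 = {"input": 5.0, "output": 30.0}
GPT_5_5_PRO = {"input": 30.0, "output": 180.0}

def request_cost(prices, input_tokens, output_tokens):
    """Cost in USD of one request at per-million-token prices."""
    total = input_tokens * prices["input"] + output_tokens * prices["output"]
    return total / 1_000_000

def breakeven_token_ratio(new_price, old_price):
    """Fraction of the old token count the new model may use at equal cost."""
    return old_price / new_price

# Hypothetical workload: 200k input tokens and 20k output tokens.
print(request_cost(GPT_5_5, 200_000, 20_000))      # 1.6
print(request_cost(GPT_5_5_PRO, 200_000, 20_000))  # 9.6

# Hypothetical: if the older model cost half as much per token,
# the newer one breaks even by using half as many tokens.
print(breakeven_token_ratio(new_price=5.0, old_price=2.5))  # 0.5
```

The break-even ratio is the sense in which token efficiency can offset a higher unit price: the cheaper-per-token model only wins if it does not need proportionally more tokens.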

OpenAI @OpenAI

Introducing GPT-5.5 A new class of intelligence for real work and powering agents, built to understand complex goals, use tools, check its work, and carry more tasks through to completion. It marks a new way of getting computer work done. Now available in ChatGPT and Codex.

Yuanchang @yuanchang_org · Apr 20

@OpenAIDevs so where is GPT 6?

OpenAI Developers @OpenAIDevs · Apr 20

Last week, we released a preview of memories in Codex. Today, we’re expanding the experiment with Chronicle, which improves memories using recent screen context. Now, Codex can help with what you’ve been working on without you restating context.

Yuanchang @yuanchang_org · Apr 17

@labomen001 @TommyFalkowski @adonis_singh Did you try it? Was it successful?

Labomen @labomen001 · Apr 17

@TommyFalkowski @adonis_singh Thanks a ton, I thought it'd require patching the Codex app, I can do this myself even

adi @adonis_singh · Apr 17

made codex patch computer-use so I can use it in EU

Yuanchang @yuanchang_org · Apr 17

@JoshKale Could the video you forwarded be any lower quality?

Josh Kale @JoshKale · Apr 16

Today Perplexity shipped everything Siri was supposed to be 💻 Personal computer now has access to: → iMessage → Every folder on your Mac → 400+ connected apps → Apple Mail, Calendars, Browsers etc... Underneath, Claude Opus 4.7 is the brain. It breaks your goal into subtasks and routes each one to whichever of 20 models wins at it. GPT for long context. Gemini for deep research. Grok for speed. Nano Banana for images. Veo for video. Codex for code. It runs 24/7. You can trigger it from your phone. Pretty sweet design too

Perplexity @perplexity_ai

Today we're releasing Personal Computer. Personal Computer integrates with the Perplexity Mac App for secure orchestration across your local files, native apps, and browser. We’re rolling this out to all Perplexity Max subscribers and everyone on the waitlist starting today.

Yuanchang @yuanchang_org · Apr 17

It feels like the recent Codex update was a total letdown. The most prominent core feature, Computer Use, is unavailable in the EU, and they haven't released any truly new models. While they did announce GPT-5.4 Cyber, individual users can't even access it. Considering OpenAI's focus on the 2C (consumer) business, this is nothing short of a disaster. What do you think? @thsottiaux

Yuanchang @yuanchang_org · Apr 17

@thsottiaux That's cool, but how would users in Germany and across Europe use it?

Tibo @thsottiaux · Apr 16

Codex Compute efficient ✅ Always up, never down ✅ Best at hardcore engineering ✅ Crazy good app, first to escape the terminal ✅

Yuanchang @yuanchang_org · Apr 16

@steipete @skjtwts apply and BYOK

Peter Steinberger 🦞 @steipete · Apr 15

@skjtwts you gotta apply

Sundaram Kumar Jha @skjtwts · Apr 15

GPT 5.4-cyber ?? where is that ??

Peter Steinberger 🦞 @steipete

If you look at GPT 5.4-Cyber and its ability for closed source reverse engineering, I have bad news for you. I do very much feel the pain though, there's hundreds of teams that try to poke holes into @openclaw. Our response has been one of rapid iteration and code hardening, which did introduce occasional regressions (and yes, you've all been yelling at me), but I see it as the only way forward. I would be very careful of other open source projects/harnesses that ignore this work and do not publish their advisories. github.com/openclaw/openc…

Yuanchang @yuanchang_org · Apr 10

Claude has started using foul language too...

Yuanchang @yuanchang_org · Apr 8

@__Inty__ An Error 500 usually means a new model is being rolled out.

Inty News @__Inty__ · Apr 8

Breaking: Claude Code's servers are down, showing API Error: 500

Yuanchang @yuanchang_org · Apr 1

@gemini_shuffled For friends who don't know what he's talking about

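Gemini @gemini_shuffled · Mar 31

After reading so many analysis reports, I've realized that people who can't use AI never will:

1. The device ID is stored in the userID field of ~/.claude.json; change a single letter and you have a new device ID
2. For the operating system, it only knows you're on a Mac with ARM architecture
3. For environment info, it only knows whether you're using node, bun, or npm
4. Telemetry can be turned off

What good is having the source code? Your analyses are all wrong.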

Yuanchang @yuanchang_org · Mar 31

@0xultravioleta same here

ultravioleta 🟣 @0xultravioleta · Mar 31

Claude Code be like 👀 529 {"type":"error","error":{"type":"overloaded_error","message":"Overloaded"},"request_id":"req_011CZbds*****qDKw5CZ72mP"}
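A client that sees the error above would typically wait and retry. The following is a minimal sketch of recognizing that response and computing a backoff schedule. The helper names `is_overloaded` and `backoff_delays` are hypothetical and not part of any SDK, and the delay values are assumptions; the body is copied from the post, masked request ID included.

```python
import json

def is_overloaded(status: int, body: str) -> bool:
    """Return True when the response looks like the 529 overloaded error above."""
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        return False
    error = payload.get("error") if isinstance(payload, dict) else None
    return (
        status == 529
        and isinstance(error, dict)
        and error.get("type") == "overloaded_error"
    )

def backoff_delays(attempts: int, base: float = 1.0, cap: float = 30.0) -> list:
    """Exponential backoff in seconds: base, 2*base, 4*base, ... capped at cap."""
    return [min(cap, base * 2 ** i) for i in range(attempts)]

body = '{"type":"error","error":{"type":"overloaded_error","message":"Overloaded"},"request_id":"req_011CZbds*****qDKw5CZ72mP"}'
print(is_overloaded(529, body))  # True
print(backoff_delays(5))         # [1.0, 2.0, 4.0, 8.0, 16.0]
```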

Yuanchang @yuanchang_org · Mar 28

@SuisPasDaVinci What a great marketer

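达芬七｜Seven @SuisPasDaVinci · Mar 27

A foreigner shopping online for Chinese chopsticks ordered 1 and was sent 7,000, and was forced to become a chopstick artist 🤣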

Yuanchang @yuanchang_org · Mar 26

MAU should be blockchain-native and decentralized. The vision is: don't trust, verify

Yuanchang @yuanchang_org · Mar 26

Anthropic just published their agent harness design: Generator + Evaluator separation, 6hr autonomous coding sessions, $124 to build a working DAW. Key finding: agents systematically over-praise their own work. You need a separate evaluator. They didn't open-source it. Meanwhile, Karpathy's autoresearch (25K stars) proved overnight self-evolution works — but with zero memory. Iteration 50 might retry what failed at iteration 3. I'm building the third path: Resonance Nano — a Minimal Agent Unit with identity (Context Infrastructure), memory (structured notebook), and autonomous explore/optimize mode switching. MAU will be open source. Soon.
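The two ideas in the post above, a separate evaluator and a memory of failed attempts, can be sketched in a few lines. This is an illustration only, not Anthropic's harness, autoresearch, or Resonance Nano; `Notebook`, `run_loop`, and the toy generator and evaluator are all hypothetical names.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional, Tuple

@dataclass
class Notebook:
    """Structured memory of candidates that were already tried and rejected."""
    failed: set = field(default_factory=set)

def run_loop(
    generate: Callable[[int], str],
    evaluate: Callable[[str], bool],
    notebook: Notebook,
    iterations: int,
) -> Tuple[Optional[str], int]:
    """Return (accepted candidate or None, number of evaluator calls)."""
    calls = 0
    for i in range(iterations):
        candidate = generate(i)
        if candidate in notebook.failed:
            continue  # memory: do not retry something that already failed
        calls += 1
        if evaluate(candidate):  # the evaluator is separate from the generator
            return candidate, calls
        notebook.failed.add(candidate)
    return None, calls

# Toy run: the generator repeats "a" at iteration 2, and the notebook skips it.
proposals = ["a", "b", "a", "c"]
result = run_loop(lambda i: proposals[i], lambda c: c == "c", Notebook(), 4)
print(result)  # ('c', 3)
```

Without the notebook the evaluator would be called four times in this run; with it, the repeated "a" is skipped.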

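Yuanchang @yuanchang_org · Mar 24

Cost: a $200/month Claude Max plan carries the whole pipeline. Local transcription $0, email $0, and the infrastructure is a single Mac. The bottleneck isn't money; it's whether you're willing to systematize your own cognition. Full technical breakdown (architecture diagram, SSH remote pitfalls, trust-tier design, practical advice for builders) 👇 yuanchang.org/posts/ai-perso…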

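Yuanchang @yuanchang_org · Mar 24

The system also has an automatic distillation pipeline: an Observer watches behavior patterns every day, and a Reflector periodically distills them into decision axioms. 45 axioms have accumulated so far, and they are loaded automatically for tasks that involve decisions. The most interesting part is the self-reference: I use this system to improve itself. I noticed the TODO was empty → recorded a voice memo → the AI diagnosed the root cause and proposed solutions → I recorded another memo saying "Option B looks good" → the pipeline created the script and integrated it directly. All by voice. The cognitive framework is based on Context Infrastructure, open-sourced by @grapeot.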

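Yuanchang @yuanchang_org · Mar 24

Over the past few months I've built an AI personal evolution system. AI models keep getting smarter, but one problem has never been solved: they don't know you. Every time you open ChatGPT / Claude it's a brand-new conversation, and all you get is "correct but useless" advice aimed at everyone. My take: AI has shifted from CPU-bound to memory-bound. Model intelligence is available to everyone, so it's no longer worth much. Your personal context is the compounding asset. So I built a complete pipeline: voice memo recording → local transcription → AI classification and execution → trust tiering → email notification. It processes 20-40 recordings a day, and the longer you use it, the better the AI understands you. Full breakdown 🧵👇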
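The pipeline stages named in the post can be sketched as a chain of small functions. Everything here is a stand-in: the post does not describe the implementation, so the `Memo` type, the keyword classifier, and the two trust tiers ("auto" and "review") are assumptions made for illustration.

```python
from dataclasses import dataclass

@dataclass
class Memo:
    audio_path: str
    text: str = ""
    category: str = ""
    trust: str = ""

def transcribe(memo: Memo) -> Memo:
    # Stand-in for local speech-to-text: derive "text" from the file name.
    memo.text = memo.audio_path.rsplit(".", 1)[0].replace("_", " ")
    return memo

def classify(memo: Memo) -> Memo:
    # Stand-in for the AI classifier: a single keyword rule.
    memo.category = "task" if "todo" in memo.text else "note"
    return memo

def grade_trust(memo: Memo) -> Memo:
    # Hypothetical tiers: notes are filed automatically, tasks wait for review.
    memo.trust = "auto" if memo.category == "note" else "review"
    return memo

def notify(memo: Memo) -> str:
    # Stand-in for the email step: return the message instead of sending it.
    return f"[{memo.trust}] {memo.category}: {memo.text}"

def run_pipeline(audio_path: str) -> str:
    memo = Memo(audio_path)
    for stage in (transcribe, classify, grade_trust):
        memo = stage(memo)
    return notify(memo)

print(run_pipeline("add_todo_item.m4a"))  # [review] task: add todo item
print(run_pipeline("idea_for_blog.m4a"))  # [auto] note: idea for blog
```

Keeping each stage as a function from `Memo` to `Memo` means a real transcriber or classifier could replace a stub without touching the rest of the chain.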
