William

3.9K posts

William banner
William

William

@Q_samas

AI is reshaping human civilization by expanding our capacity to learn, create, and innovate—pushing the boundaries of what humanity can achieve.

Canada Katılım Nisan 2024
689 Takip Edilen635 Takipçiler
Sabitlenmiş Tweet
William
William@Q_samas·
Artificial intelligence, a catalyst for human civilization, amplifies our capacity to explore, create, and understand the world. From accelerating scientific discovery to redefining interaction, learning, and global systems, AI serves as a partner in expanding human potential.
English
0
0
8
1.2K
William
William@Q_samas·
@mizorewww pro 20x也用成这样?建议开新号了😅
中文
0
0
2
37
William
William@Q_samas·
@nini_incrypto_ GPT-5.5确实有点降智,但是你这个感觉像是IP问题,而且目前已经是high medium 和instant这些思考程度了,为啥你还是thinking?😅
中文
0
0
1
97
nini
nini@nini_incrypto_·
我不知道我的gpt降智成啥了 我跟他讨论spacex ipo相关,他一直反驳我说没有ipo,还莫名其妙的扯到2025年,谁跟他讨论25年啊……
nini tweet media
中文
73
0
35
36.4K
撸毛换大饼 · Ai
clash Verge 是电脑上最好用的翻墙软件,没有之一 可惜没有手机版本的,啥时候出个iPhone版的, Shadowrocket 小火箭太难用了🥹
撸毛换大饼 · Ai tweet media
中文
203
52
601
178.7K
William
William@Q_samas·
@RocM301 @Apple 给你个建议,关闭Siri然后不要管他,过段时间自动就变成new Siri。
中文
0
0
2
301
小鹏Digital
小鹏Digital@RocM301·
我还在wait,每天记录,我看你@apple 到哪天才能给我通过。
小鹏Digital tweet media
中文
20
1
25
10K
William
William@Q_samas·
@kfk_ai 怕被蒸馏成第一了😂
日本語
0
0
2
1.4K
Kafka
Kafka@kfk_ai·
余承东刚说完盘古要做第一 美国政府就把 Fable 关了
中文
27
4
72
72.9K
Rhys
Rhys@RhysSullivan·
trying out some alternatives with fable access gone 你好,我的中国同志们!
Rhys tweet media
中文
134
9
922
82.1K
William
William@Q_samas·
@arena Same model’s rank can change by time?WTF???
English
0
0
1
50
Arena.ai
Arena.ai@arena·
GPT-5.5 (xHigh) ranks #2 on Agent Arena (+10.6% net improvement), making it the highest-ranked OpenAI model closely behind Claude Fable 5 (High). Per signal breakdown, GPT-5.5 (xHigh) ranks #1 in Praise vs. Complaint (+29.4%) and Bash Recovery (+14.1%), scoring higher than Claude Fable 5 (High) on both signals. It trails Claude Fable 5 (High) on Confirmed Success (+5.4% vs. +17.6%) and Steerability (+1.9% vs. +5.4%). Agent Arena evaluates models on millions of real-world, long-horizon agentic tasks. Models use tools like web search, filesystem, and terminal to complete complex workflows: writing code, creating slide decks, researching the web, building apps, and analyzing documents. We use causal tracing to measure model performance across real-world agentic tasks. More breakdown of GPT-5.5 (xHigh) across five signals in the thread.
Arena.ai tweet media
Arena.ai@arena

Introducing Agent Mode: Agentic AI is now measured in the Arena. Agent Mode can do deep research, create reports, generate images, build websites, debug code, and more. It completes more complex tasks by using tools like web search, bash in a sandbox environment, image generation, file writing, and asking follow-up questions. Frontier models are waiting for you in Agent Mode to take on real-world tasks. GPT-5.5, Claude Opus 4.7, Gemini 3.1 Pro, and top open models. Test them yourself.

English
20
39
472
45.7K
Noah Cat
Noah Cat@Cartidise·
3 days dater, my iPhone is still indexing and i’m still on the new Siri waitlist COME ON APPLE
Noah Cat tweet mediaNoah Cat tweet media
English
73
9
506
104.6K
Sarah
Sarah@araseb_·
Are you team Claude or Codex right now ?
English
269
2
222
66.1K
William
William@Q_samas·
@angeljimenez What fuck is this dumb question?So dumb question 😅
English
3
0
1
467
BunnyLau
BunnyLau@BunnyxStudio·
搞定了在国内使用完整版 Siri AI 的方法,看起来这次的检验方式还是挺特别的,等我有空再测试一下具体的细节。
BunnyLau tweet mediaBunnyLau tweet mediaBunnyLau tweet media
中文
39
4
266
121.4K
William retweetledi
Boris Cherny
Boris Cherny@bcherny·
Fable 5 is the biggest step up I’ve felt in our models since Opus 4.5 back in November. After 4.5 came out I uninstalled my IDE when I realized that I’d been doing 100% of my coding in a terminal for a few weeks. With Fable, it’s felt like Claude has stepped up from being a coding agent to a thought and design partner in building the product. Fable has judgement, taste, and dimensionality in a way that previous models didn’t, leading me to trust it more with the most complex work. I think the first time I had this realization was when I asked Fable to debug something. It is the first model I have used that was so methodical and precise, taking measurements and adding logs then verifying that it truly fixed the issue before declaring victory. There’s nothing in claude code’s prompting telling the model to do that, it’s just part of its personality. It really has this “big model smell” that I haven’t felt before.
English
654
599
10.6K
890.8K
henriqez
henriqez@gabrrlgh·
una duda, por qué los chinos no usan el WhatsApp ni el Facebook ???
Español
2.2K
20
966
916.3K
William
William@Q_samas·
@mon1y_ 锁地理了,有地理围栏😅
中文
0
0
1
107
达芬七|Seven
达芬七|Seven@SuisPasDaVinci·
卧槽,Claude Fable 5牛逼了这回,AGI要来嘞
达芬七|Seven tweet media
中文
10
0
15
13.7K
William retweetledi
hsn
hsn@hsn8086·
hsn tweet media
ZXX
18
20
326
51K
蔡子博士Chris
蔡子博士Chris@caiziboshi·
高考完就苹果全家桶,这个父母有品位!
中文
148
5
188
60.9K