东京闲少 (@sawaxtokyo) - Twitter Profili | Zamantika Mersobahis Locabet

东京闲少@sawaxtokyo·2h

@cb_doge bullshit app experience on iPadOS，just give up might be better

English

0

5

DogeDesigner@cb_doge·2h

NEVER GIVE UP

English

376

380

1.9K

28.1K

东京闲少@sawaxtokyo·3h

@nikkeibpITpro 日本語版とは何でしょうか、比較的高い特性に値しますか？

日本語

0

370

日経クロステック IT@nikkeibpITpro·1d

リコーがGPT-5級の日本語LLMを完成させた xtech.nikkei.com/atcl/nxt/news/… 700億パラメーターを誇る「Llama-3.3-Ricoh」は、融資稟議など金融業務の自動化を実現。社員が「AIが部下として働く日が来た」と語るほどの完成度で…（2025年10月）

日本語

63

602

3.6K

564.3K

东京闲少@sawaxtokyo·3h

@aaronp613 meaningless if they dont fix the permanent memory of grok

English

0

10

Aaron@aaronp613·7h

X is working on a “Analyze with Grok” menu on iOS. You will be able to choose from: - Summarize this - Is this true? - Explain this

English

13

4

161

8.1K

东京闲少@sawaxtokyo·3h

@WesRoth dont post old story repeatedly ，it is meaningless if grok dont fix the memory system and the fucking damn shit app experience on the fucking iPadOS

English

0

30

Wes Roth@WesRoth·4h

xAI achieved a score of 53 on the Artificial Analysis Intelligence Index, placing it ahead of Muse Spark, Claude Sonnet 4.6, and previous iterations of Grok. The most substantial gain is in real-world agentic tasks, particularly on the GDPval-AA benchmark, where Grok 4.3 saw a 321-point Elo increase to reach 1500. It now surpasses Gemini 3.1 Pro Preview and GPT-5.4 mini in this category, though it still trails GPT-5.5.

Artificial Analysis@ArtificialAnlys

xAI has launched Grok 4.3, achieving 53 on the Artificial Analysis Intelligence Index with improved agentic performance, ~40% lower input price, and ~60% lower output price than Grok 4.20 The release of Grok 4.3 places @xAI just above Muse Spark and Claude Sonnet 4.6 on the Intelligence Index, and a 4 points ahead of the latest version of Grok 4.20. Grok 4.3 improves its Artificial Analysis Intelligence Index score while reducing cost to run the benchmark suite. Key Takeaways: ➤ Grok 4.3 improves on cost-per-intelligence relative to Grok 4.20 0309 v2: it scores higher on the Intelligence Index while costing less to run the full benchmark suite. Grok 4.3 costs $395 to run the Artificial Analysis Intelligence Index, around 20% lower than Grok 4.20 0309 v2, despite using more output tokens. This makes it one of the lower-cost models at its intelligence level ➤ Large increase in real world agentic task performance: The largest single benchmark improvement is on GDPval-AA, where Grok 4.3 scores an ELO of 1500, up 321 points from Grok 4.20 0309 v2’s score of 1179 Grok 4.3, surpassing Gemini 3.1 Pro Preview, Muse Spark, Gpt-5.4 mini (xhigh), and Kimi K2.5. Grok 4.3 narrows the gap to the leading model on GDPval-AA, but still trails GPT-5.5 (xhigh) by 276 Elo points, with an expected win rate of ~17% against GPT-5.5 (xhigh) under the standard Elo formula ➤ Grok 4.3’s performs strongly on instruction following and agentic customer support tasks. It gains 5 points on 𝜏²-Bench Telecom to reach 98%, in line with GLM-5.1. Grok 4.3 maintains an 81% IFBench score from Grok 4.20 0309 v2 ➤ Gains 8 points on AA-Omniscience Accuracy, but at the cost of lower AA-Omniscience Non-Hallucination Rate of 8 points, so Grok 4.20 0309 v2 still leads AA-Omniscience Non-Hallucination Rate, followed by MiMo-V2.5-Pro, in line with Grok 4.3 Congratulations to @xAI and @elonmusk on the impressive release!

English

6

3

41

2.8K

东京闲少@sawaxtokyo·11h

@elonmusk where is cross session memory，where is grok 4.4

English

0

1

25

Elon Musk@elonmusk·12h

Banger 😂

Indonesia

5.4K

23.7K

144.5K

42.9M

东京闲少@sawaxtokyo·16h

@mark_k @xai too slow，grok app stopped upgrade

English

0

30

Mark Kretschmann@mark_k·17h

Upcoming releases from @xai 🔥🔥 - Grok Build (imminent) like Codex and Claude Code - Grok Computer (soon) computer-use agent - Grok Imagine Pro 1080p - Grok Imagine 2.0 - Grok 4.4 with 1T parameters - Grok 4.5 with 1.5T parameters - Grok 5 with 10T

English

138

108

1.5K

48.9K

东京闲少@sawaxtokyo·18h

nothing update for grok app good job xAI

English

0

5

东京闲少@sawaxtokyo·2d

@M16A_hayabusa this is Japan

English

0

142

M16A HAYABUSA@M16A_hayabusa·3d

注：２１世紀の日本です。

鉄の男@nighthawkf117aj

今は令和です大日本帝国時代？君は時代遅れの人間ですか？

日本語

34

279

1.4K

48.7K

东京闲少@sawaxtokyo·2d

@techdevnotes I only need memory

English

0

53

Tech Dev Notes@techdevnotes·2d

Wednesday Grok Imagine got canvas Thursday Grok 4.3 got API Friday Grok Voice got cloning What if Saturday is Grok Build

English

28

15

274

9.6K

东京闲少@sawaxtokyo·2d

@TheCaptainEli what are you guys updated for？？？？？？

English

0

5

Captain Eli@TheCaptainEli·2d

New update

English

4

46

1.8K

东京闲少@sawaxtokyo·2d

@YangYang1068820 @Rumoreconomy 并没有grok在这里

中文

0

343

Bing Yang@YangYang1068820·2d

@Rumoreconomy perplexity

English

2

16

5.6K

财经真相@Rumoreconomy·2d

记得有个平台，叫啥来着？一次性可以把几个平台都用了

中文

222

94

875

174.3K

东京闲少@sawaxtokyo·2d

@Mark67358226 @Rococo90933671 自民党都没换过

中文

0

1

718

Lhn002@Mark67358226·3d

@Rococo90933671 日韓也是獨裁統治嗎🤔，日本不瞭解韓國現在不至於吧

中文

27

0

90

35.5K

大狐帝国史官邹智萱（笔名段歌文）@Rococo90933671·3d

全员恶人企鹅：喂我花生

中文

429

130

2.9K

345.7K

东京闲少@sawaxtokyo·3d

@elonmusk Grok layout

English

0

1

45

Elon Musk@elonmusk·3d

Grok

OpenRouter@OpenRouter

The new Grok-4.3 from @xai is live on OpenRouter! Grok-4.3 releases at a lower price than Grok-4.2, while seeing a large jump in agentic performance: a 321 point increase to 1500 ELO on @ArtificialAnlys GDPval-AA, surpassing other top models despite the lower price.

English

1.8K

3.5K

23.2K

20.7M

东京闲少@sawaxtokyo·3d

@LouiseGiam Let’s see your grok fucking shit layout

English

0

86

Louise Giam@LouiseGiam·3d

Grok 4.3 on April 30!

Artificial Analysis@ArtificialAnlys

xAI has launched Grok 4.3, achieving 53 on the Artificial Analysis Intelligence Index with improved agentic performance, ~40% lower input price, and ~60% lower output price than Grok 4.20 The release of Grok 4.3 places @xAI just above Muse Spark and Claude Sonnet 4.6 on the Intelligence Index, and a 4 points ahead of the latest version of Grok 4.20. Grok 4.3 improves its Artificial Analysis Intelligence Index score while reducing cost to run the benchmark suite. Key Takeaways: ➤ Grok 4.3 improves on cost-per-intelligence relative to Grok 4.20 0309 v2: it scores higher on the Intelligence Index while costing less to run the full benchmark suite. Grok 4.3 costs $395 to run the Artificial Analysis Intelligence Index, around 20% lower than Grok 4.20 0309 v2, despite using more output tokens. This makes it one of the lower-cost models at its intelligence level ➤ Large increase in real world agentic task performance: The largest single benchmark improvement is on GDPval-AA, where Grok 4.3 scores an ELO of 1500, up 321 points from Grok 4.20 0309 v2’s score of 1179 Grok 4.3, surpassing Gemini 3.1 Pro Preview, Muse Spark, Gpt-5.4 mini (xhigh), and Kimi K2.5. Grok 4.3 narrows the gap to the leading model on GDPval-AA, but still trails GPT-5.5 (xhigh) by 276 Elo points, with an expected win rate of ~17% against GPT-5.5 (xhigh) under the standard Elo formula ➤ Grok 4.3’s performs strongly on instruction following and agentic customer support tasks. It gains 5 points on 𝜏²-Bench Telecom to reach 98%, in line with GLM-5.1. Grok 4.3 maintains an 81% IFBench score from Grok 4.20 0309 v2 ➤ Gains 8 points on AA-Omniscience Accuracy, but at the cost of lower AA-Omniscience Non-Hallucination Rate of 8 points, so Grok 4.20 0309 v2 still leads AA-Omniscience Non-Hallucination Rate, followed by MiMo-V2.5-Pro, in line with Grok 4.3 Congratulations to @xAI and @elonmusk on the impressive release!

English

7

4

200

8K

东京闲少@sawaxtokyo·3d

holy shit the grok app layout @elonmusk #grok #xAI

English

1

0

1

22

东京闲少@sawaxtokyo·3d

@TheCaptainEli @elonmusk

QAM

0

7

东京闲少@sawaxtokyo·3d

@TheCaptainEli fix the damn fuck layout please

English

1

0

19

Captain Eli@TheCaptainEli·4d

Another Grok update from one hour ago. The second one today..

English

5

6

34

1.4K

东京闲少@sawaxtokyo·3d

@XFreeze useless where is the cross session memory.md

English

0

48

X Freeze@XFreeze·3d

Grok Imagine Agent Mode (Beta) just went live on Grok web It’s a full creative agent working on one infinite open canvas Grok Agent plans → generates → edits → iterates everything automatically in the same workspace Tell it what you want and watch it plan, generate, edit, and iterate everything in one seamless workspace: • 🎬 “Generate a 1-minute cinematic film” • 📚 “Create a complete manga set” • 🛍️ “Build UGC product stories” This is the real leap from simple prompts to end-to-end creative production This is the biggest upgrade to Grok Imagine yet