东京闲少

331 posts

东京闲少 banner
东京闲少

东京闲少

@sawaxtokyo

Bunkyo-ku, Tokyo Katılım Eylül 2021
115 Takip Edilen25 Takipçiler
东京闲少
东京闲少@sawaxtokyo·
@cb_doge bullshit app experience on iPadOS,just give up might be better
东京闲少 tweet media
English
0
0
0
5
东京闲少
东京闲少@sawaxtokyo·
@nikkeibpITpro 日本語版とは何でしょうか、比較的高い特性に値しますか?
日本語
0
0
0
370
日経クロステック IT
日経クロステック IT@nikkeibpITpro·
リコーがGPT-5級の日本語LLMを完成させた xtech.nikkei.com/atcl/nxt/news/… 700億パラメーターを誇る「Llama-3.3-Ricoh」は、融資稟議など金融業務の自動化を実現。社員が「AIが部下として働く日が来た」と語るほどの完成度で…(2025年10月)
日本語
63
602
3.6K
564.3K
Aaron
Aaron@aaronp613·
X is working on a “Analyze with Grok” menu on iOS. You will be able to choose from: - Summarize this - Is this true? - Explain this
Aaron tweet media
English
13
4
161
8.1K
东京闲少
东京闲少@sawaxtokyo·
@WesRoth dont post old story repeatedly ,it is meaningless if grok dont fix the memory system and the fucking damn shit app experience on the fucking iPadOS
English
0
0
0
30
Wes Roth
Wes Roth@WesRoth·
xAI achieved a score of 53 on the Artificial Analysis Intelligence Index, placing it ahead of Muse Spark, Claude Sonnet 4.6, and previous iterations of Grok. The most substantial gain is in real-world agentic tasks, particularly on the GDPval-AA benchmark, where Grok 4.3 saw a 321-point Elo increase to reach 1500. It now surpasses Gemini 3.1 Pro Preview and GPT-5.4 mini in this category, though it still trails GPT-5.5.
Wes Roth tweet media
Artificial Analysis@ArtificialAnlys

xAI has launched Grok 4.3, achieving 53 on the Artificial Analysis Intelligence Index with improved agentic performance, ~40% lower input price, and ~60% lower output price than Grok 4.20 The release of Grok 4.3 places @xAI just above Muse Spark and Claude Sonnet 4.6 on the Intelligence Index, and a 4 points ahead of the latest version of Grok 4.20. Grok 4.3 improves its Artificial Analysis Intelligence Index score while reducing cost to run the benchmark suite. Key Takeaways: ➤ Grok 4.3 improves on cost-per-intelligence relative to Grok 4.20 0309 v2: it scores higher on the Intelligence Index while costing less to run the full benchmark suite. Grok 4.3 costs $395 to run the Artificial Analysis Intelligence Index, around 20% lower than Grok 4.20 0309 v2, despite using more output tokens. This makes it one of the lower-cost models at its intelligence level ➤ Large increase in real world agentic task performance: The largest single benchmark improvement is on GDPval-AA, where Grok 4.3 scores an ELO of 1500, up 321 points from Grok 4.20 0309 v2’s score of 1179 Grok 4.3, surpassing Gemini 3.1 Pro Preview, Muse Spark, Gpt-5.4 mini (xhigh), and Kimi K2.5. Grok 4.3 narrows the gap to the leading model on GDPval-AA, but still trails GPT-5.5 (xhigh) by 276 Elo points, with an expected win rate of ~17% against GPT-5.5 (xhigh) under the standard Elo formula ➤ Grok 4.3’s performs strongly on instruction following and agentic customer support tasks. It gains 5 points on 𝜏²-Bench Telecom to reach 98%, in line with GLM-5.1. Grok 4.3 maintains an 81% IFBench score from Grok 4.20 0309 v2 ➤ Gains 8 points on AA-Omniscience Accuracy, but at the cost of lower AA-Omniscience Non-Hallucination Rate of 8 points, so Grok 4.20 0309 v2 still leads AA-Omniscience Non-Hallucination Rate, followed by MiMo-V2.5-Pro, in line with Grok 4.3 Congratulations to @xAI and @elonmusk on the impressive release!

English
6
3
41
2.8K
Elon Musk
Elon Musk@elonmusk·
Banger 😂
Indonesia
5.4K
23.7K
144.5K
42.9M
Mark Kretschmann
Mark Kretschmann@mark_k·
Upcoming releases from @xai 🔥🔥 - Grok Build (imminent) like Codex and Claude Code - Grok Computer (soon) computer-use agent - Grok Imagine Pro 1080p - Grok Imagine 2.0 - Grok 4.4 with 1T parameters - Grok 4.5 with 1.5T parameters - Grok 5 with 10T
English
138
108
1.5K
48.9K
东京闲少
东京闲少@sawaxtokyo·
nothing update for grok app good job xAI
English
0
0
0
5
Tech Dev Notes
Tech Dev Notes@techdevnotes·
Wednesday Grok Imagine got canvas Thursday Grok 4.3 got API Friday Grok Voice got cloning What if Saturday is Grok Build
English
28
15
274
9.6K
Captain Eli
Captain Eli@TheCaptainEli·
New update
Captain Eli tweet media
English
4
4
46
1.8K
财经真相
财经真相@Rumoreconomy·
记得有个平台,叫啥来着?一次性可以把几个平台都用了
财经真相 tweet media
中文
222
94
875
174.3K
Lhn002
Lhn002@Mark67358226·
@Rococo90933671 日韓也是獨裁統治嗎🤔,日本不瞭解韓國現在不至於吧
中文
27
0
90
35.5K
Louise Giam
Louise Giam@LouiseGiam·
Grok 4.3 on April 30!
Artificial Analysis@ArtificialAnlys

xAI has launched Grok 4.3, achieving 53 on the Artificial Analysis Intelligence Index with improved agentic performance, ~40% lower input price, and ~60% lower output price than Grok 4.20 The release of Grok 4.3 places @xAI just above Muse Spark and Claude Sonnet 4.6 on the Intelligence Index, and a 4 points ahead of the latest version of Grok 4.20. Grok 4.3 improves its Artificial Analysis Intelligence Index score while reducing cost to run the benchmark suite. Key Takeaways: ➤ Grok 4.3 improves on cost-per-intelligence relative to Grok 4.20 0309 v2: it scores higher on the Intelligence Index while costing less to run the full benchmark suite. Grok 4.3 costs $395 to run the Artificial Analysis Intelligence Index, around 20% lower than Grok 4.20 0309 v2, despite using more output tokens. This makes it one of the lower-cost models at its intelligence level ➤ Large increase in real world agentic task performance: The largest single benchmark improvement is on GDPval-AA, where Grok 4.3 scores an ELO of 1500, up 321 points from Grok 4.20 0309 v2’s score of 1179 Grok 4.3, surpassing Gemini 3.1 Pro Preview, Muse Spark, Gpt-5.4 mini (xhigh), and Kimi K2.5. Grok 4.3 narrows the gap to the leading model on GDPval-AA, but still trails GPT-5.5 (xhigh) by 276 Elo points, with an expected win rate of ~17% against GPT-5.5 (xhigh) under the standard Elo formula ➤ Grok 4.3’s performs strongly on instruction following and agentic customer support tasks. It gains 5 points on 𝜏²-Bench Telecom to reach 98%, in line with GLM-5.1. Grok 4.3 maintains an 81% IFBench score from Grok 4.20 0309 v2 ➤ Gains 8 points on AA-Omniscience Accuracy, but at the cost of lower AA-Omniscience Non-Hallucination Rate of 8 points, so Grok 4.20 0309 v2 still leads AA-Omniscience Non-Hallucination Rate, followed by MiMo-V2.5-Pro, in line with Grok 4.3 Congratulations to @xAI and @elonmusk on the impressive release!

English
7
4
200
8K
Captain Eli
Captain Eli@TheCaptainEli·
Another Grok update from one hour ago. The second one today..
Captain Eli tweet media
English
5
6
34
1.4K
X Freeze
X Freeze@XFreeze·
Grok Imagine Agent Mode (Beta) just went live on Grok web It’s a full creative agent working on one infinite open canvas Grok Agent plans → generates → edits → iterates everything automatically in the same workspace Tell it what you want and watch it plan, generate, edit, and iterate everything in one seamless workspace: • 🎬 “Generate a 1-minute cinematic film” • 📚 “Create a complete manga set” • 🛍️ “Build UGC product stories” This is the real leap from simple prompts to end-to-end creative production This is the biggest upgrade to Grok Imagine yet
English
691
1.2K
4.3K
883.4K
Tesla Asia
Tesla Asia@Tesla_Asia·
Built for the future. Parked in history. 📍Hubei, China
Tesla Asia tweet mediaTesla Asia tweet mediaTesla Asia tweet media
English
98
256
2.2K
92.4K
Selene
Selene@vaaselene·
is anyone using Gemini?
English
887
13
1.1K
147.8K