W. Nakabayashi
724 posts

W. Nakabayashi
@WidVelv
At times, what defeats a person is not a strong enemy but one’s own heart.






🇧🇷 REAL TIME BIG DATA: Pesquisa para presidente. 🔴 Lula (PT): 39% 🟢 Flávio Bolsonaro (PL): 30% 🔵 Ratinho Junior (PSD): 10% 🟠 Romeu Zema (Novo): 3% ⚫ Aldo Rebelo (DC): 2% 🟡 Renan Santos (Missão): 1%






Powell’s Pause: A Gamble Wrapped in Uncertainty: capitalspectator.com/powells-pause-…




Se Trump não desistir desse conflito a tempo, ele será responsável por dois choques inflacionários em menos de doze meses: tarifas (bens) e petróleo (energia). A inflação, medida pelo PCE, permanece mais próxima de 3% do que da meta do Fed.





xAI has released Grok 4.20 for API access in beta, and it scores 48 on the Artificial Analysis Intelligence Index with reasoning enabled Compared to @xAI’s previous Grok 4 flagship, Grok 4.20 Beta 0309 is an intelligence upgrade, achieving +6 points on the Intelligence Index. It launches with a longer 2M token context window (up from Grok 4’s 256K context window, matching Grok 4.1 Fast’s 2M), and significantly lower pricing ($2/$6 vs Grok 4’s $3/$15). Grok 4.20’s performance lags behind the current intelligence frontier, but it performs strongly on instruction following and features a notably low hallucination rate, beating all other models we’ve tested on AA-Omniscience for hallucination. xAI released 3 variants: reasoning, non-reasoning, and multi-agent. We’ve evaluated the reasoning and non-reasoning modes, and are considering the best approach for testing the new multi-agent functionality, which parallelizes over multiple agents behind the scenes in one API call. Key takeaways: ➤ Improved intelligence over Grok 4: Grok 4.20 Beta 0309 (Reasoning) scores 48 on the Artificial Analysis Intelligence Index, +6 from Grok 4 and +9 compared to Grok 4.1 Fast. This score falls short of the current intelligence frontier at 57 (Gemini 3.1 Pro Preview and GPT-5.4) ➤ Low price for the level of intelligence: Grok 4.20 is priced at $2/$6 per 1M input/output tokens, representing a decrease compared with Grok 4’s $3/$15 API rates. The reasoning variant cost $484 to complete the evaluations in the Artificial Analysis Intelligence Index, which is a reduction of ~70% compared to Grok 4, driven by lower pricing and lower token use ➤ Leading non-hallucination rate: Grok 4.20 scores 78% in the AA-Omniscience non-hallucination metric. This is the best result we have seen yet for this metric, and reflects the model only answering around one fifth of the time when it did not know the answer ➤ Fast inference performance: xAI is serving Grok 4.20 at 267 tokens per second - similar to what we see for gpt-oss-120b across providers and on the Pareto frontier for speed versus intelligence ➤ Mixed improvements in tool use: Grok 4.20 improved on the tool calling performance of Grok 4 in some evaluations, scoring 97% on Tau2-Telecom. However, its score of 1,062 on GDPval-AA, our benchmark of general agent performance on real work tasks, is well behind frontier peers and sits approximately in line with Grok 4.1 Fast


















