Raphi-2Code

11.9K posts


@R2Cdev_

Joined January 2024
651 Following · 252 Followers
Pinned Tweet
Raphi-2Code @R2Cdev_
GPT-5.4 Pro is a lot better at composing music than GPT-5.2 Pro.
0 replies · 0 reposts · 4 likes · 147 views
Raphi-2Code reposted
ミツキヨ(Mitsukiyo) @mitsukiyo_5
It's now the strongest MacBook on Earth.
[image]
24 replies · 85 reposts · 1.4K likes · 104K views
Raphi-2Code reposted
Harshith @HarshithLucky3
No AGI in 2026
2 replies · 2 reposts · 11 likes · 4.3K views
Raphi-2Code reposted
David Shapiro (L/0) @DaveShapi
People who are wrong:
- Degrowthers
- Decels
- Doomers
- "AI is a bubble"
- "AI is hitting a wall"
- "Data centers are bad"
They're ALL WRONG.
30 replies · 19 reposts · 204 likes · 3.8K views
Raphi-2Code reposted
Tech Dev Notes @techdevnotes
New Grok Imagine account on X
[image]
20 replies · 13 reposts · 129 likes · 15.9K views
Raphi-2Code reposted
Angel 🌼 @Angaisb_
Spring is here! Finally got rid of the ❄️ emoji.
[image]
7 replies · 1 repost · 41 likes · 5.3K views
Raphi-2Code reposted
Theo - t3.gg @theo
Since OpenAI dropped gpt-oss-120b, Mistral has released 4 models that are worse than gpt-oss-120b.
Artificial Analysis @ArtificialAnlys

Mistral has released Mistral Small 4, an open weights model with hybrid reasoning and image input, scoring 27 on the Artificial Analysis Intelligence Index.

@MistralAI's Small 4 is a 119B mixture-of-experts model with 6.5B active parameters per token, supporting both reasoning and non-reasoning modes. In reasoning mode, Mistral Small 4 scores 27 on the Artificial Analysis Intelligence Index, a 12-point improvement from Small 3.2 (15), making it among the most intelligent models Mistral has released, surpassing Mistral Large 3 (23) and matching the proprietary Magistral Medium 1.2 (27). However, it lags open weights peers with similar total parameter counts such as gpt-oss-120B (high, 33), NVIDIA Nemotron 3 Super 120B A12B (Reasoning, 36), and Qwen3.5 122B A10B (Reasoning, 42).

Key takeaways:
➤ Reasoning and non-reasoning modes in a single model: Mistral Small 4 supports configurable hybrid reasoning with reasoning and non-reasoning modes, rather than the separate reasoning variants Mistral has previously released with its Magistral models. In reasoning mode, the model scores 27 on the Artificial Analysis Intelligence Index; in non-reasoning mode it scores 19, a 4-point improvement from its predecessor Mistral Small 3.2 (15).
➤ More token efficient than peers of similar size: At ~52M output tokens, Mistral Small 4 (Reasoning) uses fewer tokens to run the Artificial Analysis Intelligence Index than reasoning models such as gpt-oss-120B (high, ~78M), NVIDIA Nemotron 3 Super 120B A12B (Reasoning, ~110M), and Qwen3.5 122B A10B (Reasoning, ~91M). In non-reasoning mode, the model uses ~4M output tokens.
➤ Native support for image input: Mistral Small 4 is a multimodal model, accepting image input as well as text. On our multimodal evaluation, MMMU-Pro, Mistral Small 4 (Reasoning) scores 57%, ahead of Mistral Large 3 (56%) but behind Qwen3.5 122B A10B (Reasoning, 75%). Neither gpt-oss-120B nor NVIDIA Nemotron 3 Super 120B A12B supports image input. All models support text output only.
➤ Improvement in real-world agentic tasks: Mistral Small 4 scores an Elo of 871 on GDPval-AA, our evaluation based on OpenAI's GDPval dataset that tests models on real-world tasks across 44 occupations and 9 major industries, with models producing deliverables such as documents, spreadsheets, and diagrams in an agentic loop. This is more than double the Elo of Small 3.2 (339) and close to Mistral Large 3 (880), but behind gpt-oss-120B (high, 962), NVIDIA Nemotron 3 Super 120B A12B (Reasoning, 1021), and Qwen3.5 122B A10B (Reasoning, 1130).
➤ Lower hallucination rate than peer models of similar size: Mistral Small 4 scores -30 on AA-Omniscience, our evaluation of knowledge reliability and hallucination, where scores range from -100 to 100 (higher is better) and a negative score indicates more incorrect than correct answers. This is ahead of gpt-oss-120B (high, -50), Qwen3.5 122B A10B (Reasoning, -40), and NVIDIA Nemotron 3 Super 120B A12B (Reasoning, -42).

Key model details:
➤ Context window: 256K tokens (up from 128K on Small 3.2)
➤ Pricing: $0.15/$0.60 per 1M input/output tokens
➤ Availability: Mistral first-party API only. At native FP8 precision, Mistral Small 4's 119B parameters require ~119GB to self-host the weights (more than the 80GB of HBM3 memory on a single NVIDIA H100)
➤ Modality: Image and text input with text output only
➤ Licensing: Apache 2.0 license

55 replies · 9 reposts · 837 likes · 52.1K views
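The self-hosting and pricing figures in the quoted post above lend themselves to a quick back-of-envelope check. A minimal Python sketch, assuming FP8 means one byte per parameter and ignoring KV-cache and activation overhead:

```python
def weight_memory_gb(params_billions: float, bytes_per_param: float = 1.0) -> float:
    """Approximate memory for model weights alone, in GB (1B params * 1 byte = 1 GB)."""
    return params_billions * bytes_per_param

def output_cost_usd(output_tokens: float, price_per_mtok: float) -> float:
    """API cost for a given number of output tokens at a per-1M-token price."""
    return output_tokens / 1e6 * price_per_mtok

# Mistral Small 4: 119B parameters at native FP8.
print(weight_memory_gb(119))                  # 119.0 GB -> more than one H100's 80GB of HBM3
# ~52M output tokens to run the Intelligence Index, at $0.60 per 1M output tokens.
print(round(output_cost_usd(52e6, 0.60), 2))  # 31.2
```

So running the full Intelligence Index in reasoning mode would cost roughly $31 in output tokens alone at the listed price.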
Raphi-2Code @R2Cdev_
And on ARC-AGI-1, GPT-5.4 High is a lot better and cheaper than Mini xHigh!
[image]
0 replies · 0 reposts · 0 likes · 17 views
Raphi-2Code reposted
Chris @chatgpt21
GPT-5.4 Mini/Nano on ARC-AGI-2
GPT-5.4 Mini:
- xHigh: 19%
- High: 13%
- Med: 4%
- Low: 1%
GPT-5.4 Mini is 3× cheaper per token, but used 3× more reasoning tokens and performed 3× worse than GPT-5.4 High.
[image]
12 replies · 7 reposts · 128 likes · 13K views
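The price/usage tradeoff in the post above can be made concrete with toy numbers: if a model is 3× cheaper per token but burns 3× as many reasoning tokens, the per-task cost is a wash. The prices and token counts below are illustrative assumptions, not real OpenAI figures:

```python
def task_cost(price_per_mtok: float, tokens_used: float) -> float:
    """Cost of one task: per-1M-token price times tokens consumed."""
    return price_per_mtok * tokens_used / 1e6

# Hypothetical numbers: "High" at $3/1M tokens using 1M tokens,
# "Mini" at a third of the price but three times the tokens.
big = task_cost(price_per_mtok=3.0, tokens_used=1_000_000)
mini = task_cost(price_per_mtok=1.0, tokens_used=3_000_000)
print(big == mini)  # True: the per-token discount is eaten by the extra reasoning tokens
```

Under these assumptions the cheaper model delivers worse scores at the same effective cost per task.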
Raphi-2Code reposted
Tech Dev Notes @techdevnotes
xAI has removed the grok-code-fast-1 model from the Available Models list in the console.
[image]
11 replies · 3 reposts · 100 likes · 7.2K views
Raphi-2Code reposted
Ivan Fioravanti ᯅ @ivanfioravanti
MLX benchmark results for Qwen3.5-35B-A3B-4bit: M5 Max vs M3 Ultra! The winner is the M5 Max! 🥇 Details for each run in 🧵
[image]
8 replies · 4 reposts · 63 likes · 3.8K views
Raphi-2Code reposted
Acer @AcerFur
I love that GPT-5.4 Pro can write me 200 pages of a PDF in two days… and it's only ~60% done, oof.
2 replies · 2 reposts · 139 likes · 10.7K views
Raphi-2Code reposted
Espen JD @Snixtp
I've seen very little discussion about how the new GPT-5.4 mini and nano compare to 5.3-Codex-Spark, so I did some research and created this. I focused only on official OpenAI sources, as there's a lot of misinformation out there. Hope you enjoy :)
Espen JD @Snixtp
x.com/i/article/2034…

0 replies · 1 repost · 3 likes · 119 views