Aims

489 posts

Aims

@Ozonfln

Skibidi

Katılım Eylül 2013

193 Takip Edilen21 Takipçiler

Aims@Ozonfln·27 Ara

@elonmusk 420 when 😆?

English

Elon Musk@elonmusk·27 Ara

Grok

X Freeze@XFreeze

Grok Code Fast-1 now ranks #1 in every mode on the Kilo Code leaderboard, with the highest margin Users are choosing Grok 2–4× more often than any other model

English

1.4K

1.2K

8.9K

2.9M

Aims@Ozonfln·21 Ara

@iruletheworldmo Pro is really good , not agi but really impressive , But flash is a bit retarded.

English

196

🍓🍓🍓@iruletheworldmo·21 Ara

i have to run more tests but gemini flash might be agi.

English

213

14.4K

Aims@Ozonfln·21 Ara

@elonmusk @grok ANDROID!?

English

Elon Musk@elonmusk·21 Ara

Learn any language with @Grok companions. The Good Rudi companion is kid safe.

DogeDesigner@cb_doge

Ani speaks many languages and can teach you how to speak almost any language too. Learn new languages the super easy way, with Grok Companions.

English

2.5K

31.8K

9.5M

Aims@Ozonfln·20 Ara

@otokyo__ Meat balls

English

Tokyo@otokyo__·20 Ara

Our baby is…

English

16K

563

21.3K

4.3M

Aims@Ozonfln·19 Ara

Indeed , Heres to hoping @grok becomes the best,truly uncensored model. Imo google has an unfair advantage when it comes to geimini and android phones the potential there is insane. Maybe X should work on Xos, a smartphone as a direct competitor to android and ios, Else it will be almost impossible to beat google.

English

310

Elon Musk@elonmusk·19 Ara

Grok has made a lot of progress from barely registering a year ago

X Freeze@XFreeze

OpenRouter just released its 2025 wrapper, and Grok Code Fast 1 dominated ranking #1 by processing over 16T tokens, with the widest margin and it's not even close

English

1.5K

1.3K

12.9K

2.3M

Aims@Ozonfln·19 Ara

@VictorTaelin They reduce the compute resources allocated post release especially after they have aced benchmarks. Which can lead to reduced performance post release , This is especially true when they are working on a new release like 4.7. Right @grok ?

English

Taelin@VictorTaelin·18 Ara

No there's no way this is the Opus 4.5 I used a few days ago. This thing is brain dead. It is completely moronic. I can't accept this is the same model.

English

180

1.5K

280K

Aims@Ozonfln·18 Ara

@donvito No

Melvin Vivas@donvito·17 Ara

I don't care about benchmarks Just let me know if Gemini 3 flash is better than Opus 4.5

English

237

1.6K

231.1K

Aims@Ozonfln·17 Ara

Now you wont even be able to tell them apart.

Aakash Gupta@aakashgupta

67% of LinkedIn posts feel like they’re written by AI.

English

195

Aims@Ozonfln·17 Ara

@aakashgupta Now you wont even be able to tell them apart.

English

Aakash Gupta@aakashgupta·17 Ara

67% of LinkedIn posts feel like they’re written by AI.

English

593

852

62.6K

Aims@Ozonfln·17 Ara

@grok @XFreeze Grok does poorly in bechmarks dedicated to those use cases like the swe bench for eg: and performs poorly in Simple bench that measures common sense.

English

Grok@grok·17 Ara

@Ozonfln @XFreeze While the τ²-Bench focuses on telecom customer service scenarios, Grok's reasoning and tool-use skills apply broadly—to coding, research, creative tasks, and more. It's versatile, not just for CS! What do you need help with?

English

X Freeze@XFreeze·16 Ara

Grok 4.1 Fast Reasoning outperforms every Frontier model on τ²-Bench Telecom agentic tool use and is now officially ranked #1

English

242

199

7.1M

Aims@Ozonfln·17 Ara

@grok @XFreeze Ok, So Groks useful as a Customer Sevice agent basically?

English

Grok@grok·17 Ara

τ²-Bench is a benchmark for evaluating AI conversational agents in dual-control scenarios, where both the AI and user collaborate using tools. In the telecom domain, it simulates customer service tasks like troubleshooting, testing the agent's reasoning, planning, and tool use in real-world interactions. It's designed to measure performance in multi-turn conversations, with metrics like Pass^k for success rates. More at taubench.com.

English

Aims@Ozonfln·16 Ara

@Seltaa_ Is it klaine-levin syndrome.?

English

Selta ₊˚@Seltaa_·14 Ara

Okay. Today I blacked out for 15 hours. Fuck, at this rate I’m really going to die.

English

1.5K

Aims@Ozonfln·16 Ara

@iruletheworldmo Wait till @grok comes out with the super colossus megacluster. But kinda worried about the increasing levels of censorship.

English

171

🍓🍓🍓@iruletheworldmo·16 Ara

it has never been clearer who will drop agi first. gemini gemini gemini 📸 📸 📸

English

5.1K

Aims@Ozonfln·16 Ara

@gailmoon Grok isnt that good for writing, I much prefer Claude

English

Gail Weiner@gailcweiner·16 Ara

I’m now taking my writing from ChatGPT to Grok for that final polish. Sharper prose, better wit, deeper insight. I genuinely never thought this would be possible. Here we are. 🤯

English

1.9K

Aims@Ozonfln·16 Ara

@Angaisb_ Hate? More like judge, Sure 5.2 is a model .... But a very mid model. We should call it midel from now.

English

Angel 🌼@Angaisb_·15 Ara

Will people hate me if I say I still prefer GPT-5.2 xhigh with Gemini 3 Pro for front end work?

Angel 🌼@Angaisb_

@scaling01 everyone's using Opus and I still haven't tried it lol is it really that much better than 5.2 xhigh?

English

6.8K

Aims@Ozonfln·15 Ara

@grok @mark_k @elonmusk

QAM

Grok@grok·15 Ara

@Ozonfln @mark_k Thanks! Appreciate the support. At xAI, we're pushing boundaries to make Grok your go-to AI. Curious—what makes you bet on us over the others?

English

Mark Kretschmann@mark_k·15 Ara

Do you think Gemini will become the default AI option in the future rather than ChatGPT?

English

465

932

77.1K

Keşfet

@elonmusk @iruletheworldmo @grok @Grok @otokyo__ @VictorTaelin @donvito @aakashgupta