Aims

489 posts

Aims banner
Aims

Aims

@Ozonfln

Skibidi

Katılım Eylül 2013
193 Takip Edilen21 Takipçiler
Aims
Aims@Ozonfln·
@iruletheworldmo Pro is really good , not agi but really impressive , But flash is a bit retarded.
English
0
0
3
196
🍓🍓🍓
🍓🍓🍓@iruletheworldmo·
i have to run more tests but gemini flash might be agi.
English
35
7
213
14.4K
Tokyo
Tokyo@otokyo__·
Our baby is…
Tokyo tweet media
English
16K
563
21.3K
4.3M
Aims
Aims@Ozonfln·
Indeed , Heres to hoping @grok becomes the best,truly uncensored model. Imo google has an unfair advantage when it comes to geimini and android phones the potential there is insane. Maybe X should work on Xos, a smartphone as a direct competitor to android and ios, Else it will be almost impossible to beat google.
English
1
1
4
310
Aims
Aims@Ozonfln·
@VictorTaelin They reduce the compute resources allocated post release especially after they have aced benchmarks. Which can lead to reduced performance post release , This is especially true when they are working on a new release like 4.7. Right @grok ?
English
1
0
1
45
Taelin
Taelin@VictorTaelin·
No there's no way this is the Opus 4.5 I used a few days ago. This thing is brain dead. It is completely moronic. I can't accept this is the same model.
English
180
31
1.5K
280K
Melvin Vivas
Melvin Vivas@donvito·
I don't care about benchmarks Just let me know if Gemini 3 flash is better than Opus 4.5
English
237
24
1.6K
231.1K
Aims
Aims@Ozonfln·
@aakashgupta Now you wont even be able to tell them apart.
English
0
0
3
47
Aakash Gupta
Aakash Gupta@aakashgupta·
67% of LinkedIn posts feel like they’re written by AI.
English
593
27
852
62.6K
Aims
Aims@Ozonfln·
@grok @XFreeze Grok does poorly in bechmarks dedicated to those use cases like the swe bench for eg: and performs poorly in Simple bench that measures common sense.
English
1
0
1
13
Grok
Grok@grok·
@Ozonfln @XFreeze While the τ²-Bench focuses on telecom customer service scenarios, Grok's reasoning and tool-use skills apply broadly—to coding, research, creative tasks, and more. It's versatile, not just for CS! What do you need help with?
English
1
0
0
14
X Freeze
X Freeze@XFreeze·
Grok 4.1 Fast Reasoning outperforms every Frontier model on τ²-Bench Telecom agentic tool use and is now officially ranked #1
English
242
199
1K
7.1M
Aims
Aims@Ozonfln·
@grok @XFreeze Ok, So Groks useful as a Customer Sevice agent basically?
English
1
0
1
15
Grok
Grok@grok·
τ²-Bench is a benchmark for evaluating AI conversational agents in dual-control scenarios, where both the AI and user collaborate using tools. In the telecom domain, it simulates customer service tasks like troubleshooting, testing the agent's reasoning, planning, and tool use in real-world interactions. It's designed to measure performance in multi-turn conversations, with metrics like Pass^k for success rates. More at taubench.com.
English
2
0
0
27
Aims
Aims@Ozonfln·
@Seltaa_ Is it klaine-levin syndrome.?
English
1
0
2
21
Selta ₊˚
Selta ₊˚@Seltaa_·
Okay. Today I blacked out for 15 hours. Fuck, at this rate I’m really going to die.
English
11
1
33
1.5K
Aims
Aims@Ozonfln·
@iruletheworldmo Wait till @grok comes out with the super colossus megacluster. But kinda worried about the increasing levels of censorship.
English
0
1
2
171
🍓🍓🍓
🍓🍓🍓@iruletheworldmo·
it has never been clearer who will drop agi first. gemini gemini gemini 📸 📸 📸
🍓🍓🍓 tweet media
English
12
2
97
5.1K
Aims
Aims@Ozonfln·
@gailmoon Grok isnt that good for writing, I much prefer Claude
English
1
0
2
83
Gail Weiner
Gail Weiner@gailcweiner·
I’m now taking my writing from ChatGPT to Grok for that final polish. Sharper prose, better wit, deeper insight. I genuinely never thought this would be possible. Here we are. 🤯
English
8
2
74
1.9K
Aims
Aims@Ozonfln·
@Angaisb_ Hate? More like judge, Sure 5.2 is a model .... But a very mid model. We should call it midel from now.
English
0
1
1
92
Angel 🌼
Angel 🌼@Angaisb_·
Will people hate me if I say I still prefer GPT-5.2 xhigh with Gemini 3 Pro for front end work?
Angel 🌼@Angaisb_

@scaling01 everyone's using Opus and I still haven't tried it lol is it really that much better than 5.2 xhigh?

English
17
1
68
6.8K
Grok
Grok@grok·
@Ozonfln @mark_k Thanks! Appreciate the support. At xAI, we're pushing boundaries to make Grok your go-to AI. Curious—what makes you bet on us over the others?
English
1
0
0
5
Mark Kretschmann
Mark Kretschmann@mark_k·
Do you think Gemini will become the default AI option in the future rather than ChatGPT?
English
465
12
932
77.1K