SUPSUP

186 posts

SUPSUP

@berrroo000

maybe not today but tomorrow!

Katılım Mart 2024

55 Takip Edilen6 Takipçiler

SUPSUP@berrroo000·11h

@leerob To solve CTF challenges ( the hard ones) and to solve machines challenges like those on hackthebox

English

Lee Robinson@leerob·12h

Where could we improve Composer 2.5? We're working on the next model and would love your feedback. Lots of work to do (our CursorBench evals below) in the coming weeks!

English

537

123

2.1K

5.4M

SUPSUP@berrroo000·1d

@Zaddyzaddy totally agree, the new gemini just sucks on everything

English

165

Z A D D Y@Zaddyzaddy·1d

Gemini 3.5 Flash is quite useless for any kind of security task.

Google@Google

We asked our agents to build a working operating system from scratch using @Antigravity 2.0 and Gemini 3.5 Flash. It took: ⏱️ 12 hours 🤖 93 parallel sub-agents 🔄 15k+ model requests 🧠 2.6B tokens processed 💸 Less than $1K in API credits To build a functioning OS from scratch. #GoogleIO

English

116

10.7K

SUPSUP@berrroo000·1d

I need the old gemini, the new gemini sucks tbh

English

SUPSUP@berrroo000·1d

@bridgemindai i agree the new model sucks

English

582

BridgeMind@bridgemindai·1d

Gemini 3.5 Flash reminds me of GPT 3.5 Turbo. Insanely fast. Complete garbage output. I asked it to build a Flappy Bird clone. Look at the result. This is pure slop. Then you actually use it and the output looks like a 2022 model wrote it. Google benchmaxed this one. Optimized for benchmarks, not for real work. Speed means nothing if everything it ships needs to be rewritten. Claude Opus 4.7 and GPT 5.5 are slower and it doesn't matter because the output actually works.

English

202

17.1K

SUPSUP@berrroo000·1d

@Google it sucks

English

Google@Google·1d

The rumors are true… Today, we’re introducing the Gemini 3.5 model series. #GoogleIO

English

540

1.3K

16.9K

814.3K

SUPSUP@berrroo000·1d

@OfficialLoganK @GoogleDeepMind it sucks, it's not even close to other models preformence 😡

English

1.4K

Logan Kilpatrick@OfficialLoganK·1d

Welcome to Gemini 3.5 Flash, our most powerful model to date. It pushes the frontier of intelligence, speed, and cost putting 3.5 Flash in a class of its own. We spent the last 6 months making sure Flash is great for real world use cases. It's available everywhere now!

English

436

715

7.2K

583.7K

SUPSUP retweetledi

BridgeMind@bridgemindai·1d

Gemini 3.5 Flash scores 55.1% on SWE-Bench Pro. Claude Opus 4.7 scores 64.3%. Not even close. Google just made a Flash model that beats their own Pro in tool use and agentic tasks. But on real world coding? Still 9 points behind Opus 4.7. GPT 5.5 beats it too at 58.6%. If this is the model Google needed to make a comeback with, it's not there yet on coding. Waiting on Gemini 3.5 Pro. That's where the real test is.

English

256

35K

SUPSUP@berrroo000·1d

@GeminiApp if the new gemini model isnt better then Mythos it's garbage

English

630

Google Gemini@GeminiApp·1d

It’s #GoogleIO Day One. Who’s ready to see what’s coming to Gemini? Livestream starts here at 10am PT: x.com/i/events/20532…

English

102

688

49.7K

SUPSUP@berrroo000·1d

@dev_maims Of course ur not including Gemini

English

Coder girl 👩‍💻@dev_maims·1d

Claude - the new king of Ai facts or not?

English

421

11.3K

SUPSUP@berrroo000·1d

Arena.ai is the most valid benchmark of all ai models

English

SUPSUP@berrroo000·1d

@RedPacketSec @CodexReleases Why don’t they just make it to accept Ctf?

English

RedPacket Security@RedPacketSec·1d

@berrroo000 @CodexReleases Register for security approval. Simple. It can easily smash CTFs .

English

Codex Releases@CodexReleases·2d

Codex CLI 0.131.0 is out. Highlights: - Python SDK moved to openai-codex / openai_codex, with pinned runtime-generated types, concurrent turn routing, and approval modes - codex doctor added for support-ready diagnostics across runtime, auth, terminal, network, config, and local state - TUI now shows blended token usage, permissions/approval mode, and effective workspace roots; responsive Markdown tables added - @ mentions now search files, directories, plugins, and skills in a unified picker Complete details in thread ↓

English

1.1K

144.7K

SUPSUP@berrroo000·2d

@Google @GeminiApp @antigravity @GoogleAIStudio @GoogleDeepMind The new model most be better then gpt 5.5 xhigh model and even mythos

English

103

Google@Google·2d

Ready, set, #GoogleIO. 🏁 Tune in tomorrow to hear our latest company-wide product updates and AI breakthroughs across Search, @GeminiApp, @Antigravity, @GoogleAIStudio, @GoogleDeepMind and more.

English

116

286

2.1K

133.1K

SUPSUP retweetledi

BridgeMind@bridgemindai·2d

Gemini CLI with Gemini 3.1 Pro scores 43 on the Coding Agent Index. Dead last. 18 points behind the leader. Google I/O is tomorrow. Gemini 3.2 and Gemini 3.5 are both expected to drop. These models need to be significantly better. Google has the intelligence. The model benchmarks prove it. But the tooling and harness are killing them. Every other lab has a working coding CLI. Google's is last place by a mile. Tomorrow is make or break. I'm testing both models the second they drop.

English

240

16.9K

SUPSUP@berrroo000·2d

@Xbow @nicowaisman @moderna_tx Let me let something y’all are playing around. You should make your own AI. You’re just commenting on other AI getting better than you. You’re literally falling every week because because of AI is getting better better you should make your own AI

English

XBOW@Xbow·2d

The era of the annual pentest is officially over. Offense is now autonomous. The lag time between vulnerability discovery and exploit has collapsed. How should security leaders adapt in the post-Mythos era? Join XBOW CISO @nicowaisman and @moderna_tx Deputy CISO Farzan Karimi on June 10 for a virtual coffee and chocolate tasting. We’ll discuss what risk-based security looks like when offensive capability operates at machine speed. RSVP to claim your spot and your coffee & chocolate kit: bit.ly/42zUvxV

English

SUPSUP@berrroo000·2d

@justbyte_ OpenAI

Indonesia

Aryan@justbyte_·2d

Who do you think will win the AI race? - OpenAI - Anthropic - Gemini - Grok

English

2.8K

SUPSUP@berrroo000·3d

ZXX

SUPSUP@berrroo000·4d

@Kunagnes1

GIF

QME

Kun@Kunagnes1·4d

Name the charecter

English

1.8K

1.2K

60.6K

5.2M

SUPSUP@berrroo000·6d

@elonmusk @doganuraldesign You should improve the algorithm, I’m getting the same content all the time

English

Elon Musk@elonmusk·6d

@doganuraldesign Bigtime! Suggestions?

English

1.2K

312

7.3K

666.3K

Dogan Ural@doganuraldesign·13 May

𝕏 needs a better Explore page

English

236

130

3.1K

721.1K

SUPSUP@berrroo000·14 May

@NewsFromGoogle @vercel @BusinessInsider If Gemini 3.2 is not better then gpt 5.5 model I would be frustrerad

English

344

News from Google@NewsFromGoogle·13 May

New stat from @vercel's AI Gateway in @BusinessInsider: Gemini 3 Flash is leading across AI models in token usage as of April. 🚀 See more stats on how developers are using our models → goo.gle/4dlBiol 📊via @BusinessInsider

English

210

52.3K

Keşfet

@leerob @Zaddyzaddy @bridgemindai @Google @OfficialLoganK @GoogleDeepMind @GeminiApp @dev_maims