Bertho

72.7K posts

Bertho banner
Bertho

Bertho

@berthojoris

Enjoy your life

Jakarta Katılım Kasım 2010
745 Takip Edilen386 Takipçiler
Bertho retweetledi
Maestro
Maestro@maestro__dev·
Also in 2.6.0: • Parallel local iOS simulator runs now work • Cleaner relative paths across CLI and Cloud • iOS XCTest logs in debug output dir • maestro hierarchy outputs pure JSON • HTML report links to Cloud + binary ID Full release notes: maestro.dev/blog/maestro-c…
English
0
1
0
857
Command Code
Command Code@CommandCodeAI·
The best coding agent plan doesn't exi……… A dollar for $20 of Qwen 3.7 Max usage? A dollar for $40 of DeepSeek V4 Pro usage? Hard to say no to that.
Command Code tweet media
English
27
16
443
211.5K
Kaito
Kaito@KaiXCreator·
Are you team Claude or Codex?
English
397
6
247
38.4K
eric zakariasson
eric zakariasson@ericzakariasson·
what do you think of composer 2.5 so far? how can we make the next model even better? want to hear your feedback on behavior, speed, quality, whatever!
eric zakariasson tweet media
English
437
21
1.2K
109.5K
Bertho retweetledi
Bertho
Bertho@berthojoris·
@anvie Gokil sih ini qwen 3.7max
Indonesia
0
0
0
144
Robin Syihab
Robin Syihab@anvie·
Nyobain benchmark Qwen3.7-MAX, dapat 97%, mantep sih ini model.
Robin Syihab tweet mediaRobin Syihab tweet media
Indonesia
5
3
83
6.1K
Bertho retweetledi
atomic.chat
atomic.chat@atomic_chat_hq·
Qwen 3.7-max beats Opus 4.7 and GPT-5.5 We tested three frontier models on a real agentic task: write a Tetris bot that plays the game and trains itself. Each model could read its own code, run benchmarks, and rewrite itself across 10 iterations. Then we compared the final bots head to head. Qwen 3.7-Max: training cost $1.32, bot improvement +56% Claude Opus 4.7: training cost $12.15, bot improvement +28% GPT-5.5: training cost $2.85, bot improvement +7% Qwen won on every dimension - biggest jump, 9× cheaper than Claude, 2× cheaper than GPT. Long agentic loops is where Qwen Max actually delivers.
English
183
473
4.5K
843.7K
Chris Laupama
Chris Laupama@chrislaupama·
I think Composer 2.5 replaces Opus 4.7 for me now.
English
19
5
272
10K
Bertho retweetledi
Alibaba Group
Alibaba Group@AlibabaGroup·
Qwen3.7-Max is live! 🚀 Introducing the latest proprietary model, built for advanced agentic coding, complex reasoning, and long-horizon execution. It’s here to transform how we approach complex tasks.
Alibaba Group tweet media
English
72
245
1.7K
2.7M
Bertho retweetledi
Qwen
Qwen@Alibaba_Qwen·
🚀🚀Qwen3.7 Preview lands on Arena ! Here come Qwen3.7-Max-Preview & Qwen3.7-Plus-Preview. Alibaba now #6 lab in Text, #5 in Vision.⚡️⚡️ Can't wait to release Qwen3.7 series models!Stay tuned! @arena
Arena.ai@arena

Qwen3.7 Preview By @Alibaba_Qwen lands on Arena for Text and Vision. In Text Arena, Qwen3.7 Max Preview ranks #13 overall. Alibaba is now the #6 lab in this arena. - #7 Math - #9 Expert - #9 Software & IT - #10 Coding In Vision Arena: Qwen3.7 Plus Preview ranks #16 overall, making Alibaba the #5 lab. Congrats to the @Alibaba_Qwen team on the latest progress!

English
198
378
3.4K
618.4K
ISM ☀️
ISM ☀️@ism_sol·
jujur ae dev mana model open source AI paling mantep ? 🤖 1. Kimi K2.6 2. DeepSeek v4 2. Qwen 3.6 4. GLM-5.1
Indonesia
68
10
267
22.4K
Bertho retweetledi
OpenAI
OpenAI@OpenAI·
You've been asking for this one... Now in preview: Codex in the ChatGPT mobile app. Start new work, review outputs, steer execution, and approve next steps, all from the ChatGPT mobile app. Codex will keep running on your laptop, Mac mini, or devbox.
English
1.7K
2.6K
22K
4.7M
Bertho retweetledi
Kai
Kai@hqmank·
ChatGPT Business is free for 2 months right now🎁. Not just discounted. Fully free for 2 months. US region only, from what I tested. Promo code: STRIPEATLASGPT4BIZ050126
Kai tweet media
English
96
149
2.6K
536.8K
Bertho retweetledi
GMI Cloud
GMI Cloud@gmi_cloud·
we compared Gemini 3.1 Pro, Opus 4.7, and GPT 5.5 to Kimi K2.6, Xiaomi Mimo v2.5, and Qwen 3.6 Max in average, closed source are faster. GPT 5.5 and Opus 4.7 the fastest. Kimi K2.6 comes after. It keeps up thanks to its native INT4 quantization. MiMo took the longest, but the refinement and aesthetic only ranks after Gemini 3.1. as if a mini Gemini. Its slow cuz it's trained for long-horizon agentic work. Claude Opus 4.7 is blooming in a very different style, probably because Anthropic trains for taste, not just accuracy.
English
17
25
323
38.8K