Heydari

620 posts

Heydari

@HeydariAI

Just-in-Time Machine Learning Engineer - Always monitoring the situation

Katılım Aralık 2024

130 Takip Edilen44 Takipçiler

Heydari@HeydariAI·6h

@test_tm7873 @googlecloud @GoogleCloudTech I really felt rejected after that 😭😂

English

testtm@test_tm7873·6h

@HeydariAI @googlecloud @GoogleCloudTech So true. We want answers 😭

English

Heydari@HeydariAI·8h

Can we get an explanation regarding the TRC program? I mean the official page is up but the team is not extending the access for older users @googlecloud @GoogleCloudTech

English

120

Heydari@HeydariAI·7h

@smykx I said yes to this urge

English

samyak@smykx·1d

the urge to learn high performance computing

English

123

5.5K

Heydari@HeydariAI·8h

@diegocabezas01 Nah

176

Diego | AI 🚀 - e/acc@diegocabezas01·17h

GPT 2018 had 117M parameters GPT-2 2019 had 1.5B parameters GPT-3 2020 had 175B parameters DeepSeek-V4-Pro 2026 has 1.6T parameters GPT-5.5 is ESTIMATED to have ~9T parameters.

English

334

43.4K

Heydari@HeydariAI·21h

@arcjax7 @test_tm7873 @googlecloud I preferred TPUs to be less famous man. I hope the don't shut down kaggle TPUs too

English

arcjax@arcjax7·23h

@HeydariAI @test_tm7873 @googlecloud probably just oversubscribed

English

testtm@test_tm7873·1d

Oh well it seems @googlecloud is shutting down the TRC program 😢 So many memories and I learn so much from it. Sadly. Maybe if we will make ourself hear they gona continue supporting us Opensource guys? :)) I hope so!

English

936

Heydari@HeydariAI·23h

@arcjax7 @test_tm7873 @googlecloud Yes I was asking for an extension

English

arcjax@arcjax7·1d

@HeydariAI @test_tm7873 @googlecloud did you get this after asking for an extension, or is this the message telling you to get off it

English

Heydari@HeydariAI·1d

Okay so they shuttted to program down for everyone, not just me So at least it was not about the fact that I'm Iranian 🤣

Heydari@HeydariAI

Okay TRC just said im out of the program somehow because im Iranian huh? Or other folks have similar issue, dc

English

Heydari@HeydariAI·1d

Okay TRC just said im out of the program somehow because im Iranian huh? Or other folks have similar issue, dc

English

Heydari@HeydariAI·2d

@arcjax7 Thats Peak...

English

arcjax@arcjax7·2d

i miss the old andrej, gpt from scratch andrej taught us backprop andrej, cs231n andrej shipped the lectures andrej, open source andrej i hate the new andrej, vibe code andrej closed source anthropic andrej, english is code andrej i still love andrej, i still love andrej

English

1.1K

Heydari@HeydariAI·4d

@arcjax7 If deepmind doesn't make 3.5 pro AGI I would compare it with MGK

English

110

arcjax@arcjax7·4d

openai: drake anthropic: kendrick deepmind: jcole

English

1.9K

Heydari@HeydariAI·5d

@arcjax7 Lots of Kanye contents I see

English

arcjax@arcjax7·5d

Jax is the truth Pytorch is for people who fear God I jit compiled my soul in 2019 vmap me Pure functions only this is bigger than code - kanye on jax

English

479

Heydari@HeydariAI·5d

@md_kasif_uddin Qwen 3.6 27B

Magyar

Kasif@md_kasif_uddin·6d

Be honest, which is the best Open Source AI model?

English

147

178

10.7K

Heydari@HeydariAI·5d

Not to brag but Qwen 3.7 Max is good at Jax and Flax NNX

English

Heydari@HeydariAI·5d

Why models in Qwen Studio don't think? I mean the thinking button is also removed from the app (android) @Alibaba_Qwen

English

Heydari@HeydariAI·5d

@analogalok China is only like 2 months behind

English

Alok@analogalok·6d

Sorry but the "China is years behind in AI" people owe us an apology. Qwen3.7 Max at 56.6. top models at 70+. delta shrinking every single drop. wake up.

Artificial Analysis@ArtificialAnlys

Alibaba’s new Qwen3.7 Max model scores 56.6 on the Artificial Analysis Intelligence Index, 4.8 points higher than Qwen3.6 Max Preview (51.8). While Alibaba still trails models from OpenAI, Anthropic and Google, Qwen3.7 Max is the closest they have been to the frontier Qwen3.7 Max is @Alibaba_Qwen's latest proprietary flagship, scoring 56.6 on the Intelligence Index, a 4.8 point gain over Qwen3.6 Max Preview (51.8) released in April. Qwen3.7 Max continues Alibaba's pattern, in place since Qwen2.5 Max (January 2025), of releasing Max and Plus models as closed weights while the rest of the Qwen line remains open weights. The leading open weights Qwen on the Intelligence Index is Qwen3.6 27B (Reasoning, 45.8) released in April 2026, and the leading open weights MoE Qwen is Qwen3.5 397B A17B (Reasoning, 45.0) released in February 2026 Key takeaways for the reasoning variant: ➤ The Intelligence Index gains over Qwen3.6 Max Preview are concentrated in scientific reasoning, agentic capability and coding. CritPt +9.7 p.p (3.7% to 13.4%), HLE +9.2 p.p (28.9% to 38.1%), TerminalBench Hard +6.9 p.p (43.9% to 50.8%) and GDPval-AA +42 Elo (1504 to 1546). Scores on other benchmarks in the Intelligence Index are flat compared to Qwen3.6 Max Preview ➤ A significant share of the Intelligence Index gain is driven by higher abstention on AA-Omniscience, not higher accuracy. Qwen3.7 Max's accuracy on AA-Omniscience dropped 7.6 p.p (37.7% to 30.1%), while its hallucination rate dropped 21.3 p.p (44.2% to 22.9%). The model is choosing not to answer more questions rather than recalling more facts. Because hallucination rate and accuracy both feed into the Intelligence Index, the hallucination reduction is one of the larger single contributors to the +4.8 point gain on the Intelligence Index ➤ Qwen3.7 Max used 96.7M output tokens to run the Intelligence Index, ~31% more than Qwen3.6 Max Preview (73.9M). It sits mid-pack on frontier token usage: above GPT-5.5 (high, 44.5M) and Gemini 3.1 Pro Preview (57.3M), below Claude Opus 4.7 (Adaptive Reasoning, Max Effort, 112M), Kimi K2.6 (166M) and DeepSeek V4 Pro (Reasoning, Max Effort, 187M) Key model details: ➤ Context window: 1M tokens (up from 256K on Qwen3.6 Max Preview) ➤ Multimodality: Text input and output only ➤ Pricing: Yet to be announced (Qwen3.6 Max Preview is priced at $1.30/$7.80 per 1M input/output tokens on the @alibaba_cloud first-party API) ➤ Licensing: Proprietary, closed weights

English

164

17.2K

Heydari@HeydariAI·5d

@mani_meine مدل به شدت گرونی هست نسبت به این که فلشه، می‌دونم در مقایسه با اوپوس و ۵.۵ خیلی ارزون تره ولی مدل های فلش اینقدر گرون نبودن، سه برابر کردن هزینشو

فارسی

761

Mani@mani_meine·6d

توی کنفرانس گوگل I/O اعلام کردن با آنتی گراویتی و Gemini 3.5 Flash ،یک سیستم عامل کاملا فانکشنال رو موفق شدن بنویسند. دوازده ساعت طول کشیده با استفاده از ۹۳ ساب ایجنت، و مصرف 2.6B توکن و مهمتر از همه با هزینه کمتر از هزار دلار😳. این قسمت کاست افیشنسی از همه مهمتره به نظرم.

فارسی

273

17.6K

Heydari@HeydariAI·5d

@david_saint_ Yes it is more robust than 3.1 pro but it's not a flagship model. Like 9 box for output is not fair for a flash model

English

seijin@david_saint_·5d

@HeydariAI The capability is good. The price/token efficiency is not

English

Heydari@HeydariAI·6d

I'm a google fan and JAX stack developer but Gemini 3.5 Flash is ... It should have been named 3.1 Flash, 3.5 is a huge number for its capability. I hope the 3.5 pro becomes a real flagship model

English

265

Heydari@HeydariAI·5d

@arcjax7 Master of hallucination

English

Heydari@HeydariAI·6d

@pheonix627 دقیقا خیلی راحت فلگ میشه اکانتش میری بلاک میکنی

فارسی

Toxic Wine🇮🇷@pheonix627·20 May

جایزه بهترین اموجی هم میرسه به🎒 واقعا خیلی خوبه هر وقت میبینم یکی تو اسمش اینو گذاشته دیگه وقتمو تلف نمیکنم ببینم چه اراجیفی بافته

فارسی

811

Keşfet

@test_tm7873 @googlecloud @GoogleCloudTech @smykx @diegocabezas01 @arcjax7 @elonmusk @BarackObama