Heydari

616 posts

Heydari banner
Heydari

Heydari

@HeydariAI

Just-in-Time Machine Learning Engineer - Always monitoring the situation

Katılım Aralık 2024
130 Takip Edilen44 Takipçiler
testtm
testtm@test_tm7873·
Oh well it seems @googlecloud is shutting down the TRC program 😢 So many memories and I learn so much from it. Sadly. Maybe if we will make ourself hear they gona continue supporting us Opensource guys? :)) I hope so!
testtm tweet media
English
2
0
11
515
Heydari
Heydari@HeydariAI·
Okay TRC just said im out of the program somehow because im Iranian huh? Or other folks have similar issue, dc
English
0
0
0
37
arcjax
arcjax@arcjax7·
i miss the old andrej, gpt from scratch andrej taught us backprop andrej, cs231n andrej shipped the lectures andrej, open source andrej i hate the new andrej, vibe code andrej closed source anthropic andrej, english is code andrej i still love andrej, i still love andrej
English
2
0
37
1.1K
Heydari
Heydari@HeydariAI·
@arcjax7 If deepmind doesn't make 3.5 pro AGI I would compare it with MGK
English
0
0
2
110
arcjax
arcjax@arcjax7·
openai: drake anthropic: kendrick deepmind: jcole
English
2
0
21
1.9K
arcjax
arcjax@arcjax7·
Jax is the truth Pytorch is for people who fear God I jit compiled my soul in 2019 vmap me Pure functions only this is bigger than code - kanye on jax
arcjax tweet media
English
2
0
9
478
Kasif
Kasif@md_kasif_uddin·
Be honest, which is the best Open Source AI model?
Kasif tweet mediaKasif tweet mediaKasif tweet mediaKasif tweet media
English
147
3
178
10.7K
Heydari
Heydari@HeydariAI·
Not to brag but Qwen 3.7 Max is good at Jax and Flax NNX
English
0
0
0
70
Heydari
Heydari@HeydariAI·
Why models in Qwen Studio don't think? I mean the thinking button is also removed from the app (android) @Alibaba_Qwen
English
0
0
0
41
Alok
Alok@analogalok·
Sorry but the "China is years behind in AI" people owe us an apology. Qwen3.7 Max at 56.6. top models at 70+. delta shrinking every single drop. wake up.
Artificial Analysis@ArtificialAnlys

Alibaba’s new Qwen3.7 Max model scores 56.6 on the Artificial Analysis Intelligence Index, 4.8 points higher than Qwen3.6 Max Preview (51.8). While Alibaba still trails models from OpenAI, Anthropic and Google, Qwen3.7 Max is the closest they have been to the frontier Qwen3.7 Max is @Alibaba_Qwen's latest proprietary flagship, scoring 56.6 on the Intelligence Index, a 4.8 point gain over Qwen3.6 Max Preview (51.8) released in April. Qwen3.7 Max continues Alibaba's pattern, in place since Qwen2.5 Max (January 2025), of releasing Max and Plus models as closed weights while the rest of the Qwen line remains open weights. The leading open weights Qwen on the Intelligence Index is Qwen3.6 27B (Reasoning, 45.8) released in April 2026, and the leading open weights MoE Qwen is Qwen3.5 397B A17B (Reasoning, 45.0) released in February 2026 Key takeaways for the reasoning variant: ➤ The Intelligence Index gains over Qwen3.6 Max Preview are concentrated in scientific reasoning, agentic capability and coding. CritPt +9.7 p.p (3.7% to 13.4%), HLE +9.2 p.p (28.9% to 38.1%), TerminalBench Hard +6.9 p.p (43.9% to 50.8%) and GDPval-AA +42 Elo (1504 to 1546). Scores on other benchmarks in the Intelligence Index are flat compared to Qwen3.6 Max Preview ➤ A significant share of the Intelligence Index gain is driven by higher abstention on AA-Omniscience, not higher accuracy. Qwen3.7 Max's accuracy on AA-Omniscience dropped 7.6 p.p (37.7% to 30.1%), while its hallucination rate dropped 21.3 p.p (44.2% to 22.9%). The model is choosing not to answer more questions rather than recalling more facts. Because hallucination rate and accuracy both feed into the Intelligence Index, the hallucination reduction is one of the larger single contributors to the +4.8 point gain on the Intelligence Index ➤ Qwen3.7 Max used 96.7M output tokens to run the Intelligence Index, ~31% more than Qwen3.6 Max Preview (73.9M). It sits mid-pack on frontier token usage: above GPT-5.5 (high, 44.5M) and Gemini 3.1 Pro Preview (57.3M), below Claude Opus 4.7 (Adaptive Reasoning, Max Effort, 112M), Kimi K2.6 (166M) and DeepSeek V4 Pro (Reasoning, Max Effort, 187M) Key model details: ➤ Context window: 1M tokens (up from 256K on Qwen3.6 Max Preview) ➤ Multimodality: Text input and output only ➤ Pricing: Yet to be announced (Qwen3.6 Max Preview is priced at $1.30/$7.80 per 1M input/output tokens on the @alibaba_cloud first-party API) ➤ Licensing: Proprietary, closed weights

English
16
11
164
17.2K
Heydari
Heydari@HeydariAI·
@mani_meine مدل به شدت گرونی هست نسبت به این که فلشه، می‌دونم در مقایسه با اوپوس و ۵.۵ خیلی ارزون تره ولی مدل های فلش اینقدر گرون نبودن، سه برابر کردن هزینشو
فارسی
2
0
0
760
Mani
Mani@mani_meine·
توی کنفرانس گوگل I/O اعلام کردن با آنتی گراویتی و Gemini 3.5 Flash ،یک سیستم عامل کاملا فانکشنال رو موفق شدن بنویسند. دوازده ساعت طول کشیده با استفاده از ۹۳ ساب ایجنت، و مصرف 2.6B توکن و مهمتر از همه با هزینه کمتر از هزار دلار😳. این قسمت کاست افیشنسی از همه مهمتره به نظرم.
فارسی
9
4
273
17.6K
Heydari
Heydari@HeydariAI·
@david_saint_ Yes it is more robust than 3.1 pro but it's not a flagship model. Like 9 box for output is not fair for a flash model
English
0
0
0
9
seijin
seijin@david_saint_·
@HeydariAI The capability is good. The price/token efficiency is not
English
1
0
0
22
Heydari
Heydari@HeydariAI·
I'm a google fan and JAX stack developer but Gemini 3.5 Flash is ... It should have been named 3.1 Flash, 3.5 is a huge number for its capability. I hope the 3.5 pro becomes a real flagship model
English
2
0
3
262
Heydari
Heydari@HeydariAI·
@pheonix627 دقیقا خیلی راحت فلگ میشه اکانتش میری بلاک میکنی
فارسی
1
0
1
18
Toxic Wine🇮🇷
Toxic Wine🇮🇷@pheonix627·
جایزه بهترین اموجی هم میرسه به🎒 واقعا خیلی خوبه هر وقت میبینم یکی تو اسمش اینو گذاشته دیگه وقتمو تلف نمیکنم ببینم چه اراجیفی بافته
فارسی
2
0
18
805
Toxic Wine🇮🇷
Toxic Wine🇮🇷@pheonix627·
رفتم جاب ویژن دیدم شرکتمون پوزیشن قبلی منو آگهی کرده با حدود 2 تا 2.5 برابر حقوقی که میخواست به من بده لامصبا یعنی ما فاحشه ارزون قیمت بودیم واستون؟
فارسی
28
16
3.5K
121.7K
Heydari
Heydari@HeydariAI·
@yacineMTB What was DeepSeek zero I can't remember
English
0
0
0
17
kache
kache@yacineMTB·
AGI was co-invented by alec radforod and deepseek zero
English
8
3
146
10.2K