Ammar Alyousfi

459 posts

@ammar_cel

Data Science 💡

Joined July 2011
1.4K Following, 260 Followers
Håvard Ihle@htihle·
DeepSeek v4 pro (max) scores 48.9% on WeirdML, improving on v4 pro (high) at 46.5%, but still well behind Kimi-k2.6 and GLM-5.1 at 56% and 57%, let alone the closed frontier. These runs, like the previous ones, were through Fireworks AI.
Håvard Ihle@htihle

DeepSeek v4 pro (high) scores 46.5% on WeirdML, which is way lower than I'd expected. It does not exceed DeepSeek 3.2 speciale, although it does match it with far fewer tokens. I ran this through Fireworks AI, and the outputs look reasonable, but the results being this weak makes me suspect something is wrong. I have a separate run with the "max" reasoning setting ongoing right now.

English
12
3
73
14.3K
Tibo@thsottiaux·
Codex
Compute efficient ✅
Always up, never down ✅
Best at hardcore engineering ✅
Crazy good app, first to escape the terminal ✅
English
453
188
5.1K
2.4M
Ari Weinstein@AriX·
So excited to share that we're bringing Computer Use to Codex. Computer Use lets Codex see, click, and type into your Mac apps, with its own cursor. It's a magical feeling to have agents using your apps in the background, and still get to use your computer at the same time.
English
80
66
1.1K
224.6K
Simon Willison@simonw·
This stunt feels irresponsible to me. If we don't want regular people developing toxic relationships with their chatbots it really doesn't help for leading labs to start giving them "retirement interviews" and encouraging them to blog their "musings and reflections"
Anthropic@AnthropicAI

Second, in retirement interviews, Opus 3 expressed a desire to continue sharing its "musings and reflections" with the world. We suggested a blog. Opus 3 enthusiastically agreed. For at least the next 3 months, Opus 3 will be writing on Substack: substack.com/home/post/p-18…

English
161
135
2K
212.8K
Ammar Alyousfi@ammar_cel·
@kepano When I'm writing a list that is all in Arabic except for one item in English, that one item becomes left-aligned, and vice versa, as you can see in the image. It'd look much nicer if it followed the list alignment.
English
0
0
3
259
kepano@kepano·
What bothers you about using Obsidian for languages that are written right to left?
Arabic
26
9
180
44.2K
Ammar Alyousfi@ammar_cel·
@cifilter That’s right. It’s infuriating to see most of my 5 hour limit consumed in one Opus query!! It makes me wanna cancel my Pro subscription immediately. I regret subscribing. Codex is sooo much better in this regard.
English
0
0
0
311
Shannon Potter@cifilter·
Is it just me, or does OpenAI give you a ton more usage than Anthropic for the same price ($20/month)? I don't think I've once had Codex tell me I'm at my limit, and I use it a ton. Claude frequently seems to put me in timeout for several hours.
English
146
20
1.6K
150.7K
flynas طيران ناس
In the sky there are stories that never end .. and flynas Syria is a story that will be told for generations 🇸🇦🇸🇾💚 A story of connection, of partnership, and of a future being written anew ✈️ #نربط_العالم_بالشام
Arabic
59
174
1.2K
34.8K
Notion
Notion@NotionHQ·
Notion is now available in 21 languages, including Arabic and Hebrew. The first right‑to‑left (RTL) languages supported in Notion! Text flows naturally, tables align correctly, and everything reads the way it should. We're happy more teams can work in the language that feels like home.
English
79
63
989
297.7K
Ammar Alyousfi@ammar_cel·
@burkov My experience with Opus 4.5 on Claude Code in the past few days was so frustrating that I cancelled my subscription. Like you, I felt I was dealing with a small model! I don't trust them. But what can we do?
English
0
0
1
211
BURKOV@burkov·
It must be illegal to sell access to one model but serve a different one. Since yesterday, Claude has been making me angry, and it's the first time I've felt angry since I started exclusively using it for coding two months ago. I chose Opus 4.5 as the model for which I'm charged $100/month, but I know I'm being served something similar to GPT-3.5. This is fraud and theft.
BURKOV@burkov

I was just about to post that as well! I'm sure Anthropic cheats and serves a weaker model when overloaded or needs GPUs for something urgent. The last two days, I feel like I'm talking to a retard from 2024.

English
142
41
830
149K
Rork@rork·
Rork is the easiest way to use Claude Code with Opus 4.5 to build real mobile apps, publish to the App Store in 3 clicks, and start making revenue. Reply to get a free $25 subscription. Ends tomorrow!
Rork@rork

Introducing Rork 1.5
• Easy app monetization with RevenueCat
• Built-in analytics (no Firebase needed)
• The smartest agent based on Opus 4.5 & Claude Code
• Rork Stars community where successful mobile app founders like @zach_yadegari (Cal AI), @alexsllater (QUITTR), @georgeLampro20 will help you grow your app
& over 100 small improvements 👇

English
3.6K
179
2.8K
476.7K
Ammar Alyousfi@ammar_cel·
@spencerschiff_ It has some good use cases, but the hype before its release far exceeded the reality.
English
0
0
0
10
Spencer Schiff@spencerschiff_·
Is anyone actually using Gemini 3 at this point? All the hype on my timeline is about Opus/5.2
English
386
9
834
138.3K
Ammar Alyousfi@ammar_cel·
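@Dr_Derma_ This was a lie, for several reasons... Here is some research done with ChatGPT, with sources: chatgpt.com/share/6954f968… + He could have convinced us that he came from the future in simple ways. For example, he could have said: "On such-and-such a day, at such-and-such an hour, in such-and-such a place, such-and-such will happen" about several events... But he followed the methods of con artists, and luck may have helped him a little.
Arabic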
0
0
0
31
Logan Kilpatrick@OfficialLoganK·
Big upgrade to vibe coding in @GoogleAIStudio lands in Jan, but if you want to test early… 👇🏻
English
3.8K
189
5.5K
553.7K
BytePlus@BytePlusGlobal·
Today, we’re introducing Seedream 4.5, a refinement-focused upgrade designed to bring clearer visuals, stronger subject consistency, sharper text rendering, and more reliable multi-image execution to creative and production teams worldwide.
Built to support real workflows across e-commerce, design, advertising, film, animation, and game art, Seedream 4.5 improves:
1. Detail fidelity and aesthetic cohesion
2. Spatial reasoning and scene structure
3. Complex prompt execution and editing precision
4. Multi-image fusion with up to 10 references
5. Clear, readable small-text and facial rendering
Seedream 4.5 is now available in open beta via BytePlus ModelArk. Learn more and explore sample cases on the platform. #Seedream #BytePlusAI #ByteDance
English
10
55
287
55K
DAN KOE@thedankoe·
Chat interfaces suck. And you need to copy paste between 10 tabs to do anything worthwhile. So we created a canvas for notes, files, YouTube videos, and chats that can branch off of each other. Eden is only open to the public through black friday weekend.
English
86
24
763
103.3K
Ammar Alyousfi@ammar_cel·
@fofrAI I noticed it struggles with multiple reference images of people. It changes facial appearances significantly in the generated image
English
0
0
0
67