

Mohamed Rashad




The top 5 labs in the Text Arena rankings by category show that frontier models have distinct strengths and tradeoffs.

#1 @AnthropicAI, Claude Opus 4.7 - The most consistently dominant model overall, leading or top-tier across nearly every major category.

#2 @GoogleDeepMind, Gemini 3.1 Pro - Well-rounded, with a notable edge in Creative Writing; ranked below Opus 4.7 and GPT-5.5 High in Expert.

#3 @AIatMeta, Muse Spark - Particularly strong in Overall and Coding, though it lags behind in Expert tasks, Math, and Longer Query performance.

#4 @OpenAI, GPT-5.5 High - One of the most balanced models overall, staying competitive with the top two across most categories, with especially strong performance in Expert and Math.

#5 @xAI, Grok 4.20 - A more specialized profile, standing out primarily in Creative Writing and Hard Prompts, while lagging behind in Expert tasks.

This works really well btw: at the end of your query, ask your LLM to "structure your response as HTML", then view the generated file in your browser. I've also had some success asking the LLM to present its output as slideshows, etc.

More generally, imo audio is the human-preferred input to AIs, but vision (images/animations/video) is the preferred output from them. Around a third of our brain is a massively parallel processor dedicated to vision; it is the 10-lane superhighway of information into the brain. As AI improves, I think we'll see a progression that takes advantage of this:

1) raw text (hard/effortful to read)
2) markdown (bold, italic, headings, tables, a bit easier on the eyes) <-- current default
3) HTML (still procedural with underlying code, but a lot more flexibility on graphics, layout, even interactivity) <-- early, but forming the new good default
...4,5,6,...
n) interactive neural videos/simulations

Imo the extrapolation (though the technology doesn't exist just yet) ends in some kind of interactive video generated directly by a diffusion neural net. Many open questions remain as to how exact/procedural "Software 1.0" artifacts (e.g. interactive simulations) may be woven together with neural artifacts (diffusion grids), but generally something in the direction of the recently viral x.com/zan2434/status…

There are also improvements necessary and pending at the input. Neither audio nor text nor video alone is enough; e.g. I feel the need to point/gesture at things on the screen, similar to all the things you would do with a person physically next to you at your computer screen.

TLDR: The input/output mind meld between humans and AIs is ongoing, and there is a lot of work to do and significant progress to be made, well before jumping all the way to neuralink-esque BCIs and all that. For what it's worth, at the current stage, hot tip: try asking for HTML.


"He has connections at the police station".. a woman appeals for help after being deprived of her daughter and blackmailed by her ex-husband, in Cairo Governorate

“Continuous Latent Diffusion Language Model” Most diffusion language models still use diffusion to recover token-like states, just in a different generation order. However, this paper uses diffusion in a different way. It learns a continuous latent prior for global semantics first, then decodes that latent into text. It treats language modeling as planning meaning in latent space and only then realizing the wording, and they show that this can scale competitively up to 2000 EFLOPs against autoregressive models, especially for tasks needing deeper semantic planning.
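A toy sketch of the latent-first idea (every name and number here is illustrative, not the paper's architecture): stage 1 iteratively denoises a continuous latent that stands for the global "meaning", and only stage 2 realizes it as tokens.

```python
import math
import random

random.seed(0)

DIM = 8
VOCAB = {  # toy "semantic" embeddings for a few tokens
    "the": [0.1] * DIM,
    "cat": [0.9, 0.1, 0.8, 0.2, 0.7, 0.1, 0.6, 0.3],
    "sat": [0.2, 0.8, 0.1, 0.9, 0.3, 0.7, 0.2, 0.8],
}

def toy_denoiser(z, t):
    """Stand-in for a learned denoising model: nudge z toward one meaning."""
    target = VOCAB["cat"]
    return [zi + 0.3 * (ti - zi) for zi, ti in zip(z, target)]

def sample_latent(steps=20):
    """Stage 1: diffusion-style sampling of a continuous semantic latent."""
    z = [random.gauss(0, 1) for _ in range(DIM)]
    for t in range(steps):
        z = toy_denoiser(z, t)
    return z

def decode(z):
    """Stage 2: realize the wording -- here, trivially, a nearest-neighbor token."""
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    return min(VOCAB, key=lambda w: dist(VOCAB[w], z))

z = sample_latent()
print(decode(z))  # → cat
```

The point of this factorization is that the global semantics are fixed before any token is emitted, which is what the paper argues helps on tasks needing deeper semantic planning; a real decoder would of course generate a full sequence conditioned on z rather than one nearest-neighbor token.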

HTML is the new markdown. I've stopped writing markdown files for almost everything and switched to using Claude Code to generate HTML for me. This is why.






New Anthropic research: Natural Language Autoencoders. Models like Claude talk in words but think in numbers. The numbers—called activations—encode Claude’s thoughts, but not in a language we can read. Here, we train Claude to translate its activations into human-readable text.

What's new in DeepSeek V4, with @MRashadnow Mohamed Rashad. Set a reminder for my upcoming Space! twitter.com/i/spaces/1dJrP…


DPO is substantially more similar to SFT than it is to RL. I will die on this hill.
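One concrete way to see the claim: the DPO objective is a supervised log-loss computed over a static preference dataset, with nothing sampled from the policy during training (the hallmark of SFT-style pipelines, unlike PPO-style RL). A sketch for a single preference pair, with made-up log-probabilities:

```python
import math

def dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """DPO loss for one preference pair (chosen y_w, rejected y_l).

    Like SFT, this is a fixed-target log-loss over a static dataset;
    no rollouts are sampled from the policy being trained.
    """
    margin = beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l))
    return -math.log(1.0 / (1.0 + math.exp(-margin)))  # -log sigmoid(margin)

# Made-up sequence log-probs under the policy and the frozen reference.
loss = dpo_loss(logp_w=-12.0, logp_l=-15.0,
                ref_logp_w=-13.0, ref_logp_l=-14.0, beta=0.1)
print(round(loss, 4))  # → 0.5981
```

Whether that makes DPO "not RL" is the debate, but mechanically the training loop looks like supervised fine-tuning on labeled pairs, not like policy-gradient optimization.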