


Maryam | مريم 🤖
1.5K posts

@Mal7othify
Engineering @SDAIA_SA | Tech Trainer | Public Speaker | Community Builder | Women in Tech Advocate - Opinions are my own











F-Droid warns that Google’s new sideloading restrictions, requiring developer verification, will kill its project and block access to hundreds of open source apps. It argues that this limits independent developers and consolidates Google’s control alternativeto.net/news/2025/9/f-…


🇨🇳 DeepSeek-R1 was published in Nature yesterday as the cover article for their BRILLIANT latest research. They show that pure Reinforcement Learning with answer-only rewards can grow real reasoning skills, no human step-by-step traces required. So completely skip human reasoning traces and still get SOTA reasoning via pure RL. It’s so powerful revelation, because instead of forcing the model to copy human reasoning steps, it only rewards getting the final answer right, which gives the model freedom to invent its own reasoning strategies that can actually go beyond human examples. Earlier methods capped models at what humans could demonstrate, but this breaks that ceiling and lets reasoning emerge naturally. Those skills include self-checking, verification, and changing strategy mid-solution, and they beat supervised baselines on tasks where answers can be checked. Models trained this way also pass those patterns down to smaller models through distillation. AIME 2024 pass@1 jumps from 15.6% to 77.9%, and hits 86.7% with self-consistency. ⚙️ The Core Concepts The paper replaces human-labelled reasoning traces with answer-graded RL, so the model only gets a reward when its final answer matches ground truth, which frees it to search its own reasoning style. The result is longer thoughts with built-in reflection, verification, and trying backups when stuck, which are exactly the skills needed for math, coding, and STEM problems where correctness is checkable. This matters because supervised traces cap the model at human patterns, while answer-graded RL lets it discover non-human routes that still land on correct answers.

واجهة مبتكرة ترتقي بتوقعاتك وتمنحك تجربة رقمية متطورة مع تطبيق #توكلنا بواجهة مستخدم جديدة. حدث التطبيق واستمتع بتجربة مثرية twkapp.store

اللي عندهم مشروع التخرج ويحتاج يكتب تقرير بطريقة مرتبة حرفيا overleaf من افضل المواقع اللي تقدر تستخدمه وتقدر تشاركه مع التيم كأنه Github لكن مخصص لتقارير. هو يستخدم نظام (LaTeX) مخصص للبحوث (document preparation system) overleaf.com