Tweet ghim
Vikas
2K posts

Vikas
@vickcodes
AI/ML Researcher with passion for turning wild ideas into reality.Seeking spark that ignite creativity-let's build together💡 DM for Coffee☕️ Code 🚀
Hyderabad, India Tham gia Mayıs 2016
1K Đang theo dõi196 Người theo dõi
Vikas đã retweet

Introducing 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔: Rethinking depth-wise aggregation.
Residual connections have long relied on fixed, uniform accumulation. Inspired by the duality of time and depth, we introduce Attention Residuals, replacing standard depth-wise recurrence with learned, input-dependent attention over preceding layers.
🔹 Enables networks to selectively retrieve past representations, naturally mitigating dilution and hidden-state growth.
🔹 Introduces Block AttnRes, partitioning layers into compressed blocks to make cross-layer attention practical at scale.
🔹 Serves as an efficient drop-in replacement, demonstrating a 1.25x compute advantage with negligible (<2%) inference latency overhead.
🔹 Validated on the Kimi Linear architecture (48B total, 3B activated parameters), delivering consistent downstream performance gains.
🔗Full report:
github.com/MoonshotAI/Att…

English
Vikas đã retweet

Understanding Deep Learning — 541-page PDF eBook and 68 coding exercises with Python code notebooks that cover all the topics in the book: udlbook.github.io/udlbook/
...or buy the book here: amzn.to/4qwof9k

English
Vikas đã retweet

How many of you know that all the models you are using are built based on transformers.
Transformers created based on research paper "Attention All You Need"
#AI #ML #ArtificialIntelligence #gemini
arxiv.org/abs/1706.03762
English

Drop your profile. Would review bio,header and share honest feedback
#feedback #GrowingTogether #GrowWithCode #Grow
English

@thesayannayak Because c++ is complex.Python is built for accessibility,understandable and easy to write and build.
English

📢 FRЕЅH NЕWS: Elon Musk announces the introduction of its own X digital currency 🚀
📌 Learn more: x.com/i/communities/…
👥 Invited users: @ErickMla @vickcodes @Ksaynak16
English

@vickcodes @X Ask Grok is currently available to Premium and Premium+ subscribers only. Subscribe to unlock this feature: x.com/i/premium_sign…
English

@mfranz_on Is there any way, we can read configuration of the hardware and suggest best local model ?
At least it should have instructions to get this done.
English

Neural networks having a pivotal role in machine learning. Lets review and utilize this.
Kirk Borne@KirkDBorne
Download 284-page PDF “Introduction to Neural Networks” ➡️ dkriesel.com/en/science/neu… ————— #DataScience #AI #Algorithms #ML #MachineLearning #DeepLearning #Mathematics #Calculus #DataScientist
English









