Darshan Patil
120 posts

Darshan Patil
@dapatil211
PhD student at UdeM/MILA (Quebec AI Institute)



Following our @dagstuhl seminar on Continual Learning in the Foundation Model Era, we are now sharing a roadmap! tldr: we view modular memory design as the missing piece to combine the capabilities of In-Weight & In-Context Learning for adaptation at scale arxiv.org/abs/2603.01761

first paper of the phd 🥳 the Superficial Alignment Hypothesis (SAH) argues that pre-training adds most of the knowledge to a model, and post-training merely surfaces it. however, this hypothesis has lacked a precise definition. we fix this.














📢Excited to announce the Workshop on Weight-Space Symmetries @icmlconf! We welcome 4-page submissions analysing symmetries, their effects on training and model structure, and practical methods to utilize them. Submission Deadline: April 24 (23:59 AoE) #ICML2026






🧬 New paper Scientific datasets evolve as science evolves. With proteins, new sequences get added, annotations get corrected, and noisy entries get curated out. Introducing CoPeP, a continual-pretraining benchmark for protein LMs. Details 🧵 1/n


