AnderDN

83 posts


@dn_ander

I’m a computational biologist and biochemist 🧬 Postdoctoral researcher at University of Toronto and OICR

Joined September 2020
162 Following · 42 Followers
AnderDN retweeted
Grupo R Asturias
Grupo R Asturias@grupoRasturias·
We're announcing the first session of the Bio::Bytes series! 📅 Date: February 19. 🕓 Time: 16:00h - 17:00h. 📍 Venue: Edificio Silicosis, FINBA (Av. Roma s/n). 🎟️ Admission: Free until capacity is reached. Hosted by: @dn_ander Sign up at: forms.gle/YKLSQ5yEoML65H…
Español
1
3
2
276
AnderDN retweeted
FINBA
FINBA@FINBAsturias·
The Biostatistics and Epidemiology Platform @Bioestad_ISPA is organizing a new Basic Biostatistics Course, from February 4 to March 25. 25 places are available and will be allocated in order of registration. More information and registration at ispa-finba.es/curso-bioestad…
Español
0
3
2
200
AnderDN retweeted
Dan Landau
Dan Landau@landau_lab·
Big, beautiful trees!! SMART-PTA for whole-genome+transcriptome on thousands of single cells from the normal human esophagus 🤯 Massively scaling up the power of scWGS to build deep phylogenies and chart somatic evolution from birth throughout life. biorxiv.org/content/10.110…
English
14
74
244
41.9K
AnderDN retweeted
Valli Subasri
Valli Subasri@vallisubasri·
Many cancer methylation studies make subtle but fatal mistakes: ❌ Feature selection across train+test (x.com/jmschreiber91/…) ❌ Ignoring confounders ❌ Weak model evaluation and robustness The result? Biased predictions that don’t generalize.🧵We set out to do it right.
Jacob Schreiber@jmschreiber91

The more papers I read for a review article I'm writing about ML pitfalls in genomics, the more my faith is shaken in the results from papers that apply machine learning to methylation arrays. A salty thread. 1/

English
1
2
18
1.7K
AnderDN
AnderDN@dn_ander·
Grateful to my colleagues and to my supervisors, Lincoln Stein & @BoWang87, for their guidance and support. Stay tuned, more to come!
English
1
0
1
76
AnderDN retweeted
Grupo R Asturias
Grupo R Asturias@grupoRasturias·
Once again this year, the Grupo de R de Asturias, in collaboration with @FINBAsturias, is launching the Introduction to R Course, now in its fourth edition! 🎉 If you're interested in getting started with handling data and creating figures in R, don't hesitate to sign up. We look forward to seeing you! More information👇
Español
0
5
9
381
AnderDN retweeted
Bo Wang
Bo Wang@BoWang87·
🚀 What do genomic Transformers actually learn about biology?
• What knowledge do they hold at random init, after pre-training, and following fine-tuning?
• We dove deep into every attention head to find out.

📄 Preprint live now: “Interpreting Attention Mechanisms in Genomic Transformer Models: A Framework for Biological Insights” 👉 biorxiv.org/content/10.110… Code: github.com/meconsens/geno…
⸻
🛠 What we built
• Scalable mapping between attention heads & biological features (e.g. TSS, GC content, GO terms)
• Label-specific analysis to uncover context-dependent attention patterns
• GPT-4 summaries for every head’s attention-feature links
• Head ablation experiments to test causal impact on predictions
⸻
🔍 Key discoveries
• Even models with random DNA weights show biologically meaningful heads
• Fine-tuning refines, not erases, what pre-training learned
• Tokenization matters: overlapping vs non-overlapping k-mers affect interpretability
• Heads tied to biology are more predictive than heads with no feature links
• Some heads show negative learning: they attend to absence of features
⸻
🧠 Why this matters
We now have tools to ask what genomic models learn, and which heads are driving predictions. A big step toward truly interpretable, testable genomics AI.
⸻
⚠️ Limitations to keep in mind
• Not every head is interpretable
• Attention patterns can be unstable across layers & tokens
• Interpretations explain only part, not all, of the attention variance
• GPT-4 summaries are helpful but can overgeneralize
• Results depend heavily on annotation quality & biological context
⸻
TL;DR: We’re bringing interpretability to the core of genomic Transformers: revealing biologically meaningful attention heads, unpacking how tokenization & training shape them, and letting us pinpoint which ones actually matter.

🎉 Huge shoutout to the incredible lead authors in the lab, Mica Consens, Vivian Chu, Ander Diaz-Navarro, for driving this forward! @VectorInstitute @UHN_Research @UofT
English
6
64
328
32.5K
AnderDN retweeted
ISCB SC RSG-Spain
ISCB SC RSG-Spain@RSGSpain·
🔍 Heard of virtualenv or conda? Well, R has a little gem of its own: renv! 💎 Developed by Posit, renv lets you create virtual environments (📦 isolated sets of packages) so that your R projects are: ✅ Reproducible ✅ Easy to share
Español
1
2
6
368
AnderDN retweeted
Grupo R Asturias
Grupo R Asturias@grupoRasturias·
The 2nd prize (€100) also goes to an entrant from the University of Granada: 👧 Laura Jiménez. Here are a few images of her entry:
Español
2
2
1
145
AnderDN retweeted
Bo Wang
Bo Wang@BoWang87·
Exciting News: Our team — Arman (@arman1sa lead, an AI engineer @UHNAIHUB ) + Nasim Abdollahi — placed 1st in the AIRCHECK Hackathon mini-challenge! They built a gradient-boosted model with Bayesian optimization to predict binding of DEL-derived molecules to target proteins. AIRCHECK is a large-scale open-access platform for AI-driven drug discovery, developed by @thesgconline, X-Chem & HitGen, hosting DEL screening data across diverse protein targets. Thanks to @UHN, @Google, and @UHNAIHUB for supporting this work. More to come on accelerating hit discovery with ML! #AI #DrugDiscovery #Cheminformatics #Hackathon
English
0
11
52
4.3K
AnderDN retweeted
Bo Wang
Bo Wang@BoWang87·
🔥 Unveiling the Future of Genomics with Genome Language Models (gLMs)! 🔥

Our comprehensive review, "Transformers and genome language models," is finally published in Nature Machine Intelligence!
Link: nature.com/articles/s4225…

Key Highlights:
🔬 The Challenges Addressed by gLMs: gLMs tackle the intricate task of interpreting vast genomic sequences, enabling predictions about gene regulation, variant effects, and more.
🧠 Transformers in Genomics: Discover how transformer architectures, renowned for their success in natural language processing, are adept at capturing long-range dependencies in genomic data, leading to more accurate models.
🚀 Beyond Transformers, Introducing HyenaDNA: Explore innovative architectures like HyenaDNA, which offer efficient long-range genomic sequence modeling at single nucleotide resolution, pushing the boundaries of genomic research.
📊 Comparative Analysis of Models: We delve into the evolution from sequence-to-function models like DeepSEA and Enformer to sequence-to-sequence models such as DNABERT and Evo, highlighting their respective strengths and applications.
⚡ Strengths, Limitations, & Future Directions: Gain insights into the current capabilities of genomic AI, its limitations, and the promising avenues for future research and application.

This pivotal work is the result of a collaborative effort led by Micaela E. Consens (@micaelanonsense ), with contributions from Cameron Dufault, Michael Wainberg (@michaelwainberg ), Duncan Forster, Mehran Karimzadeh, Hani Goodarzi (@genophoria ), Fabian J. Theis (@fabian_theis ), Alan Moses.

@UHNAIHUB @UHN @VectorInst @uoftoront #Genomics #AI #MachineLearning #Transformers #HyenaDNA #DeepLearning #Bioinformatics #GenomeResearch
English
8
119
391
48.7K
AnderDN retweeted
Grupo R Asturias
Grupo R Asturias@grupoRasturias·
Well, well... here it is, one of our most important events of the year: our "DATA VISUALIZATION CONTEST WITH R" 📈📊. Go ahead, take part, and tackle the challenge we're setting this year. We can't wait to see what you surprise us with! And let the code flow 😎
Español
1
7
17
872
AnderDN
AnderDN@dn_ander·
Our updated version of OncoGAN is out! 🚀 OncoGAN is an AI system capable of generating high-fidelity, open-access synthetic cancer genomes. Do you want to know more about it? 1/9
English
1
2
8
715