



Ruofan Jin
24 posts












Many of the most complex and useful functions in biology emerge at the scale of whole genomes. Today, we share our preprint “Generative design of novel bacteriophages with genome language models”, where we validate the first, functional AI-generated genomes 🧵









ok it actually works, uggghhh

🚀 Introducing 𝗥𝗡𝗔𝗚𝗲𝗻𝗲𝘀𝗶𝘀 — a 𝗚𝗲𝗻𝗲𝗿𝗮𝗹𝗶𝘀𝘁 𝗙𝗼𝘂𝗻𝗱𝗮𝘁𝗶𝗼𝗻 𝗺𝗼𝗱𝗲𝗹 for 𝗳𝘂𝗻𝗰𝘁𝗶𝗼𝗻𝗮𝗹 𝗥𝗡𝗔 𝘁𝗵𝗲𝗿𝗮𝗽𝗲𝘂𝘁𝗶𝗰𝘀, unifying sequence understanding, de novo design, and 3D structure prediction. From 𝗔𝗽𝘁𝗮𝗺𝗲𝗿𝘀 to 𝗚𝗲𝗻𝗲 𝗘𝗱𝗶𝘁𝗶𝗻𝗴, RNAGenesis powers next-gen RNA engineering across modalities. 🧬 Key results: ➡ De novo aptamer design with 𝗻𝗮𝗻𝗼𝗺𝗼𝗹𝗮𝗿 𝗞𝗗 (as low as 4.02 nM) ➡ sgRNA scaffolds boosting 𝗴𝗲𝗻𝗲 𝗲𝗱𝗶𝘁𝗶𝗻𝗴 𝗲𝗳𝗳𝗶𝗰𝗶𝗲𝗻𝗰𝘆 𝘂𝗽 𝘁𝗼 𝟮.𝟱× (CRISPR-Cas9, Base Editing, Prime Editing) ➡ SOTA on 𝟭𝟭 𝗼𝗳 𝟭𝟯 tasks in the BEACON benchmark ➡ Outperforms 𝗔𝗹𝗽𝗵𝗮𝗙𝗼𝗹𝗱𝟯 (structure prediction), 𝗥𝗵𝗼𝗗𝗲𝘀𝗶𝗴𝗻 (inverse folding), and 𝗥𝗡𝗔-𝗙𝗿𝗮𝗺𝗲𝗙𝗹𝗼𝘄 (de novo structure generation) ➡ Built 𝗥𝗡𝗔𝗧𝘅-𝗕𝗲𝗻𝗰𝗵 — a focused benchmark for RNA therapeutics with 100K+ validated sequences; RNAGenesis outperforms 𝗘𝘃𝗼𝟮 and 𝗥𝗡𝗔-𝗙𝗠 across ASO, siRNA, shRNA, circRNA, and UTR tasks 🧠 Powered by: A hybrid 1B-param model: BERT-style encoder + latent diffusion decoder Inference-time alignment (gradient guidance + beam search) Multi-modal design unifying sequence, structure, and function 🧪 Wet-lab validation confirms high-affinity aptamer binding and improved gene-editing activity 🧮 Computational analysis reveals stronger G–C pairing, more hydrogen bonds, and lower MFE/MMBPSA in designed RNAs 📌 Explore: 🧾 Paper: biorxiv.org/content/10.110… 💻 Code: github.com/zaixizhang/RNA… 🧪 Benchmark: RNATx-Bench Team effort from @PrincetonAInews @StanfordMed @ZJU_China @PKU1898 @lecong @MengdiWang10 and more. #RNA #CRISPR #GeneEditing #RNAtherapeutics #DrugDiscove



