Daniel Chang

105 posts

Daniel Chang

Daniel Chang

@danielchang2002

Genetics PhD student @Stanford w/ @BrianHie | Prev @UMNComputerSci

Katılım Kasım 2018
371 Takip Edilen247 Takipçiler
Sabitlenmiş Tweet
Daniel Chang
Daniel Chang@danielchang2002·
It was super fun working on Evo 2, a DNA language model trained on genomes across the tree of life! Check out the preprint: arcinstitute.org/manuscripts/Ev… A small 🧵 highlighting some mechanistic interpretability work on Evo 2 (Fig. 4) we did in collaboration with @GoodfireAI 🔥🔥🔥
Daniel Chang tweet media
English
6
40
142
31K
Daniel Chang retweetledi
Samuel Alber
Samuel Alber@EggsAndSam7·
Pleased to announce that CellVoyager is published @naturemethods! CellVoyager is a scRNA-seq AI agent that autonomously generates hypotheses and tests them in a live analysis notebook, where users can guide the discovery process. Demo: cellvoyager.org What's new 🧵⤵️
English
1
7
27
5.5K
Daniel Chang retweetledi
Florian Hoffmann, PhD
Florian Hoffmann, PhD@fthoffmann·
Our 2 papers on RNA-guided transcription are now out @Nature. This mechanism re-writes the traditional concept of bacterial transcription and allows RNA transcripts to be generated de novo from potentially any cellular DNA sequence. 🧬 See below for links and thread 🧵
Sternberg Lab@SternbergLab

Out now! In collaboration with @LeifuChangLab, we uncover the molecular and structural underpinnings of CRISPR-Cas12f-like RNA-guided transcription systems! Links to the articles in the following tweet:

English
2
33
184
19.1K
Daniel Chang retweetledi
Garyk Brixi
Garyk Brixi@garykbrixi·
To make Evo 2 more accessible, we're releasing Evo 2 20B, a checkpoint that achieves 40B-level performance on a single H100, as a drop-in replacement. This came out of model surgery with @danielchang2002, and we are excited to see people build on it! github.com/ArcInstitute/e…
Garyk Brixi tweet media
English
1
11
41
8.6K
Daniel Chang retweetledi
Garyk Brixi
Garyk Brixi@garykbrixi·
Evo 2 is out in Nature today, showing that genome language models can predict and design across the full complexity of life, from phages to eukaryotes. A few surprises from the project, including how ignoring trillions of nucleotides was key to getting a good model. 🧵
Garyk Brixi tweet media
English
13
208
1K
97.3K
Daniel Chang retweetledi
Vince Tran
Vince Tran@tranvinq·
The hardest part of protein engineering isn't just finding good mutations – it’s deciphering which ones combine synergistically. Today in @ScienceMagazine, we present MULTI-evolve, a framework for rapid multi-mutant protein engineering, validated across three diverse proteins.
Vince Tran tweet media
English
10
94
423
42.6K
Daniel Chang retweetledi
David Chanin
David Chanin@chanindav·
SAEs fail even when the Linear Representation Hypothesis holds perfectly. We built SynthSAEBench: large-scale synthetic data with 16k ground-truth features, correlation, hierarchy, and superposition. We trained 5 SAE architectures on it. None achieve perfect feature recovery.
David Chanin tweet media
English
5
26
216
9.7K
Daniel Chang retweetledi
nature
nature@Nature·
Nature research paper: Long-read metagenomics reveals phage dynamics in the human gut microbiome go.nature.com/4p3PsiC
English
3
19
81
19.9K
Daniel Chang retweetledi
Simone Alma Evans
Simone Alma Evans@simone_alma·
I am so excited to share our project with you! We find prokaryotic proteases activate toxic enzymes and pores as a modular strategy in phage defense. We studied four fascinating protease-toxin pairs that are abundant across bacterial genomes: Many thanks to our wonderful collaborators and to the Gao lab for making this work possible! biorxiv.org/content/10.110…
English
2
4
8
771
Daniel Chang retweetledi
Brian Hie
Brian Hie@BrianHie·
We are actively recruiting for two positions at the interface between biology and generative design. Backgrounds of particular interest are in protein biochemistry/evolution and synthetic genomics/biology. Please consider joining us! 1/n
English
8
87
346
42.5K
Daniel Chang retweetledi
Jessica Sacher, PhD
Jessica Sacher, PhD@JessicaSacher·
Closing the AI-to-lab loop is hard, especially if you want to test your WHOLE GENOME generator.. Viruses are the only genomes cheap enough to print en mass, but raise biosafety flags So @ArcInstitute chose phages! We went deep w/ @samuelhking & @driscoll_cl on how they did it:
Jessica Sacher, PhD tweet media
English
2
11
59
24.8K
Daniel Chang retweetledi
Yunha Hwang
Yunha Hwang@Micro_Yunha·
We're thrilled to announce SeqHub, an AI-enabled platform for biological sequence analysis. SeqHub brings together sequence search, genome annotation, and data sharing in one place. I dreamed of a single place where I could learn everything about my sequences. Today, a much more refined version of this dream takes form with SeqHub.org, built by an incredible team at @tatta_bio. Our goal is to make sequence interpretation more intuitive and collaborative for everyone working with biological sequences. Currently, SeqHub is optimized for microbial protein and genome analysis. As we expand beyond microbial data, we'd love your feedback to help shape what comes next. I'm deeply grateful to our team at Tatta Bio, and to our collaborators and funders, for making this vision a reality. Check it out at seqhub.org!
English
14
113
565
52.9K
Daniel Chang retweetledi
Stephen Tang
Stephen Tang@stephentang23·
Genome maintenance by telomerase is a fundamental process in nearly all eukaryotes. But where does it come from? Today, we report the discovery of telomerase homologs in a family of antiviral RTs, revealing an unexpected evolutionary origin in bacteria. doi.org/10.1101/2025.1…
Stephen Tang tweet media
English
6
87
282
80.7K
Daniel Chang retweetledi
Daniel Chang retweetledi
Owen Queen
Owen Queen@oq_35·
🚀 Excited to share our new paper: CGBench — Benchmarking Language Model Scientific Reasoning for Clinical Genetics Research Can AI truly understand scientific papers? We explore how LLMs interpret real biomedical literature — not just multiple-choice questions.🧵
Owen Queen tweet media
English
7
17
72
18K
Daniel Chang retweetledi
Elana Simon
Elana Simon@ElanaPearl·
Published! 🎉 Paper now has more feature analysis and higher quality figures - thanks to great reviewer feedback! Code also got a major upgrade - v1.0.0 is way more modular so you can easily swap in different protein embeddings or SAE architectures: github.com/ElanaPearl/Int…
James Zou@james_y_zou

How do protein language models (PLM) think about proteins?🧬 We answer this w/ #InterPLM, just published in @naturemethods! Using sparse autoencoders + LLM agent, we identify 1000s of interpretable concepts learned by PLMs, pointing to new biology 🧵

English
11
34
256
32.3K
Daniel Chang retweetledi
John Wang
John Wang@_jnwang·
Super excited to share what we’ve been working on in collaboration with @SynBioGaoLab over the past few months on de novo antibody design. Check out this great thread by our team lead @santimillef highlighting the technical aspects of the pipeline!
GIF
Santiago Mille@santimillef

The ability to design antibodies against any protein of interest has major implications for medicine, biotech, and basic science. Today, we introduce Germinal, a pipeline for epitope-targeted de novo antibody design achieving  4–22% success rates with efficient experimental validation.

English
4
17
108
11.8K
Daniel Chang retweetledi
Talal Widatalla
Talal Widatalla@talaldotpdb·
Entering my PhD, de novo antibody design was a grand challenge I thought would not be solved without huge increases in affinity data and Ab-Ag structure. Only 2 years later, we provide the first open-source recipe to get antibody binders, almost magically, out of a computer (1/3)
Santiago Mille@santimillef

The ability to design antibodies against any protein of interest has major implications for medicine, biotech, and basic science. Today, we introduce Germinal, a pipeline for epitope-targeted de novo antibody design achieving  4–22% success rates with efficient experimental validation.

English
4
42
289
31.4K