Benjamin Perry

226 posts

Benjamin Perry

@bots_and_bits

Designing enzymes with bots and bits! | Romero Lab at Duke

Durham, NC Katılım Mayıs 2024

206 Takip Edilen273 Takipçiler

Benjamin Perry@bots_and_bits·17h

@samsinai Any suggestions for benchmarks that can reveal these trends?

English

182

Sam Sinai@samsinai·19h

It's much worse in unsupervised biological models, particularly sequence-only. So much baseless story-telling about (most) models that have at best learned to play nearest neighbors and get lucky by recombining additive motifs.

François Chollet@fchollet

This is more evidence that current frontier models remain completely reliant on content-level memorization, as opposed to higher-level generalizable knowledge (such as metalearning knowledge, problem-solving strategies...)

English

5.6K

Benjamin Perry@bots_and_bits·1d

If you want to learn generative models for bio, these folks are some of the best to do it. Check it out!🧠

Fred Zhangzhi Peng@pengzhangzhi1

We recently taught a short course at the ENAR 2026 Spring Meeting on generative models for protein, cell, and biomedical data. We’re excited to share the course materials here for anyone interested: pengzhangzhi.github.io/ENAR26-Course-… with @Anru_Zhang, @AlexanderTong7

English

2.8K

Benjamin Perry@bots_and_bits·1d

@pengzhangzhi1 @Anru_Zhang @AlexanderTong7 Great work Fred! Excited to check it out

English

245

Fred Zhangzhi Peng@pengzhangzhi1·1d

English

113

10.1K

Benjamin Perry@bots_and_bits·2d

@wasserstein_rao Need a PLR Claude skill ASAP‼️

English

158

stefan@wasserstein_rao·2d

Science could move so much faster if we just used the best tools we have to automate it.

stefan@wasserstein_rao

Using claude code to directly control a liquid handling robot is such a crazy experience

English

3.2K

Benjamin Perry@bots_and_bits·6 Mar

@pranamanam @_sophia_tang_ Amazing work!

English

Pranam Chatterjee@pranamanam·6 Mar

We're super excited to have BranchSBM published at #ICLR2026!! 🌳🧫🇧🇷 I am so proud of the team! 📷 Camera-Ready Paper: arxiv.org/abs/2506.09007 💻 Github: github.com/sophtang/Branc… 📹 Sophia's Presentation: youtube.com/watch?v=inVYA0… As you may remember, @_sophia_tang_ (alongside our lab's FIRST ever PhD graduate, @yinuo_z98!! 👩‍🎓) elegantly showed that by learns diverging velocity fields and growth dynamics (via decomposing the transport into multiple unbalanced Schrödinger bridges), we can get probability mass to split across branches so a single initial state (like a progenitor cell type) can generate complex multi-modal trajectories (i.e., that of terminally differentiated states). Sophia does a wonderful job explaining the new results that we're presenting in our camera-ready version below! 👇Please come and support her and the team at our poster in Brazil! 🇧🇷

YouTube

Sophia Tang@_sophia_tang_

Our paper, “Branched Schrödinger Bridge Matching” (BranchSBM), has been accepted as a main conference paper at #ICLR2026 in Rio! 🌳🧫🇧🇷 In the camera-ready version, we include a new experiment scaling BranchSBM to 11 branches on cell differentiation data! 📷 Check out our freshly updated project page and Github repo below 👇🏻 🌳 Project Page: sophtang.github.io/branch-sbm 📄 Camera-Ready Paper: arxiv.org/abs/2506.09007 💻 Github: github.com/sophtang/Branc… 📹 Reading Group Presentation: youtu.be/inVYA0pQ4Wg?si… Branching is ubiquitous in many dynamical systems, including cell differentiation into distinct fates, diverging cellular responses to drug perturbations, and population dynamics. 🧫 But, existing flow matching and SBM frameworks approximate multi-modal distributions by simulating many independent particle trajectories, which are susceptible to mode collapse, with particles concentrating on dominant high-density modes or traversing only low-energy intermediate paths. To address this challenge, we introduce 🌳 BranchSBM 🌳, a framework that learns a set of diverging velocity fields to reconstruct multi-modal target distributions while simultaneously learning growth networks that allocate mass across branches. 🌳 Our key idea was to define the Branched Schrödinger Bridge Problem as the sum of unbalanced generalized Schrödinger bridge problems, where the weight determines the redistribution of mass across each branch over time. 🌳 We introduce a multi-stage training algorithm to learn the optimal branching drift and growth fields that transport mass along a branched trajectory. This allows BranchSBM to capture diverging, energy-minimizing dynamics without requiring intermediate-time supervision and can generate the full branched evolution from a single initial sample. 🌳 We demonstrate the unique capability of BranchSBM to model dynamic branching trajectories in real-world settings, from differentiating single-cell population dynamics (up to 11 branches!) to simulating diverging cellular responses to drug perturbation. On an unrelated note, I wanted to take this post to congratulate my inspiring and endlessly supportive research mentor, @yinuo_z98, who just defended her PhD and is officially a PhD graduate!! 👩🏻‍🎓 We’re super excited to present BranchSBM in Rio this April 🇧🇷, along with new workshop papers to be announced! And of course, very grateful for the support from @AlexanderTong7 and @pranamanam 💫

English

11.5K

Benjamin Perry@bots_and_bits·4 Mar

Multi-objective phage-assisted continuous evolution. Being able to generate large parallel datasets to map and engineer complex fitness landscapes at scale is BIG. Check out @bffswithbiology's work!🧪

Ryan Boileau@bffswithbiology

Aaaand it’s online ahhhhh!!! 🥳🥳 So excited!! The first glimpse of my postdoc work with @chorye @dukecagt. Here, @stefanmgolas and I developed TurboPRANCE, an open-source robotics platform for rapid and scaled phage-assisted continuous evolutions. 🧪Tweetorial party!👇1/n

English

692

Benjamin Perry@bots_and_bits·3 Mar

@Micro_Yunha exciting work!

English

Yunha Hwang@Micro_Yunha·3 Mar

For a typical microbial genome, all-vs-all PPI prediction with AlphaFold3 would take hundreds of GPU-years. With FlashPPI, we can scale molecular interaction prediction across diverse, non-model microbial genomes, unlocking truly scalable discovery. We deployed FlashPPI on Seqhub.org for intuitive and rapid exploration of PPI network, give it a spin!

English

3.8K

Yunha Hwang@Micro_Yunha·3 Mar

Protein–protein interactions (PPIs) are key to discovering and interpreting new biological functions. We’re excited to introduce 𝑭𝒍𝒂𝒔𝒉𝑷𝑷𝑰: a new application of gLM2 that uses genomic language modeling to predict proteome-wide PPIs in microbial genomes in minutes.

GIF

English

448

21.8K

Benjamin Perry@bots_and_bits·27 Şub

@DhuviKarthikey1 sounds like you should use a claude cowork scheduled task to brief your lab on AI tool updates

English

Benjamin Perry retweetledi

dhuv.io@DhuviKarthikey1·26 Şub

Gave a presentation last week to the lab on using AI tools and it’s half outdated alr 🫠🥴

English

679

dhuv.io@DhuviKarthikey1·26 Şub

wtf is this rate of shipping…

Claude@claudeai

New in Cowork: scheduled tasks. Claude can now complete recurring tasks at specific times automatically: a morning brief, weekly spreadsheet updates, Friday team presentations.

English

344

Benjamin Perry retweetledi

Christian Dallago@sacdallago·27 Şub

Five years ago, we released FLIP. The core question was: can ML models for protein fitness prediction generalize in the ways that actually matter for protein engineering, i.e. low data, extrapolation to more mutations, out-of-distribution sequences?

English

4.6K

Benjamin Perry@bots_and_bits·26 Şub

Finally some better protein fitness landscape benchmarks💪

Kevin K. Yang 楊凱筌@KevinKaichuang

We made FLIP2, a protein fitness benchmark spanning seven new datasets, including enzymes, protein-protein interactions, and light-sensitive proteins, as well as splits that measure generalization relevant to real-world protein engineering campaigns.

English

2.1K

Benjamin Perry@bots_and_bits·21 Şub

Protein Language Modeling made easy on google colab💥

fajie yuan@duguyuan

Want to fine-tune protein language models but don't have ML experience? 💻❌ We've got you covered! 📢 ✅ Previously: ColabSaprot & ColabSeprot (ESM1/2, ProTrek, ProtBert) 🆕 Now Available: ColabESMC & ColabESM3 Links➡️github.com/westlake-repl/… Tutorial➡️youtube.com/watch?v=nmLtjl…

English

5.4K

Benjamin Perry@bots_and_bits·21 Şub

@duguyuan Amazing!

English

fajie yuan@duguyuan·21 Şub

@bots_and_bits After training on Colab, one can easily share their model on Hugging Face with just one click. Others can directly use these shared models on our Colab platform, or re-train them with their own data, and then share them back to the Hub. Everything can be done with a few clicks.

English

fajie yuan@duguyuan·21 Şub

YouTube

fajie yuan@duguyuan

ColabSaprot & SaprotHub are now in @NatureBiotech! 🧬 A user-friendly, no-code platform for training, sharing, and collaborating on protein language models. We also provide ColabSeprot, integrating ESM1b, ESM2, ProTrek, and ProtBert for the community. nature.com/articles/s4158…

English

15.4K

Benjamin Perry@bots_and_bits·21 Şub

@duguyuan Does this integrate with hugging face at all?

English

fajie yuan@duguyuan·21 Şub

Available Colab Links: ColabSaprot (35M,650M): colab.research.google.com/github/westlak… ColabSeprot (ESM1b‑650M, ESM2‑35M/150M/650M, ProTrek‑35M/650M, ProtBERT‑420M): colab.research.google.com/github/westlak… ColabESMC (300M, 600M): colab.research.google.com/github/westlak… ColabESM3 (1.4B): colab.research.google.com/github/westlak…

656

Benjamin Perry@bots_and_bits·21 Şub

@duguyuan Impressive work! Colab for the win 😎

English

103

Benjamin Perry@bots_and_bits·21 Şub

@andrewwhite01 looking forward to reading😎

English

297

Andrew White 🐦‍⬛@andrewwhite01·21 Şub

After a few years of procrastination, I've updated my textbook. Changes: 1. Tensorflow -> PyTorch 2. Darkmode 3. Added scaffold split section 4. Fixed many typos

English

657

26.3K

Benjamin Perry@bots_and_bits·20 Şub

@tranvinq really exciting work!

English

Vince Tran@tranvinq·20 Şub

Excited to see this out! Grateful to Patrick for his mentorship and support throughout the arc of this story. We look forward to seeing what the community will engineer with MULTI-evolve!

Patrick Hsu@pdhsu

Delighted to share new @arcinstitute work from our group on AI-accelerated lab-in-the-loop, in @ScienceMagazine today One of the most remarkable things about biology is that it's digital. DNA, RNA, proteins: these are all sequences, and their function is directly encoded in their sequence of letters. But a protein of length N has 20^N possible variants and the vast majority are non-functional. Evolution spent billions of years finding the functional needles in this haystack through random exploration and natural selection. For modern biomedicine, we need to solve this in days to weeks.

English

2.2K

Benjamin Perry@bots_and_bits·20 Şub

@kabirhbiswas @romerolab1 Hi Kabir! Good question. We separate MSA + Folding in our method; however, folding is still needed. So any complex that would cause VRAM issues in the default AF3 folding pipeline would encounter the same issues here.

English

Kabir H Biswas, PhD@kabirhbiswas·20 Şub

@romerolab1 Congratulations to you and the team! Any chance that these improvements will also allow prediction of larger structures/complexes with the same GPU RAM?

English

176

Romero lab@romerolab1·19 Şub

AlphaFold 3 is a game-changer for biomolecular modeling, but the CPU-bound MSA bottleneck is a major hurdle for high-throughput discovery. Today, Romero lab introduces AlphaFast: our new framework that delivers a 22.8x speedup in AF3 inference on a single GPU. 🚀 1/5

English

140

7.6K

Benjamin Perry@bots_and_bits·19 Şub

@TensorTwerker @navvye Sounds like a good test to run😎

English

nabbo (bio/acc)@TensorTwerker·19 Şub

@bots_and_bits @navvye yeah but I feel boltz-2 is already very fast

English

nabbo (bio/acc)@TensorTwerker·19 Şub

needs AlphaFold3 weights 💔💔💔

Benjamin Perry@bots_and_bits

AlphaFold 3 just got a massive speed boost. 🚀 We’re introducing AlphaFast: a GPU-accelerated framework that cuts AF3 inference from >10 mins to ~25 seconds on a single GPU–a 22.8x speedup–without losing structural accuracy. More details below! 1/6 🧵

English

3.7K

Benjamin Perry@bots_and_bits·19 Şub

@QosmosChem Yes. We wanted to start with AF3 but the general framework should apply to any folding model!

English

818

QosmosChem@QosmosChem·19 Şub

@bots_and_bits Theoretically could this be applied to OpenFold or Boltz?

English

1.1K

Benjamin Perry@bots_and_bits·19 Şub

English

100

711

37.6K

Keşfet

@samsinai @pengzhangzhi1 @Anru_Zhang @AlexanderTong7 @wasserstein_rao @pranamanam @_sophia_tang_ @yinuo_z98