




Chenchen Han
30 posts























Just evaluated the inverse folding ability of the released ESM3 (esm3_sm_open_v1) on the CATH test set (around 1100 proteins). ESM3 performed better than Saprot but surprisingly inferior to ProteinMPNN🧐. PS: The overall predicitons took ~1.5h on one A40 GPU.



Zhikai uploaded a 6-min tutorial for SaprotHub! 🚀 Biologists can now easily train & share their protein language models. Join us, be a SaprotHub author! #Bioinformatics #ProteinModeling @LTEnjoy @sokrypton Paper: biorxiv.org/content/10.110… Video: youtube.com/watch?v=r42z1h…





Recruited 12 bio students, no coding exp, to use ColabSaprot for re-training, zero-shot mutation, & protein design. They matched AI experts w/o hyper-parameter tuning! With SaprotHub, any biologist can train protein models! @sokrypton @LTEnjoy biorxiv.org/content/10.110…





Now everyone customize/share protein language models for their custom task/dataset via @GoogleColab 🤓 Paper: biorxiv.org/content/10.110… Colab: colab.research.google.com/drive/1nxYBed3… Credit: @LTEnjoy, Zhikai Li, @ChenchenHa42849, @BonnieSwt, Junjie Shan, @XibinBayesZhou, Dacheng Ma, @duguyuan


Excited to share ProTrec, a fast & accurate protein search tool! 30x/60x better seq-func/func-seq retrieval 100x faster than Foldseek & MMseq2 9 tasks: seq-stru, seq-func, struc-fun, etc. Beats ESM2 in 9/11 tasks Thanks to @sokrypton @WChentong biorxiv.org/content/10.110…
