Nazreen Pallikkavaliyaveetil
7 posts

Nazreen Pallikkavaliyaveetil
@nazreenpm
Postdoctoral Associate Computational Biology @AI/ML @Yale
New Haven, Connecticut Beigetreten Nisan 2010
143 Folgt29 Follower

Theseus is live & we’re working with the best customers in the world.
Y Combinator@ycombinator
Theseus (YC S24) made a little triangle that lets any drone fly without GPS. They sold it to the special forces, and they love it. Congrats on the launch, @ilaffey2, @sachalevy3, and @CarlSchoeller! ycombinator.com/launches/Ln6-t…
English
Nazreen Pallikkavaliyaveetil retweetet

🚀 We are at @icmlconf in Vienna presenting #cell2sentence (C2S), the first framework for LLM-based single-cell foundation models! 🧬✨
C2S can generate cells from language prompts, interpret cells, and even generate natural language insights directly from data! 🔍💬📊
Stay tuned for some major model releases this week! 🔥📢
icml.cc/virtual/2024/p…
biorxiv.org/content/10.110…
github.com/vandijklab/cel…

English
Nazreen Pallikkavaliyaveetil retweetet

Major Cell2Sentence update 🎉🔬! We’ve been thrilled to see the attention Cell2Sentence has received from the single-cell community.
Now, we’re excited to release our first update of Cell2Sentence (C2S) - a framework to leverage LLMs to train foundational single-cell models, directly in text.
What’s new & out:
Updated preprint with latest results biorxiv.org/content/10.110…
First full cell model available on the HuggingFace hub huggingface.co/vandijklab/pyt…
Updated codebase for data transformation & training github.com/vandijklab/cel…
We now fine-tune language models to generate entire cells, predict combinatorial cell labels, and generate textual data insights directly from cell sentences.
We train GPT-2 and Pythia models on a large multi-tissue dataset containing 36M cells from @cellxgene as well as an immune tissue dataset containing 270k cells.
C2S LMs achieve SOTA performance in single-cell data generation.
C2S models trained for combinatorial label prediction settings excel in low-data regimes, outperforming single-cell foundation model baselines.
We also show that C2S models benefit from natural language pre-training and always outperform models trained from scratch on cell sentences.
C2S provides a straightforward approach to adapting LLMs for single-cell data analysis, leveraging their natural language capabilities to generate and derive insights from single cells.
We are convinced that C2S’ approach of integrating data modalities through text is the way forward for single-cell foundation models, from representing multi-omics data to generating clinical insights, all in a human readable format.
We’re excited to start building a community around Cell2Sentence! If you also think that C2S will be the framework for single-cell foundation models, and are interested in contributing, reach out to us! We welcome any collaborations and discussions.
Huge thanks to our collaborator @aminkarbasi and the C2S team (@danielflevine, @sachalevy3, @SyedARizvi5688, @nazreenpm, Xingyu Chen, @dzhang03, @GhadermarziSina, Ruiming Wu, Ivan Vrkic, Anna Zhong, Daphne Raskin, Insu Han, @aho_fonseca, @josueortc) for their hard work on C2S! Special thanks to @rahuldhodapkar, who co-supervises this project.
GIF
English
Nazreen Pallikkavaliyaveetil retweetet

Thrilled to announce that CINEMA-OT is now published at Nature Methods!
nature.com/articles/s4159…

English
Nazreen Pallikkavaliyaveetil retweetet

Single Cells as text? We developed Cell2Sentence, a method that allows training of Large Language Models on single-cell data!
biorxiv.org/content/10.110…
With @danielflevine @SyedARizvi5688 @sachalevy3 @rahuldhodapkar
@YaleSEAS @YaleMed #AI #ML #NLP #genomics #CompBio #singlecell

English
Nazreen Pallikkavaliyaveetil retweetet

Introducing BrainLM 🧠🤖the first foundation model for #fMRI analysis trained on 6,700 hours of brain activity data! Fine-tune for specialized tasks or leverage zero-shot inference capabilities!
@WuTsaiYale @YaleCompsci @YaleCBB @YaleMed
biorxiv.org/content/10.110…

English
Nazreen Pallikkavaliyaveetil retweetet

Postdoc positions available @YaleMed @YaleCompsci: ML/AI methods for immune profiling. In collaboration with @VirusesImmunity
postdocs.yale.edu/machine-learni… @YaleCBB @YalePostdocAsso @YaleCVRC Please share!

English