David Lobell retweeted

Pre-training a vision transformer (ViT) for each large dataset/domain is becoming too expensive...💰
We introduce 🔥ExPLoRA🔥 to extend unsupervised pre-training for ViTs on new domains and create foundation models using very few parameters!
arxiv.org/abs/2406.10973
1/8