
Thanks, I agree that the note is absolutely relevant!
I’d guess imputation quality is mainly a question of how relevant the reference panel is. The corner case where both parents’ genomes are available, as in @HeraSight, suggests that PGT coverage of ~0.002–0.01x may be enough for accurate individual genome imputation reconstruction at 99%+ accuracy: herasight.substack.com/p/imputepgta-v2
And in another recent preprint that you tweeted, Barbara Sousa da Mota and colleagues showed that adding ancient genomes to an imputation reference panel improves accuracy for ancient DNA, whereas simply increasing the number of modern genomes does not provide much additional benefit: biorxiv.org/content/10.648…
At the same time, I remember @amythewilliams paper showing that a sample size of >8M enables near-perfect phasing: sciencedirect.com/science/articl…
So this makes me cautiously optimistic that, for relatively recent genomes, increasing both the size and the relevance of reference panels may soon make ~0.07x coverage sufficient for imputing common variants, without requiring pseudohaploidization.
Today in a population genetics class I got a great question from a student: “Why can’t we build a reference panel from low-coverage genomes?”
English




