The subordinate role of pseudogenization to recombinative deletion following polyploidization in angiosperms
Loading...
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Nature Research
Abstract
Extensive gene loss is a hallmark of rediploidization following polyploidization, but its molecular basis remains unclear: whether it occurs primarily through pseudogenization or DNA deletion. Here, we examine pseudogenization in collinear segments from ancient whole-genome multiplications (WGMs) across 12 angiosperms. Although total pseudogenes are abundant, we find far fewer WGM-derived pseudogenes than expected if pseudogenization and DNA deletion contribute equally to gene loss. Simulations of neutrally evolving pseudogenes indicate that, if DNA deletion is absent, pseudogenes should be detectable for far longer than observed in the paleo-polyploid genomes, suggesting gene loss driven by DNA deletion. Analyses of three neo-autopolyploid genomes confirm this pattern: among substantial gene loss, DNA deletions occur on average 1.5 times more frequently than pseudogenization. Our findings imply that gene loss post-polyploidization primarily takes place via DNA deletion, enabled by a genomic environment with an elevated recombination rate created by WGMs. In contrast, small-scale duplications yield scattered duplicated genes, which appear less exposed to deletion and hence result in a high number of pseudogenes. This model is further reinforced by an enrichment of WGM-derived pseudogenes in high recombination regions. Moreover, some pseudogenes may govern a function, as indicated by non-neutral Ka/Ks ratios and overlap with lncRNAs.
Description
DATA AVAILABILITY : The genome data of the paleo-polyploid species used in this study are available in the PLAZA v5.0 database75 under accession codes atr (Amborella trichopoda), ath (Arabidopsis thaliana), bol (Brassica oleracea), bra (Brassica rapa), gma (Glycine max), mdo (Malus domestica), osa (Oryza sativa), ptr (Populus trichocarpa), sly (Solanum lycopersicum), sbi (Sorghum bicolor), vvi (Vitis vinifera), and zma (Zea mays) https://bioinformatics.psb.ugent.be/plaza.dev/_dev_instances/feedback/download/download]. The Saccharum spontaneum AP85-441 genome data used in this study are available at NCBI under accession code GCA_003544955.1 [https://www.ncbi.nlm.nih.gov/datasets/genome/GCA_003544955.1/]. The Actinidia arguta genome data used in this study are available at Genome Warehouse under accession code GWHBJWW00000000 [https://ngdc.cncb.ac.cn/gwh/Assembly/83526/show]. The Solanum tuberosum Otava genome data used in this study are available at SpudDB under accession code Otava v1 [https://spuddb. uga.edu/otava_potato_download.shtml]. Data underlying figures are provided on Zenodo (https://doi.org/10.5281/zenodo.15552828)107. Source file 1 in Zenodo contains the list of identified pseudogenes in twelve paleo-polyploid species.
Keywords
Pseudogenization, Angiosperms, Polyploidization, Deoxyribonucleic acid (DNA)
Sustainable Development Goals
SDG-15: Life on land
SDG-13: Climate action
SDG-13: Climate action
Citation
Crombez, E., Van de Peer, Y. & Li, Z. 2025, 'The subordinate role of pseudogenization to recombinative deletion following polyploidization in angiosperms', Nature Communications, vol. 16, no. 1, art. 6335, pp. 1-15. https://doi.org/10.1038/s41467-025-61676-3.
