Revisiting Avian 'missing' Genes from De Novo Assembled Transcripts
Overview
Authors
Affiliations
Background: Argument remains as to whether birds have lost genes compared with mammals and non-avian vertebrates during speciation. High quality-reference gene sets are necessary for precisely evaluating gene gain and loss. It is essential to explore new reference transcripts from large-scale de novo assembled transcriptomes to recover the potential hidden genes in avian genomes.
Results: We explored 196 high quality transcriptomic datasets from five bird species to reconstruct transcripts for the purpose of discovering potential hidden genes in the avian genomes. We constructed a relatively complete and high-quality bird transcript database (1,623,045 transcripts after quality control in five birds) from a large amount of avian transcriptomic data, and found most of the presumed missing genes (83.2%) could be recovered in at least one bird species. Most of these genes have been identified for the first time in birds. Our results demonstrate that 67.94% genes have GC content over 50%, while 2.91% genes are AT-rich (AT% > 60%). In our results, 239 (53.59%) genes had a tissue-specific expression index of more than 0.9 in chicken. The missing genes also have lower Ka/Ks values than average (genome-wide: Ka/Ks = 0.99; missing gene: Ka/Ks = 0.90; t-test = 1.25E-14). Among all presumed missing genes, there were 135 for which we did not find any meaningful orthologues in any of the 5 species studied.
Conclusion: Insufficient reference genome quality is the major reason for wrongly inferring missing genes in birds. Those presumably missing genes often have a very strong tissue-specific expression pattern. We show multi-tissue transcriptomic data from various species are necessary for inferring gene family evolution for species with only draft reference genomes.
Zhao Q, Yin Z, Hou Z J Anim Sci Biotechnol. 2025; 16(1):9.
PMID: 39828703 PMC: 11745021. DOI: 10.1186/s40104-024-01141-1.
Wheatcroft D, Backstrom N, Dutoit L, McFarlane S, Mugal C, Wang M G3 (Bethesda). 2024; 15(2).
PMID: 39670717 PMC: 11797017. DOI: 10.1093/g3journal/jkae293.
Yevshin I, Shagimardanova E, Ryabova A, Pintus S, Kolpakov F, Gusev O Int J Mol Sci. 2024; 25(20).
PMID: 39456845 PMC: 11508066. DOI: 10.3390/ijms252011066.
Wu S, Dou T, Yuan S, Yan S, Xu Z, Liu Y BMC Genomics. 2024; 25(1):430.
PMID: 38693501 PMC: 11061957. DOI: 10.1186/s12864-024-10316-z.
The genome of a globally invasive passerine, the common myna, Acridotheres tristis.
Stuart K, Johnson R, Major R, Atsawawaranunt K, Ewart K, Rollins L DNA Res. 2024; 31(2).
PMID: 38366840 PMC: 10917472. DOI: 10.1093/dnares/dsae005.