
Revisiting Avian 'missing' Genes from De Novo Assembled Transcripts

Overview
Journal BMC Genomics
Publisher BioMed Central
Specialty Genetics
Date 2019 Jan 7
PMID 30611188
Citations 16
Abstract

Background: Whether birds have lost genes relative to mammals and non-avian vertebrates during speciation remains under debate. High-quality reference gene sets are necessary for precisely evaluating gene gain and loss. Exploring new reference transcripts from large-scale de novo assembled transcriptomes is essential for recovering potentially hidden genes in avian genomes.

Results: We explored 196 high-quality transcriptomic datasets from five bird species to reconstruct transcripts and discover potentially hidden genes in avian genomes. From this large body of avian transcriptomic data we constructed a relatively complete, high-quality bird transcript database (1,623,045 transcripts after quality control across the five birds), and found that most of the presumed missing genes (83.2%) could be recovered in at least one bird species. Most of these genes have been identified for the first time in birds. Our results show that 67.94% of these genes have a GC content over 50%, while 2.91% are AT-rich (AT% > 60%). In chicken, 239 genes (53.59%) had a tissue-specific expression index above 0.9. The missing genes also have lower Ka/Ks values than average (genome-wide: Ka/Ks = 0.99; missing genes: Ka/Ks = 0.90; t-test P = 1.25E-14). Among all presumed missing genes, there were 135 for which we did not find any meaningful orthologues in any of the five species studied.
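The abstract does not state how GC content or the tissue-specific expression index were computed. As a minimal sketch, the Python below assumes GC content is the G+C fraction of unambiguous bases and that the index is the τ measure of Yanai et al., which ranges from 0 (uniform expression) to 1 (expression in a single tissue). The function names `gc_fraction` and `tau_index` and the example values are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def gc_fraction(seq: str) -> float:
    """Fraction of G/C among unambiguous A/C/G/T bases (assumed definition)."""
    s = seq.upper()
    gc = s.count("G") + s.count("C")
    at = s.count("A") + s.count("T")
    total = gc + at
    return gc / total if total else 0.0


def tau_index(expression: Sequence[float]) -> float:
    """Tissue-specificity index tau: mean of (1 - x_i / max) over n - 1 tissues."""
    n = len(expression)
    if n < 2:
        raise ValueError("tau needs expression values from at least two tissues")
    peak = max(expression)
    if peak <= 0:
        return 0.0  # not expressed anywhere; specificity is undefined, report 0
    return sum(1 - x / peak for x in expression) / (n - 1)


# Hypothetical transcript and expression profile, not data from the study.
transcript = "GGCCGCATGC"
profile = [0.0, 0.2, 45.0, 0.1]  # expression in four tissues

is_gc_rich = gc_fraction(transcript) > 0.5       # threshold quoted in the abstract
is_at_rich = 1 - gc_fraction(transcript) > 0.6   # threshold quoted in the abstract
is_tissue_specific = tau_index(profile) > 0.9    # threshold quoted in the abstract
```

Under these assumptions, a transcript expressed almost exclusively in one tissue scores close to 1, which is the pattern the abstract reports for more than half of the recovered chicken genes.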

Conclusion: Insufficient reference genome quality is the major reason for wrongly inferring missing genes in birds. These presumed missing genes often show a very strong tissue-specific expression pattern. We show that multi-tissue transcriptomic data from various species are necessary for inferring gene family evolution in species with only draft reference genomes.

Citing Articles

Near telomere-to-telomere genome assemblies of Silkie Gallus gallus and Mallard Anas platyrhynchos restored the structure of chromosomes and "missing" genes in birds.

Zhao Q, Yin Z, Hou Z. J Anim Sci Biotechnol. 2025; 16(1):9.

PMID: 39828703 PMC: 11745021. DOI: 10.1186/s40104-024-01141-1.


Divergence in expression of a singing-related neuroplasticity gene in the brains of 2 Ficedula flycatchers and their hybrids.

Wheatcroft D, Backstrom N, Dutoit L, McFarlane S, Mugal C, Wang M. G3 (Bethesda). 2024; 15(2).

PMID: 39670717 PMC: 11797017. DOI: 10.1093/g3journal/jkae293.


Genome of Russian Snow-White Chicken Reveals Genetic Features Associated with Adaptations to Cold and Diseases.

Yevshin I, Shagimardanova E, Ryabova A, Pintus S, Kolpakov F, Gusev O. Int J Mol Sci. 2024; 25(20).

PMID: 39456845 PMC: 11508066. DOI: 10.3390/ijms252011066.


Annotations of four high-quality indigenous chicken genomes identify more than one thousand missing genes in subtelomeric regions and micro-chromosomes with high G/C contents.

Wu S, Dou T, Yuan S, Yan S, Xu Z, Liu Y. BMC Genomics. 2024; 25(1):430.

PMID: 38693501 PMC: 11061957. DOI: 10.1186/s12864-024-10316-z.


The genome of a globally invasive passerine, the common myna, Acridotheres tristis.

Stuart K, Johnson R, Major R, Atsawawaranunt K, Ewart K, Rollins L. DNA Res. 2024; 31(2).

PMID: 38366840 PMC: 10917472. DOI: 10.1093/dnares/dsae005.

