» Articles » PMID: 38504112

Enriched Atlas of LncRNA and Protein-coding Genes for the GRCg7b Chicken Assembly and Its Functional Annotation Across 47 Tissues

Abstract

Gene atlases for livestock are steadily improving thanks to new genome assemblies and new expression data improving the gene annotation. However, gene content varies across databases due to differences in RNA sequencing data and bioinformatics pipelines, especially for long non-coding RNAs (lncRNAs) which have higher tissue and developmental specificity and are harder to consistently identify compared to protein coding genes (PCGs). As done previously in 2020 for chicken assemblies galgal5 and GRCg6a, we provide a new gene atlas, lncRNA-enriched, for the latest GRCg7b chicken assembly, integrating "NCBI RefSeq", "EMBL-EBI Ensembl/GENCODE" reference annotations and other resources such as FAANG and NONCODE. As a result, the number of PCGs increases from 18,022 (RefSeq) and 17,007 (Ensembl) to 24,102, and that of lncRNAs from 5789 (RefSeq) and 11,944 (Ensembl) to 44,428. Using 1400 public RNA-seq transcriptome representing 47 tissues, we provided expression evidence for 35,257 (79%) lncRNAs and 22,468 (93%) PCGs, supporting the relevance of this atlas. Further characterization including tissue-specificity, sex-differential expression and gene configurations are provided. We also identified conserved miRNA-hosting genes with human counterparts, suggesting common function. The annotated atlas is available at gega.sigenae.org.

Citing Articles

Full-length transcriptome sequencing of seven tissues of GuShi chickens.

Tian K, Zhang C, Gao C, Shi J, Xu C, Xie W Poult Sci. 2024; 104(2):104697.

PMID: 39721272 PMC: 11732535. DOI: 10.1016/j.psj.2024.104697.


Comprehensive Annotation and Expression Profiling of C2H2 Zinc Finger Transcription Factors across Chicken Tissues.

Chen S, Jiang J, Liang W, Tang Y, Lyu R, Hu Y Int J Mol Sci. 2024; 25(19).

PMID: 39408854 PMC: 11476951. DOI: 10.3390/ijms251910525.


RNA-seq dataset of the chorioallantoic membrane of male and female chicken embryos, after 11 and 15 days of incubation.

Hennequet-Antier C, Halgrain M, Rehault-Godbert S Data Brief. 2024; 56:110830.

PMID: 39263233 PMC: 11388263. DOI: 10.1016/j.dib.2024.110830.


GEGA (Gallus Enriched Gene Annotation): an online tool providing genomics and functional information across 47 tissues for a chicken gene-enriched atlas gathering Ensembl and Refseq genome annotations.

Degalez F, Bardou P, Lagarrigue S NAR Genom Bioinform. 2024; 6(3):lqae101.

PMID: 39157583 PMC: 11327871. DOI: 10.1093/nargab/lqae101.

References
1.
Zhao L, Wang J, Li Y, Song T, Wu Y, Fang S . NONCODEV6: an updated database dedicated to long non-coding RNA annotation in both animals and plants. Nucleic Acids Res. 2020; 49(D1):D165-D171. PMC: 7779048. DOI: 10.1093/nar/gkaa1046. View

2.
Rinn J, Snyder M . Sexual dimorphism in mammalian gene expression. Trends Genet. 2005; 21(5):298-305. DOI: 10.1016/j.tig.2005.03.005. View

3.
Necsulea A, Soumillon M, Warnefors M, Liechti A, Daish T, Zeller U . The evolution of lncRNA repertoires and expression patterns in tetrapods. Nature. 2014; 505(7485):635-40. DOI: 10.1038/nature12943. View

4.
Uszczynska-Ratajczak B, Lagarde J, Frankish A, Guigo R, Johnson R . Towards a complete map of the human long non-coding RNA transcriptome. Nat Rev Genet. 2018; 19(9):535-548. PMC: 6451964. DOI: 10.1038/s41576-018-0017-y. View

5.
Liu B, Shyr Y, Cai J, Liu Q . Interplay between miRNAs and host genes and their role in cancer. Brief Funct Genomics. 2019; 18(4):255-266. PMC: 6609535. DOI: 10.1093/bfgp/elz002. View