» Articles » PMID: 14871864

Poorly Conserved ORFs in the Genome of the Archaea Halobacterium Sp. NRC-1 Correspond to Expressed Proteins

Overview
Journal Bioinformatics
Specialty Biology
Date 2004 Feb 12
PMID 14871864
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: A large fraction of open reading frames (ORFs) identified as 'hypothetical' proteins correspond to either 'conserved hypothetical' proteins, representing sequences homologous to ORFs of unknown function from other organisms, or to hypothetical proteins lacking any significant sequence similarity to other ORFs in the databases. Elucidating the functions and three-dimensional structures of such orphan ORFs, termed ORFans or poorly conserved ORFs (PCOs), is essential for understanding biodiversity. However, it has been claimed that many ORFans may not encode for expressed proteins.

Results: A genome-wide experimental study of 'paralogous PCOs' in the halophilic archaea Halobacterium sp. NRC-1 was conducted. Paralogous PCOs are ORFs with at least one homolog in the same organism, but with no clear homologs in other organisms. The results reveal that mRNA is synthesized for a majority of the Halobacterium sp. NRC-1 paralogous PCO families, including those comprising relatively short proteins, strongly suggesting that these Halobacterium sp. NRC-1 paralogous PCOs correspond to true, expressed proteins. Hence, further computational and experimental studies aimed at characterizing PCOs in this and other organisms are merited. Such efforts could shed light on PCOs' functions and origins, thereby serving to elucidate the vast diversity observed in the genetic material.

Citing Articles

Orphan Genes Shared by Pathogenic Genomes Are More Associated with Bacterial Pathogenicity.

Entwistle S, Li X, Yin Y mSystems. 2019; 4(1).

PMID: 30801025 PMC: 6372840. DOI: 10.1128/mSystems.00290-18.


Structural view of a non Pfam singleton and crystal packing analysis.

Cheng C, Shaw N, Zhang X, Zhang M, Ding W, Wang B PLoS One. 2012; 7(2):e31673.

PMID: 22363703 PMC: 3282739. DOI: 10.1371/journal.pone.0031673.


Structural features and the persistence of acquired proteins.

Narra H, Cordes M, Ochman H Proteomics. 2008; 8(22):4772-81.

PMID: 18924109 PMC: 3014317. DOI: 10.1002/pmic.200800061.


Identification and investigation of ORFans in the viral world.

Yin Y, Fischer D BMC Genomics. 2008; 9:24.

PMID: 18205946 PMC: 2245933. DOI: 10.1186/1471-2164-9-24.


Large-scale comparative genomic ranking of taxonomically restricted genes (TRGs) in bacterial and archaeal genomes.

Wilson G, Feil E, Lilley A, Field D PLoS One. 2007; 2(3):e324.

PMID: 17389915 PMC: 1824705. DOI: 10.1371/journal.pone.0000324.