» Articles » PMID: 33087711

Reference Exome Data for a Northern Brazilian Population

Overview
Journal Sci Data
Specialty Science
Date 2020 Oct 22
PMID 33087711
Citations 1
Authors
Affiliations
Soon will be listed here.
Abstract

Exome sequencing is widely used in the diagnosis of rare genetic diseases and provides useful variant data for analysis of complex diseases. There is not always adequate population-specific reference data to assist in assigning a diagnostic variant to a specific clinical condition. Here we provide a catalogue of variants called after sequencing the exomes of 45 babies from Rio Grande do Nord in Brazil. Sequence data were processed using an 'intersect-then-combine' (ITC) approach, using GATK and SAMtools to call variants. A total of 612,761 variants were identified in at least one individual in this Brazilian Cohort, including 559,448 single nucleotide variants (SNVs) and 53,313 insertion/deletions. Of these, 58,111 overlapped with nonsynonymous (nsSNVs) or splice site (ssSNVs) SNVs in dbNSFP. As an aid to clinical diagnosis of rare diseases, we used the American College of Medicine Genetics and Genomics (ACMG) guidelines to assign pathogenic/likely pathogenic status to 185 (0.32%) of the 58,111 nsSNVs and ssSNVs. Our data set provides a useful reference point for diagnosis of rare diseases in Brazil. (169 words).

Citing Articles

Reference exome data for a Northern Brazilian population.

Weeks A, Francis R, Neri J, Costa N, Arrais N, Lassmann T Sci Data. 2020; 7(1):360.

PMID: 33087711 PMC: 7578642. DOI: 10.1038/s41597-020-00703-y.

References
1.
Wang J, Raskin L, Samuels D, Shyr Y, Guo Y . Genome measures used for quality control are dependent on gene function and ancestry. Bioinformatics. 2014; 31(3):318-23. PMC: 4308666. DOI: 10.1093/bioinformatics/btu668. View

2.
Karczewski K, Weisburd B, Thomas B, Solomonson M, Ruderfer D, Kavanagh D . The ExAC browser: displaying reference data information from over 60 000 exomes. Nucleic Acids Res. 2016; 45(D1):D840-D845. PMC: 5210650. DOI: 10.1093/nar/gkw971. View

3.
Liu X, Jian X, Boerwinkle E . dbNSFP: a lightweight database of human nonsynonymous SNPs and their functional predictions. Hum Mutat. 2011; 32(8):894-9. PMC: 3145015. DOI: 10.1002/humu.21517. View

4.
Tan A, Abecasis G, Kang H . Unified representation of genetic variants. Bioinformatics. 2015; 31(13):2202-4. PMC: 4481842. DOI: 10.1093/bioinformatics/btv112. View

5.
Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J . Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015; 17(5):405-24. PMC: 4544753. DOI: 10.1038/gim.2015.30. View