» Articles » PMID: 20220756

Transcriptome Genetics Using Second Generation Sequencing in a Caucasian Population

Overview
Journal Nature
Specialty Science
Date 2010 Mar 12
PMID 20220756
Citations 513
Authors
Affiliations
Soon will be listed here.
Abstract

Gene expression is an important phenotype that informs about genetic and environmental effects on cellular state. Many studies have previously identified genetic variants for gene expression phenotypes using custom and commercially available microarrays. Second generation sequencing technologies are now providing unprecedented access to the fine structure of the transcriptome. We have sequenced the mRNA fraction of the transcriptome in 60 extended HapMap individuals of European descent and have combined these data with genetic variants from the HapMap3 project. We have quantified exon abundance based on read depth and have also developed methods to quantify whole transcript abundance. We have found that approximately 10 million reads of sequencing can provide access to the same dynamic range as arrays with better quantification of alternative and highly abundant transcripts. Correlation with SNPs (small nucleotide polymorphisms) leads to a larger discovery of eQTLs (expression quantitative trait loci) than with arrays. We also detect a substantial number of variants that influence the structure of mature transcripts indicating variants responsible for alternative splicing. Finally, measures of allele-specific expression allowed the identification of rare eQTLs and allelic differences in transcript structure. This analysis shows that high throughput sequencing technologies reveal new properties of genetic effects on the transcriptome and allow the exploration of genetic effects in cellular processes.

Citing Articles

Comparative analysis of genotype imputation strategies for SNPs calling from RNA-seq.

Guo K, Zhong Z, Zeng H, Zhang C, Chitotombe T, Teng J BMC Genomics. 2025; 26(1):245.

PMID: 40082746 PMC: 11907794. DOI: 10.1186/s12864-025-11411-5.


Sequence-based GWAS in 180,000 German Holstein cattle reveals new candidate variants for milk production traits.

Krizanac A, Reimer C, Heise J, Liu Z, Pryce J, Bennewitz J Genet Sel Evol. 2025; 57(1):3.

PMID: 39905301 PMC: 11796172. DOI: 10.1186/s12711-025-00951-9.


Identifying therapeutic targets for primary ovarian insufficiency through integrated genomic analyses.

Du H, Zeng P, Liu X, Zhang J, Huang Z J Ovarian Res. 2024; 17(1):193.

PMID: 39358799 PMC: 11446024. DOI: 10.1186/s13048-024-01524-y.


Removing unwanted variation between samples in Hi-C experiments.

Fletez-Brant K, Qiu Y, Gorkin D, Hu M, Hansen K Brief Bioinform. 2024; 25(3).

PMID: 38711367 PMC: 11074651. DOI: 10.1093/bib/bbae217.


Statistical Learning of Large-Scale Genetic Data: How to Run a Genome-Wide Association Study of Gene-Expression Data Using the 1000 Genomes Project Data.

Sugolov A, Emmenegger E, Paterson A, Sun L Stat Biosci. 2024; 16(1):250-264.

PMID: 38495080 PMC: 10940486. DOI: 10.1007/s12561-023-09375-9.


References
1.
Sabatti C, Risch N . Homozygosity and linkage disequilibrium. Genetics. 2002; 160(4):1707-19. PMC: 1462072. DOI: 10.1093/genetics/160.4.1707. View

2.
Pickrell J, Marioni J, Pai A, Degner J, Engelhardt B, Nkadori E . Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature. 2010; 464(7289):768-72. PMC: 3089435. DOI: 10.1038/nature08872. View

3.
Pastinen T, Hudson T . Cis-acting regulatory variation in the human genome. Science. 2004; 306(5696):647-50. DOI: 10.1126/science.1101659. View

4.
Stranger B, Forrest M, Clark A, Minichiello M, Deutsch S, Lyle R . Genome-wide associations of gene expression variation in humans. PLoS Genet. 2005; 1(6):e78. PMC: 1315281. DOI: 10.1371/journal.pgen.0010078. View

5.
Stranger B, Forrest M, Dunning M, Ingle C, Beazley C, Thorne N . Relative impact of nucleotide and copy number variation on gene expression phenotypes. Science. 2007; 315(5813):848-53. PMC: 2665772. DOI: 10.1126/science.1136678. View