» Articles » PMID: 20141623

Development of a EST Dataset and Characterization of EST-SSRs in a Traditional Chinese Medicinal Plant, Epimedium Sagittatum (Sieb. Et Zucc.) Maxim

Overview
Journal BMC Genomics
Publisher Biomed Central
Specialty Genetics
Date 2010 Feb 10
PMID 20141623
Citations 86
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Epimedium sagittatum (Sieb. Et Zucc.) Maxim, a traditional Chinese medicinal plant species, has been used extensively as genuine medicinal materials. Certain Epimedium species are endangered due to commercial overexploition, while sustainable application studies, conservation genetics, systematics, and marker-assisted selection (MAS) of Epimedium is less-studied due to the lack of molecular markers. Here, we report a set of expressed sequence tags (ESTs) and simple sequence repeats (SSRs) identified in these ESTs for E. sagittatum.

Results: cDNAs of E. sagittatum are sequenced using 454 GS-FLX pyrosequencing technology. The raw reads are cleaned and assembled into a total of 76,459 consensus sequences comprising of 17,231 contigs and 59,228 singlets. About 38.5% (29,466) of the consensus sequences significantly match to the non-redundant protein database (E-value < 1e-10), 22,295 of which are further annotated using Gene Ontology (GO) terms. A total of 2,810 EST-SSRs is identified from the Epimedium EST dataset. Trinucleotide SSR is the dominant repeat type (55.2%) followed by dinucleotide (30.4%), tetranuleotide (7.3%), hexanucleotide (4.9%), and pentanucleotide (2.2%) SSR. The dominant repeat motif is AAG/CTT (23.6%) followed by AG/CT (19.3%), ACC/GGT (11.1%), AT/AT (7.5%), and AAC/GTT (5.9%). Thirty-two SSR-ESTs are randomly selected and primer pairs are synthesized for testing the transferability across 52 Epimedium species. Eighteen primer pairs (85.7%) could be successfully transferred to Epimedium species and sixteen of those show high genetic diversity with 0.35 of observed heterozygosity (Ho) and 0.65 of expected heterozygosity (He) and high number of alleles per locus (11.9).

Conclusion: A large EST dataset with a total of 76,459 consensus sequences is generated, aiming to provide sequence information for deciphering secondary metabolism, especially for flavonoid pathway in Epimedium. A total of 2,810 EST-SSRs is identified from EST dataset and approximately 1580 EST-SSR markers are transferable. E. sagittatum EST-SSR transferability to the major Epimedium germplasm is up to 85.7%. Therefore, this EST dataset and EST-SSRs will be a powerful resource for further studies such as taxonomy, molecular breeding, genetics, genomics, and secondary metabolism in Epimedium species.

Citing Articles

Identification of New Cultivar and Different Provenances of (Poaceae: Bambusoideae) Using Simple Sequence Repeats Developed from the Whole Genome.

Geng R, Xu J, Jiang J, Cheng Z, Sun M, Xia N Plants (Basel). 2024; 13(20).

PMID: 39458856 PMC: 11511551. DOI: 10.3390/plants13202910.


Illumina-based transcriptomic analysis of the fast-growing leguminous tree : functional gene annotation and identification of novel SSR-markers.

Ishio S, Kusunoki K, Nemoto M, Kanao T, Tamura T Front Plant Sci. 2024; 15:1339958.

PMID: 39268003 PMC: 11390451. DOI: 10.3389/fpls.2024.1339958.


Exploring regulatory network of icariin synthesis in through integrated omics analysis.

Zhu X, Wen S, Gul H, Xu P, Yang Y, Liao X Front Plant Sci. 2024; 15:1409601.

PMID: 38933461 PMC: 11203402. DOI: 10.3389/fpls.2024.1409601.


De Novo Transcriptome Profiling for the Generation and Validation of Microsatellite Markers, Transcription Factors, and Database Development for .

Singh R, Singh A, Mahato A, Paliwal R, Tiwari G, Kumar A Int J Mol Sci. 2023; 24(11).

PMID: 37298166 PMC: 10252285. DOI: 10.3390/ijms24119212.


Genetic and molecular analysis of the anthocyanin pigmentation pathway in .

Mi Y, He R, Wan H, Meng X, Liu D, Huang W Front Plant Sci. 2023; 14:1133616.

PMID: 37063227 PMC: 10090855. DOI: 10.3389/fpls.2023.1133616.


References
1.
Morgante M, Hanafey M, Powell W . Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes. Nat Genet. 2002; 30(2):194-200. DOI: 10.1038/ng822. View

2.
Rozen S, Skaletsky H . Primer3 on the WWW for general users and for biologist programmers. Methods Mol Biol. 1999; 132:365-86. DOI: 10.1385/1-59259-192-2:365. View

3.
Toth G, Gaspari Z, Jurka J . Microsatellites in different eukaryotic genomes: survey and analysis. Genome Res. 2000; 10(7):967-81. PMC: 310925. DOI: 10.1101/gr.10.7.967. View

4.
Chou H, Holmes M . DNA sequence quality trimming and vector removal. Bioinformatics. 2001; 17(12):1093-104. DOI: 10.1093/bioinformatics/17.12.1093. View

5.
Coleman T, Roesser J . RNA secondary structure: an important cis-element in rat calcitonin/CGRP pre-messenger RNA splicing. Biochemistry. 1998; 37(45):15941-50. DOI: 10.1021/bi9808058. View