» Articles » PMID: 31112551

A Chromosome-level Sequence Assembly Reveals the Structure of the Arabidopsis Thaliana Nd-1 Genome and Its Gene Set

Overview
Journal PLoS One
Date 2019 May 22
PMID 31112551
Citations 26
Authors
Affiliations
Soon will be listed here.
Abstract

In addition to the BAC-based reference sequence of the accession Columbia-0 from the year 2000, several short read assemblies of THE plant model organism Arabidopsis thaliana were published during the last years. Also, a SMRT-based assembly of Landsberg erecta has been generated that identified translocation and inversion polymorphisms between two genotypes of the species. Here we provide a chromosome-arm level assembly of the A. thaliana accession Niederzenz-1 (AthNd-1_v2c) based on SMRT sequencing data. The best assembly comprises 69 nucleome sequences and displays a contig length of up to 16 Mbp. Compared to an earlier Illumina short read-based NGS assembly (AthNd-1_v1), a 75 fold increase in contiguity was observed for AthNd-1_v2c. To assign contig locations independent from the Col-0 gold standard reference sequence, we used genetic anchoring to generate a de novo assembly. In addition, we assembled the chondrome and plastome sequences. Detailed analyses of AthNd-1_v2c allowed reliable identification of large genomic rearrangements between A. thaliana accessions contributing to differences in the gene sets that distinguish the genotypes. One of the differences detected identified a gene that is lacking from the Col-0 gold standard sequence. This de novo assembly extends the known proportion of the A. thaliana pan-genome.

Citing Articles

NAVIP: Unraveling the influence of neighboring small sequence variants on functional impact prediction.

Baasner J, Rempel A, Howard D, Pucker B PLoS Comput Biol. 2025; 21(2):e1012732.

PMID: 39964984 PMC: 11849982. DOI: 10.1371/journal.pcbi.1012732.


Disruption of recombination machinery alters the mutational landscape in plant organellar genomes.

Waneka G, Broz A, Wold-McGimsey F, Zou Y, Wu Z, Sloan D bioRxiv. 2024; .

PMID: 38895361 PMC: 11185577. DOI: 10.1101/2024.06.03.597120.


ACMGA: a reference-free multiple-genome alignment pipeline for plant species.

Zhou H, Su X, Song B BMC Genomics. 2024; 25(1):515.

PMID: 38796435 PMC: 11127342. DOI: 10.1186/s12864-024-10430-y.


A pan-genome of 69 Arabidopsis thaliana accessions reveals a conserved genome structure throughout the global species range.

Lian Q, Huettel B, Walkemeier B, Mayjonade B, Lopez-Roques C, Gil L Nat Genet. 2024; 56(5):982-991.

PMID: 38605175 PMC: 11096106. DOI: 10.1038/s41588-024-01715-9.


Automatic annotation of the bHLH gene family in plants.

Thoben C, Pucker B BMC Genomics. 2023; 24(1):780.

PMID: 38102570 PMC: 10722790. DOI: 10.1186/s12864-023-09877-2.


References
1.
. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000; 408(6814):796-815. DOI: 10.1038/35048692. View

2.
Stadermann K, Holtgrawe D, Weisshaar B . Chloroplast Genome Sequence of Arabidopsis thaliana Accession Landsberg erecta, Assembled from Single-Molecule, Real-Time Sequencing Data. Genome Announc. 2016; 4(5). PMC: 5034127. DOI: 10.1128/genomeA.00975-16. View

3.
Simpson J, Pop M . The Theory and Practice of Genome Sequence Assembly. Annu Rev Genomics Hum Genet. 2015; 16:153-72. DOI: 10.1146/annurev-genom-090314-050032. View

4.
Copenhaver G, Pikaard C . RFLP and physical mapping with an rDNA-specific endonuclease reveals that nucleolus organizer regions of Arabidopsis thaliana adjoin the telomeres on chromosomes 2 and 4. Plant J. 1996; 9(2):259-72. DOI: 10.1046/j.1365-313x.1996.09020259.x. View

5.
Hofte H, Desprez T, Amselem J, Chiapello H, Rouze P, Caboche M . An inventory of 1152 expressed sequence tags obtained by partial sequencing of cDNAs from Arabidopsis thaliana. Plant J. 1993; 4(6):1051-61. DOI: 10.1046/j.1365-313x.1993.04061051.x. View