» Articles » PMID: 23800225

Automatic Workflow for the Classification of Local DNA Conformations

Overview
Publisher Biomed Central
Specialty Biology
Date 2013 Jun 27
PMID 23800225
Citations 11
Authors
Affiliations
Soon will be listed here.
Abstract

Background: A growing number of crystal and NMR structures reveals a considerable structural polymorphism of DNA architecture going well beyond the usual image of a double helical molecule. DNA is highly variable with dinucleotide steps exhibiting a substantial flexibility in a sequence-dependent manner. An analysis of the conformational space of the DNA backbone and the enhancement of our understanding of the conformational dependencies in DNA are therefore important for full comprehension of DNA structural polymorphism.

Results: A detailed classification of local DNA conformations based on the technique of Fourier averaging was published in our previous work. However, this procedure requires a considerable amount of manual work. To overcome this limitation we developed an automatic classification method consisting of the combination of supervised and unsupervised approaches. A proposed workflow is composed of k-NN method followed by a non-hierarchical single-pass clustering algorithm. We applied this workflow to analyze 816 X-ray and 664 NMR DNA structures released till February 2013. We identified and annotated six new conformers, and we assigned four of these conformers to two structurally important DNA families: guanine quadruplexes and Holliday (four-way) junctions. We also compared populations of the assigned conformers in the dataset of X-ray and NMR structures.

Conclusions: In the present work we developed a machine learning workflow for the automatic classification of dinucleotide conformations. Dinucleotides with unassigned conformations can be either classified into one of already known 24 classes or they can be flagged as unclassifiable. The proposed machine learning workflow permits identification of new classes among so far unclassifiable data, and we identified and annotated six new conformations in the X-ray structures released since our previous analysis. The results illustrate the utility of machine learning approaches in the classification of local DNA conformations.

Citing Articles

3dDNAscoreA: A scoring function for evaluation of DNA 3D structures.

Zhang Y, Yang C, Xiong Y, Xiao Y Biophys J. 2024; 123(17):2696-2704.

PMID: 38409781 PMC: 11393702. DOI: 10.1016/j.bpj.2024.02.018.


Accurate prediction of B-form/A-form DNA conformation propensity from primary sequence: A machine learning and free energy handshake.

Gupta A, Kulkarni M, Mukherjee A Patterns (N Y). 2021; 2(9):100329.

PMID: 34553171 PMC: 8441556. DOI: 10.1016/j.patter.2021.100329.


Ion Binding Properties and Dynamics of the 2 G-Quadruplex Using a Polarizable Force Field.

Ratnasinghe B, Salsbury A, Lemkul J J Chem Inf Model. 2020; 60(12):6476-6488.

PMID: 33264004 PMC: 7775346. DOI: 10.1021/acs.jcim.0c01064.


Biologically important conformational features of DNA as interpreted by quantum mechanics and molecular mechanics computations of its simple fragments.

Poltev V, Anisimov V, Dominguez V, Gonzalez E, Deriabina A, Garcia D J Mol Model. 2018; 24(2):46.

PMID: 29392428 DOI: 10.1007/s00894-018-3589-8.


A DNA structural alphabet provides new insight into DNA flexibility.

Schneider B, Boaeikova P, Necasova I, cech P, Svozil D, cerny J Acta Crystallogr D Struct Biol. 2018; 74(Pt 1):52-64.

PMID: 29372899 PMC: 5786007. DOI: 10.1107/S2059798318000050.


References
1.
Nikolova E, Bascom G, Andricioaei I, Al-Hashimi H . Probing sequence-specific DNA flexibility in a-tracts and pyrimidine-purine steps by nuclear magnetic resonance (13)C relaxation and molecular dynamics simulations. Biochemistry. 2012; 51(43):8654-64. PMC: 3676944. DOI: 10.1021/bi3009517. View

2.
Jain A, Wang G, Vasquez K . DNA triple helices: biological consequences and therapeutic potential. Biochimie. 2008; 90(8):1117-30. PMC: 2586808. DOI: 10.1016/j.biochi.2008.02.011. View

3.
Tisne C, Hantz E, Hartmann B, Delepierre M . Solution structure of a non-palindromic 16 base-pair DNA related to the HIV-1 kappa B site: evidence for BI-BII equilibrium inducing a global dynamic curvature of the duplex. J Mol Biol. 1998; 279(1):127-42. DOI: 10.1006/jmbi.1998.1757. View

4.
Sims G, Kim S . Global mapping of nucleic acid conformational space: dinucleoside monophosphate conformations and transition pathways among conformational classes. Nucleic Acids Res. 2003; 31(19):5607-16. PMC: 206451. DOI: 10.1093/nar/gkg750. View

5.
Lilley D . Structures of helical junctions in nucleic acids. Q Rev Biophys. 2000; 33(2):109-59. DOI: 10.1017/s0033583500003590. View