
A UK-based Ground Truth Data Set of GCMS Analysed Ignitable Liquid Samples - a Template for Making Chromatographic Data Accessible As an Open Source Data Set

Overview
Journal Data Brief
Date 2022 Nov 25
PMID 36425998
Abstract

Fire debris is often recovered during a fire scene investigation to determine whether an ignitable liquid is present, which may be evidence of a deliberate fire. Analysis of fire debris produces chromatograms that a forensic chemist interprets to make that determination. Currently, very few publicly available data sets can be used for training and statistical modelling in this area. The data set in this paper was prepared with these two applications in mind and covers a wide range of ignitable liquids available in the UK. We created a data set of 35 ignitable liquids, including petrol (gasoline) and light, medium and heavy petroleum distillates (e.g. diesel), from several retailers. Each ignitable liquid was systematically evaporated to produce six additional samples. Each sample was analysed repeatedly, giving an overall data set of 751 analytical outputs (including chromatograms). Each data sample is provided in multiple formats, and the metadata, containing the data used in the production of the samples, is included. The folder and file names are designed to prevent files from being misplaced and to allow folders and files to be manipulated systematically using computer code.
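
As an illustration of what systematic manipulation by computer code could look like, the following minimal Python sketch walks a local copy of a data set and groups files by folder and file stem, so that the different formats of one analytical output end up together. The abstract does not describe the actual folder layout or naming scheme, so everything here is an assumption: the function name `index_samples` is hypothetical, and the sketch assumes that the formats of one output share a file stem and differ only in extension.

```python
from collections import defaultdict
from pathlib import Path


def index_samples(root):
    """Group files under `root` by (relative folder, file stem).

    Assumption (not taken from the paper): the different formats of one
    analytical output share a file stem and differ only in extension.
    """
    index = defaultdict(set)
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            # Key each file by its folder (relative to root) and its stem.
            folder = path.parent.relative_to(root).as_posix()
            index[(folder, path.stem)].add(path.suffix.lower())
    # Return plain, sorted lists so the result is easy to inspect.
    return {key: sorted(suffixes) for key, suffixes in index.items()}
```

Calling `index_samples` on the top-level folder of a downloaded copy would return a dictionary that maps each (folder, stem) pair to the list of file extensions found for it, which is one way to check that every output is present in every expected format.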

Citing Articles

A ground truth data set of gas chromatography mass spectrometry (GCMS) analysed synthesised methylenedioxymethylamphetamine (MDMA).

Miller J, Puch-Solis R, Buchanan H, Daeid N. Data Brief. 2023; 47:108931.

PMID: 36819899 PMC: 9929198. DOI: 10.1016/j.dib.2023.108931.
