Data Management Pipeline for Plant Phenotyping in a Multisite Project
Overview
Affiliations
In plant breeding, plants have to be characterised precisely, consistently and rapidly by different people at several field sites within defined time spans. For a meaningful data evaluation and statistical analysis, standardised data storage is required. Data access must be provided on a long-term basis and be independent of organisational barriers without endangering data integrity or intellectual property rights. We discuss the associated technical challenges and demonstrate adequate solutions exemplified in a data management pipeline for a project to identify markers for drought tolerance in potato. This project involves 11 groups from academia and breeding companies, 11 sites and four analytical platforms. Our data warehouse concept combines central data storage in databases and a file server and integrates existing and specialised database solutions for particular data types with new, project-specific databases. The strict use of controlled vocabularies and the application of web-access technologies proved vital to the successful data exchange between diverse institutes and data management concepts and infrastructures. By presenting our data management system and making the software available, we aim to support related phenotyping projects.
Yang W, Feng H, Hu X, Song J, Guo J, Lu B Methods Mol Biol. 2024; 2787:3-38.
PMID: 38656479 DOI: 10.1007/978-1-0716-3778-4_1.
Data synthesis for crop variety evaluation. A review.
Brown D, Van den Bergh I, de Bruin S, Machida L, van Etten J Agron Sustain Dev. 2020; 40(4):25.
PMID: 32863892 PMC: 7440334. DOI: 10.1007/s13593-020-00630-7.
Honecker A, Schumann H, Becirevic D, Klingbeil L, Volland K, Forberig S Plant Methods. 2020; 16:55.
PMID: 32336978 PMC: 7171732. DOI: 10.1186/s13007-020-00596-3.
Modeling Crop Genetic Resources Phenotyping Information Systems.
Germeier C, Unger S Front Plant Sci. 2019; 10:728.
PMID: 31281323 PMC: 6597887. DOI: 10.3389/fpls.2019.00728.
Kohl K, Gremmels J Plant Methods. 2015; 11:25.
PMID: 25866550 PMC: 4393613. DOI: 10.1186/s13007-015-0069-3.