Galaxy-M: A Galaxy workflow for processing and analyzing direct infusion and liquid chromatography mass spectrometry-based metabolomics data

Research output: Contribution to journalArticlepeer-review

Colleges, School and Institutes

External organisations

  • GigaScience, BGI-Hong Kong Co. Ltd, Tai Po Industrial Estate, 16 Dai Fu Street, Tai Po, NT Hong Kong ; School of Biosciences, University of Birmingham, Birmingham, B15 2TT UK.
  • School of Biosciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT, United Kingdom.

Abstract

BACKGROUND: Metabolomics is increasingly recognized as an invaluable tool in the biological, medical and environmental sciences yet lags behind the methodological maturity of other omics fields. To achieve its full potential, including the integration of multiple omics modalities, the accessibility, standardization and reproducibility of computational metabolomics tools must be improved significantly.

RESULTS: Here we present our end-to-end mass spectrometry metabolomics workflow in the widely used platform, Galaxy. Named Galaxy-M, our workflow has been developed for both direct infusion mass spectrometry (DIMS) and liquid chromatography mass spectrometry (LC-MS) metabolomics. The range of tools presented spans from processing of raw data, e.g. peak picking and alignment, through data cleansing, e.g. missing value imputation, to preparation for statistical analysis, e.g. normalization and scaling, and principal components analysis (PCA) with associated statistical evaluation. We demonstrate the ease of using these Galaxy workflows via the analysis of DIMS and LC-MS datasets, and provide PCA scores and associated statistics to help other users to ensure that they can accurately repeat the processing and analysis of these two datasets. Galaxy and data are all provided pre-installed in a virtual machine (VM) that can be downloaded from the GigaDB repository. Additionally, source code, executables and installation instructions are available from GitHub.

CONCLUSIONS: The Galaxy platform has enabled us to produce an easily accessible and reproducible computational metabolomics workflow. More tools could be added by the community to expand its functionality. We recommend that Galaxy-M workflow files are included within the supplementary information of publications, enabling metabolomics studies to achieve greater reproducibility.

Details

Original languageEnglish
Article number10
JournalGigaScience
Volume5
Issue number1
Early online date23 Feb 2016
Publication statusPublished - 2016

Keywords

  • Chromatography, Liquid, Computational Biology, Mass Spectrometry, Metabolome, Metabolomics, Principal Component Analysis, Reproducibility of Results, Software, Workflow, Journal Article, Research Support, Non-U.S. Gov't