apLCMS - RASpicer/MetabolomicsTools GitHub Wiki

apLCMS

Version: 6.3.3

Website

http://web1.sph.emory.edu/apLCMS/

Description

This is an R package for the preprocessing of high resolution LC-MS data. It contains functions to reduce noise, identify features, correct for retention time deviation, align features across samples and recover weak-signals from LC-FTMS. First peaks are grouped based on their m/z, using a threshold value which is set using a mixture model. These groups are then split into smaller groups using kernel density functions by their rt and m/z. A run-filter is used to distinguish features from noise. The data are then grouped, with each group tentatively said to contain either one or a few features, with the aim of disentangling features with close m/z and rts. A kernel smoother is first fitted to estimate the number of features and their locations; this can return either a single peak, or if multiple peaks are found a maximisation-expectation (EM) algorithm with pseudo-likelihood is used for estimation. Next a feature table is generated for retention time alignment. Features are first aligned by m/z, using the same strategy as peak picking. A rt threshold value is then found using a model-based search. Pairwise rt differences between all features in a group are found and a triangular density function is to find the threshold. A kernel smoother is then used for rt alignment, using the profile with the most number of features as the template. Grouping is then reperformed. Weak signals from LC-FTMS data can then be recovered. Data is separated into XICs using adaptive binning. A large number of feature measurements are taken from every XIC. XICs are then matched to known metabolites; features mapped to known metabolites are more likely to be real signals. Machine learning techniques are used to find the optimum way of distinguishing between matched and unmatched XICs. Models are compared using cross validation.

Functionality

  • Preprocessing

Instrument Data Type

  • MS/LC-MS/LC-FT-MS

Approaches

  • Untargeted

Computer Skills

Advanced

Software Type

R Package

Interface

Command line interface

Operating System (OS)

  • Unix/Linux
  • Mac OS
  • Windows

Language

R

Dependencies

MASS, rgl, mzR, splines, doSNOW, foreach, iterators, snow, ROCR, ROCS, e1071, randomForest, gbm

Input Formats - Open

mzData, mzML, mzXML, netCDF

Input Formats - Proprietary

N/A

Published

2009

Last Updated

2017

License

GPL (≥ 2)

Paper

http://www.ncbi.nlm.nih.gov/pubmed/25005748

PMID

25005748