Home > Research > Publications & Outputs > Automated classification of diglucosides glycos...

Electronic data

  • 2020radeckimscbyresearch

    Final published version, 1.2 MB, PDF document

    Available under license: CC BY-NC-ND: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License

Text available via DOI:

View graph of relations

Automated classification of diglucosides glycosidic linkage with ion mobility spectrometry data by machine learning approaches

Research output: ThesisMaster's Thesis

Published

Standard

Automated classification of diglucosides glycosidic linkage with ion mobility spectrometry data by machine learning approaches. / Radecki, Michal.
Lancaster University, 2020. 41 p.

Research output: ThesisMaster's Thesis

Harvard

APA

Vancouver

Author

Bibtex

@mastersthesis{7869c64122e34b1796f93e17de834c8a,
title = "Automated classification of diglucosides glycosidic linkage with ion mobility spectrometry data by machine learning approaches",
abstract = "Unambiguous characterization of carbohydrate products remains a challenging endeavour. The current state-of-the-art techniques are NMR-based approaches, which require large amounts of purified sample and are challenging to annotate. Recently, a strategy has been developed combining gas-phase ion-mobility spectrometry with tandem mass spectrometry to separate and characterize isomeric product ions. Crucially, this can provide information about the stereochemistry that MS alone is often “blind” to because certain monosaccharides have the same m/z (i.e. they are isomeric) and are therefore indistinguishable. Given the amount of data this approach produces, there is a need for a method to rapidly annotate the produced data. The initial strategy involves developing a method that can discern the number of peaks within an IMS spectrum, where it has been shown that product ions derived from α-glucosides produced similar features (2 peaks) whereas β-glucosides only produced a single peak. It was reported that an IMS signal can be approximated as a sum of spectral line shapes (such as Gaussian, Lorentzian or Voigt). Current results show that the approximation method allows analysis of the signal in terms of peaks description. Using this approach, we built a feature matrix from IMS data for different diglucosides, which was then used to train a machine learning classifier able to distinguish between α- and β-glucosides. The performance of the classifier proved that automated classification of glucosides by their bonding type is achievable.",
author = "Michal Radecki",
year = "2020",
doi = "10.17635/lancaster/thesis/1698",
language = "English",
publisher = "Lancaster University",
school = "Lancaster University",

}

RIS

TY - GEN

T1 - Automated classification of diglucosides glycosidic linkage with ion mobility spectrometry data by machine learning approaches

AU - Radecki, Michal

PY - 2020

Y1 - 2020

N2 - Unambiguous characterization of carbohydrate products remains a challenging endeavour. The current state-of-the-art techniques are NMR-based approaches, which require large amounts of purified sample and are challenging to annotate. Recently, a strategy has been developed combining gas-phase ion-mobility spectrometry with tandem mass spectrometry to separate and characterize isomeric product ions. Crucially, this can provide information about the stereochemistry that MS alone is often “blind” to because certain monosaccharides have the same m/z (i.e. they are isomeric) and are therefore indistinguishable. Given the amount of data this approach produces, there is a need for a method to rapidly annotate the produced data. The initial strategy involves developing a method that can discern the number of peaks within an IMS spectrum, where it has been shown that product ions derived from α-glucosides produced similar features (2 peaks) whereas β-glucosides only produced a single peak. It was reported that an IMS signal can be approximated as a sum of spectral line shapes (such as Gaussian, Lorentzian or Voigt). Current results show that the approximation method allows analysis of the signal in terms of peaks description. Using this approach, we built a feature matrix from IMS data for different diglucosides, which was then used to train a machine learning classifier able to distinguish between α- and β-glucosides. The performance of the classifier proved that automated classification of glucosides by their bonding type is achievable.

AB - Unambiguous characterization of carbohydrate products remains a challenging endeavour. The current state-of-the-art techniques are NMR-based approaches, which require large amounts of purified sample and are challenging to annotate. Recently, a strategy has been developed combining gas-phase ion-mobility spectrometry with tandem mass spectrometry to separate and characterize isomeric product ions. Crucially, this can provide information about the stereochemistry that MS alone is often “blind” to because certain monosaccharides have the same m/z (i.e. they are isomeric) and are therefore indistinguishable. Given the amount of data this approach produces, there is a need for a method to rapidly annotate the produced data. The initial strategy involves developing a method that can discern the number of peaks within an IMS spectrum, where it has been shown that product ions derived from α-glucosides produced similar features (2 peaks) whereas β-glucosides only produced a single peak. It was reported that an IMS signal can be approximated as a sum of spectral line shapes (such as Gaussian, Lorentzian or Voigt). Current results show that the approximation method allows analysis of the signal in terms of peaks description. Using this approach, we built a feature matrix from IMS data for different diglucosides, which was then used to train a machine learning classifier able to distinguish between α- and β-glucosides. The performance of the classifier proved that automated classification of glucosides by their bonding type is achievable.

U2 - 10.17635/lancaster/thesis/1698

DO - 10.17635/lancaster/thesis/1698

M3 - Master's Thesis

PB - Lancaster University

ER -