EnvCNN: A Convolutional Neural Network Model for Evaluating Isotopic Envelopes in Top-Down Mass-Spectral Deconvolution

Anal Chem. 2020 Jun 2;92(11):7778-7785. doi: 10.1021/acs.analchem.0c00903. Epub 2020 May 13.

Abstract

Top-down mass spectrometry has become the main method for intact proteoform identification, characterization, and quantitation. Because of the complexity of top-down mass spectrometry data, spectral deconvolution is an indispensable step in spectral data analysis, which groups spectral peaks into isotopic envelopes and extracts monoisotopic masses of precursor or fragment ions. The performance of spectral deconvolution methods relies heavily on their scoring functions, which distinguish correct envelopes from incorrect ones. A good scoring function increases the accuracy of deconvoluted masses reported from mass spectra. In this paper, we present EnvCNN, a convolutional neural network-based model for evaluating isotopic envelopes. We show that the model outperforms other scoring functions in distinguishing correct envelopes from incorrect ones and that it increases the number of identifications and improves the statistical significance of identifications in top-down spectral interpretation.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Brain
  • Databases, Protein
  • Female
  • Humans
  • Machine Learning
  • Mass Spectrometry
  • Neural Networks, Computer*
  • Ovarian Neoplasms / pathology*
  • Zebrafish