Logo image
A Non-parametric Cutout Index for Robust Evaluation of Identified Proteins
Journal article   Open access   Peer reviewed

A Non-parametric Cutout Index for Robust Evaluation of Identified Proteins

Oliver Serang, Joao Paulo, Hanno Steen and Judith A. Steen
Molecular & cellular proteomics, Vol.12(3), pp.807-812
01/03/2013
PMID: 23292186

Abstract

Biochemical Research Methods Biochemistry & Molecular Biology Life Sciences & Biomedicine Science & Technology
This paper proposes a novel, automated method for evaluating sets of proteins identified using mass spectrometry. The remaining peptide-spectrum match score distributions of protein sets are compared to an empirical absent peptide-spectrum match score distribution, and a Bayesian non-parametric method reminiscent of the Dirichlet process is presented to accurately perform this comparison. Thus, for a given protein set, the process computes the likelihood that the proteins identified are correctly identified. First, the method is used to evaluate protein sets chosen using different protein-level false discovery rate (FDR) thresholds, assigning each protein set a likelihood. The protein set assigned the highest likelihood is used to choose a non-arbitrary protein-level FDR threshold. Because the method can be used to evaluate any protein identification strategy (and is not limited to mere comparisons of different FDR thresholds), we subsequently use the method to compare and evaluate multiple simple methods for merging peptide evidence over replicate experiments. The general statistical approach can be applied to other types of data (e. g. RNA sequencing) and generalizes to multivariate problems. Molecular & Cellular Proteomics 12: 10.1074/mcp.O112.022863, 807-812, 2013.
url
https://doi.org/10.1074/mcp.O112.022863View
Published (Version of record) Open

Metrics

Details

Logo image