pFind Studio: a computational solution for mass spectrometry-based proteomics

 

pFind

 

 

 

Institute of Computing Technology,

Chinese Academy of Sciences,

Beijing, China

 

 

Overview

pFind is a search engine system for automated peptide and protein identification from tandem mass spectra.

Although many database search tools are available, challenges remain in identification reliability, sensitivity, and usability. For example, the target-decoy database strategy is widely used to estimate the false positive rate (FPR) of peptide identifications. However, users usually have to do this manually, and existing tools lack an automated module for estimating the FPR. Another problem is the speed of searching high-throughput spectra against huge peptide and protein databases. Improvements in the sensitivity of mass spectrometers and the rapid expansion of databases have increased the scope and complexity of searching. The traditional software architecture, in which all tasks run in a single stand-alone process without any data index, is increasingly inadequate.

 

The newest version of pFind incorporates several newly developed or improved algorithms, modules, and workflows. The system uses the target-decoy database search strategy for automated FPR estimation: users only need to specify a required FPR before searching, and the system then calculates a threshold that achieves that FPR and filters the search results automatically. We also developed a toolbox that indexes protein databases for high-throughput applications, and designed all modules under a parallel-processing-oriented architecture that distributes the computational load efficiently across many computers. These developments greatly improve overall search speed.
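
The idea of choosing a threshold from a required FPR can be illustrated with a minimal Python sketch. It is not pFind's actual implementation. It assumes that targets and decoys were searched together, that higher scores are better, and that the FPR at a cutoff is estimated as the number of decoy hits divided by the number of target hits at or above that cutoff. The names `PSM`, `fpr_threshold`, and `filter_psms` are hypothetical and exist only for this illustration.

```python
from typing import List, NamedTuple, Optional


class PSM(NamedTuple):
    """One peptide-spectrum match (hypothetical record layout)."""
    spectrum_id: str
    score: float      # assumption: higher score means a better match
    is_decoy: bool    # True if the matched peptide comes from the decoy database


def fpr_threshold(psms: List[PSM], max_fpr: float) -> Optional[float]:
    """Return the lowest score cutoff whose estimated FPR is <= max_fpr.

    Assumed estimate: decoy hits / target hits among matches scoring at or
    above the cutoff. Returns None if no cutoff satisfies the requirement.
    """
    ranked = sorted(psms, key=lambda p: p.score, reverse=True)
    targets = decoys = 0
    best = None
    for i, psm in enumerate(ranked):
        if psm.is_decoy:
            decoys += 1
        else:
            targets += 1
        # Evaluate only after the last match of a run of tied scores, so the
        # counts include every match that the cutoff would let through.
        last_of_tie = i + 1 == len(ranked) or ranked[i + 1].score != psm.score
        if last_of_tie and targets > 0 and decoys / targets <= max_fpr:
            best = psm.score  # keep the most permissive cutoff found so far
    return best


def filter_psms(psms: List[PSM], max_fpr: float) -> List[PSM]:
    """Keep target matches that score at or above the computed cutoff."""
    threshold = fpr_threshold(psms, max_fpr)
    if threshold is None:
        return []
    return [p for p in psms if not p.is_decoy and p.score >= threshold]
```

A caller would pass the scored matches from a combined target and decoy search together with the required FPR, for example `filter_psms(matches, 0.01)`, and receive only the target identifications that pass the computed cutoff. Other estimators exist, such as doubling the decoy count and dividing by all hits, and can be substituted in the ratio without changing the rest of the sketch.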

Video and manual

download video(1) video(2) manual...

Publications

  • Le-heng Wang, De-Quan Li, Yan Fu, Hai-Peng Wang, Jing-Fen Zhang, Zuo-Fei Yuan, Rui-Xiang Sun, Rong Zeng, Si-Min He and Wen Gao. pFind 2.0: a software package for peptide and protein identification via tandem mass spectrometry. Rapid Communications in Mass Spectrometry, 21(18), 2985-2991, 2007. [pdf] [abstract] [supplementary information]
  • Dequan Li, Yan Fu, Ruixiang Sun, Charles X. Ling, Yonggang Wei, Hu Zhou, Rong Zeng, Qiang Yang, Simin He and Wen Gao. pFind: a novel database-searching software system for automated peptide and protein identification via tandem mass spectrometry. Bioinformatics, 21(13), 3049-3050, 2005. [pdf] [abstract]
  • Yan Fu, Qiang Yang, Ruixiang Sun, Dequan Li, Rong Zeng, Charles X. Ling and Wen Gao. Exploiting the kernel trick to correlate fragment ions for peptide identification via tandem mass spectrometry. Bioinformatics, 20, 1948-1954, 2004. [pdf] [abstract]