TY - JOUR
T1 - ReSAPP
T2 - Predicting overlapping protein complexes by merging multiple-sampled partitions of proteins
AU - Kobiki, So
AU - Maruyama, Osamu
N1 - Funding Information:
The authors would like to thank anonymous reviewers for their valuable comments and suggestions, which were helpful in improving the draft. We would also like to thank Chern Han Yong and Limsoon Wong for their comments on CMC and an execution of CMC with a normalized WI–PHI PPIs. This work was partially supported by JSPS KAKENHI Grant Number 26330330.
Publisher Copyright:
© 2014 The Authors.
PY - 2014/12/29
Y1 - 2014/12/29
N2 - Many proteins are known to perform their own functions when they form particular groups of proteins, called protein complexes. With the advent of large-scale protein-protein interaction (PPI) studies, it has been a challenging problem in systems biology to predict protein complexes from PPIs. In this paper, we propose a novel method, called Repeated Simulated Annealing of Partitions of Proteins (ReSAPP), which predicts protein complexes from weighted PPIs. ReSAPP, in the first stage, generates multiple (possibly different) partitions of all proteins of given PPIs by repeatedly applying a simulated annealing based optimization algorithm to the PPIs. In the second stage, all different clusters of size two or more in those multiple partitions are merged into a collection of those clusters, which are outputted as predicted protein complexes. In performance comparison of ReSAPP with our previous algorithm, PPSampler2, as well as other various tools, MCL, MCODE, DPClus, CMC, COACH, RRW, NWE, and PPSampler1, ReSAPP is shown to outperform the other methods. Furthermore, the value of F-measure of ReSAPP is higher than that of the variant of ReSAPP without merging partitions. Thus, we empirically conclude that the combination of sampling multiple partitions and merging them is effective to predict protein complexes.
AB - Many proteins are known to perform their own functions when they form particular groups of proteins, called protein complexes. With the advent of large-scale protein-protein interaction (PPI) studies, it has been a challenging problem in systems biology to predict protein complexes from PPIs. In this paper, we propose a novel method, called Repeated Simulated Annealing of Partitions of Proteins (ReSAPP), which predicts protein complexes from weighted PPIs. ReSAPP, in the first stage, generates multiple (possibly different) partitions of all proteins of given PPIs by repeatedly applying a simulated annealing based optimization algorithm to the PPIs. In the second stage, all different clusters of size two or more in those multiple partitions are merged into a collection of those clusters, which are outputted as predicted protein complexes. In performance comparison of ReSAPP with our previous algorithm, PPSampler2, as well as other various tools, MCL, MCODE, DPClus, CMC, COACH, RRW, NWE, and PPSampler1, ReSAPP is shown to outperform the other methods. Furthermore, the value of F-measure of ReSAPP is higher than that of the variant of ReSAPP without merging partitions. Thus, we empirically conclude that the combination of sampling multiple partitions and merging them is effective to predict protein complexes.
UR - http://www.scopus.com/inward/record.url?scp=84930033464&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84930033464&partnerID=8YFLogxK
U2 - 10.1142/S0219720014420049
DO - 10.1142/S0219720014420049
M3 - Article
C2 - 25385080
AN - SCOPUS:84930033464
VL - 12
JO - Journal of Bioinformatics and Computational Biology
JF - Journal of Bioinformatics and Computational Biology
SN - 0219-7200
IS - 6
M1 - 1442004
ER -