ReSAPP: Predicting overlapping protein complexes by merging multiple-sampled partitions of proteins

So Kobiki, Osamu Maruyama

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

Many proteins are known to perform their own functions when they form particular groups of proteins, called protein complexes. With the advent of large-scale protein-protein interaction (PPI) studies, it has been a challenging problem in systems biology to predict protein complexes from PPIs. In this paper, we propose a novel method, called Repeated Simulated Annealing of Partitions of Proteins (ReSAPP), which predicts protein complexes from weighted PPIs. ReSAPP, in the first stage, generates multiple (possibly different) partitions of all proteins of given PPIs by repeatedly applying a simulated annealing based optimization algorithm to the PPIs. In the second stage, all different clusters of size two or more in those multiple partitions are merged into a collection of those clusters, which are outputted as predicted protein complexes. In performance comparison of ReSAPP with our previous algorithm, PPSampler2, as well as other various tools, MCL, MCODE, DPClus, CMC, COACH, RRW, NWE, and PPSampler1, ReSAPP is shown to outperform the other methods. Furthermore, the value of F-measure of ReSAPP is higher than that of the variant of ReSAPP without merging partitions. Thus, we empirically conclude that the combination of sampling multiple partitions and merging them is effective to predict protein complexes.

Original languageEnglish
Article number1442004
JournalJournal of bioinformatics and computational biology
Volume12
Issue number6
DOIs
Publication statusPublished - Dec 29 2014

Fingerprint

Simulated annealing
Merging
Proteins
Systems Biology

All Science Journal Classification (ASJC) codes

  • Biochemistry
  • Molecular Biology
  • Computer Science Applications

Cite this

@article{b5cb980eb34149f3a852ea40afd0a9ea,
title = "ReSAPP: Predicting overlapping protein complexes by merging multiple-sampled partitions of proteins",
abstract = "Many proteins are known to perform their own functions when they form particular groups of proteins, called protein complexes. With the advent of large-scale protein-protein interaction (PPI) studies, it has been a challenging problem in systems biology to predict protein complexes from PPIs. In this paper, we propose a novel method, called Repeated Simulated Annealing of Partitions of Proteins (ReSAPP), which predicts protein complexes from weighted PPIs. ReSAPP, in the first stage, generates multiple (possibly different) partitions of all proteins of given PPIs by repeatedly applying a simulated annealing based optimization algorithm to the PPIs. In the second stage, all different clusters of size two or more in those multiple partitions are merged into a collection of those clusters, which are outputted as predicted protein complexes. In performance comparison of ReSAPP with our previous algorithm, PPSampler2, as well as other various tools, MCL, MCODE, DPClus, CMC, COACH, RRW, NWE, and PPSampler1, ReSAPP is shown to outperform the other methods. Furthermore, the value of F-measure of ReSAPP is higher than that of the variant of ReSAPP without merging partitions. Thus, we empirically conclude that the combination of sampling multiple partitions and merging them is effective to predict protein complexes.",
author = "So Kobiki and Osamu Maruyama",
year = "2014",
month = "12",
day = "29",
doi = "10.1142/S0219720014420049",
language = "English",
volume = "12",
journal = "Journal of Bioinformatics and Computational Biology",
issn = "0219-7200",
publisher = "World Scientific Publishing Co. Pte Ltd",
number = "6",

}

TY - JOUR

T1 - ReSAPP

T2 - Predicting overlapping protein complexes by merging multiple-sampled partitions of proteins

AU - Kobiki, So

AU - Maruyama, Osamu

PY - 2014/12/29

Y1 - 2014/12/29

N2 - Many proteins are known to perform their own functions when they form particular groups of proteins, called protein complexes. With the advent of large-scale protein-protein interaction (PPI) studies, it has been a challenging problem in systems biology to predict protein complexes from PPIs. In this paper, we propose a novel method, called Repeated Simulated Annealing of Partitions of Proteins (ReSAPP), which predicts protein complexes from weighted PPIs. ReSAPP, in the first stage, generates multiple (possibly different) partitions of all proteins of given PPIs by repeatedly applying a simulated annealing based optimization algorithm to the PPIs. In the second stage, all different clusters of size two or more in those multiple partitions are merged into a collection of those clusters, which are outputted as predicted protein complexes. In performance comparison of ReSAPP with our previous algorithm, PPSampler2, as well as other various tools, MCL, MCODE, DPClus, CMC, COACH, RRW, NWE, and PPSampler1, ReSAPP is shown to outperform the other methods. Furthermore, the value of F-measure of ReSAPP is higher than that of the variant of ReSAPP without merging partitions. Thus, we empirically conclude that the combination of sampling multiple partitions and merging them is effective to predict protein complexes.

AB - Many proteins are known to perform their own functions when they form particular groups of proteins, called protein complexes. With the advent of large-scale protein-protein interaction (PPI) studies, it has been a challenging problem in systems biology to predict protein complexes from PPIs. In this paper, we propose a novel method, called Repeated Simulated Annealing of Partitions of Proteins (ReSAPP), which predicts protein complexes from weighted PPIs. ReSAPP, in the first stage, generates multiple (possibly different) partitions of all proteins of given PPIs by repeatedly applying a simulated annealing based optimization algorithm to the PPIs. In the second stage, all different clusters of size two or more in those multiple partitions are merged into a collection of those clusters, which are outputted as predicted protein complexes. In performance comparison of ReSAPP with our previous algorithm, PPSampler2, as well as other various tools, MCL, MCODE, DPClus, CMC, COACH, RRW, NWE, and PPSampler1, ReSAPP is shown to outperform the other methods. Furthermore, the value of F-measure of ReSAPP is higher than that of the variant of ReSAPP without merging partitions. Thus, we empirically conclude that the combination of sampling multiple partitions and merging them is effective to predict protein complexes.

UR - http://www.scopus.com/inward/record.url?scp=84930033464&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84930033464&partnerID=8YFLogxK

U2 - 10.1142/S0219720014420049

DO - 10.1142/S0219720014420049

M3 - Article

C2 - 25385080

AN - SCOPUS:84930033464

VL - 12

JO - Journal of Bioinformatics and Computational Biology

JF - Journal of Bioinformatics and Computational Biology

SN - 0219-7200

IS - 6

M1 - 1442004

ER -