TY - JOUR
T1 - Prediction of missing enzyme genes in a bacterial metabolic network
T2 - Reconstruction of the lysine-degradation pathway of Pseudomonas aeruginosa
AU - Yamanishi, Yoshihiro
AU - Mihara, Hisaaki
AU - Osaki, Motoharu
AU - Muramatsu, Hisashi
AU - Esaki, Nobuyoshi
AU - Sato, Tetsuya
AU - Hizukuri, Yoshiyuki
AU - Goto, Susumu
AU - Kanehisa, Minoru
PY - 2007/5
Y1 - 2007/5
N2 - The metabolic network is an important biological network which consists of enzymes and chemical compounds. However, a large number of metabolic pathways remains unknown, and most organism-specific metabolic pathways contain many missing enzymes. We present a novel method to identify the genes coding for missing enzymes using available genomic and chemical information from bacterial genomes. The proposed method consists of two steps: (a) estimation of the functional association between the genes with respect to chromosomal proximity and evolutionary association, using supervised network inference; and (b) selection of gene candidates for missing enzymes based on the original candidate score and the chemical reaction information encoded in the EC number. We applied the proposed methods to infer the metabolic network for the bacteria Pseudomonas aeruginosa from two genomic datasets: gene position and phylogenetic profiles. Next, we predicted several missing enzyme genes to reconstruct the lysine-degradation pathway in P. aeruginosa using EC number information. As a result, we identified PA0266 as a putative 5-aminovalerate aminotransferase (EC 2.6.1.48) and PA0265 as a putative glutarate semialdehyde dehydrogenase (EC 1.2.1.20). To verify our prediction, we conducted biochemical assays and examined the activity of the products of the predicted genes, PA0265 and PA0266, in a coupled reaction. We observed that the predicted gene products catalyzed the expected reactions; no activity was seen when both gene products were omitted from the reaction.
AB - The metabolic network is an important biological network which consists of enzymes and chemical compounds. However, a large number of metabolic pathways remains unknown, and most organism-specific metabolic pathways contain many missing enzymes. We present a novel method to identify the genes coding for missing enzymes using available genomic and chemical information from bacterial genomes. The proposed method consists of two steps: (a) estimation of the functional association between the genes with respect to chromosomal proximity and evolutionary association, using supervised network inference; and (b) selection of gene candidates for missing enzymes based on the original candidate score and the chemical reaction information encoded in the EC number. We applied the proposed methods to infer the metabolic network for the bacteria Pseudomonas aeruginosa from two genomic datasets: gene position and phylogenetic profiles. Next, we predicted several missing enzyme genes to reconstruct the lysine-degradation pathway in P. aeruginosa using EC number information. As a result, we identified PA0266 as a putative 5-aminovalerate aminotransferase (EC 2.6.1.48) and PA0265 as a putative glutarate semialdehyde dehydrogenase (EC 1.2.1.20). To verify our prediction, we conducted biochemical assays and examined the activity of the products of the predicted genes, PA0265 and PA0266, in a coupled reaction. We observed that the predicted gene products catalyzed the expected reactions; no activity was seen when both gene products were omitted from the reaction.
UR - http://www.scopus.com/inward/record.url?scp=34247374704&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34247374704&partnerID=8YFLogxK
U2 - 10.1111/j.1742-4658.2007.05763.x
DO - 10.1111/j.1742-4658.2007.05763.x
M3 - Article
C2 - 17388807
AN - SCOPUS:34247374704
SN - 1742-464X
VL - 274
SP - 2262
EP - 2273
JO - FEBS Journal
JF - FEBS Journal
IS - 9
ER -