TY - JOUR
T1 - Benchmarking a wide range of chemical descriptors for drug-target interaction prediction using a chemogenomic approach
AU - Sawada, Ryusuke
AU - Kotera, Masaaki
AU - Yamanishi, Yoshihiro
N1 - Publisher Copyright:
© 2014 Wiley-VCH Verlag GmbH & Co. KGaA.
Copyright:
Copyright 2014 Elsevier B.V., All rights reserved.
PY - 2014/11/24
Y1 - 2014/11/24
N2 - (Figure Presented) The identification of drug-target interactions, or interactions between drug candidate compounds and target candidate proteins, is a crucial process in genomic drug discovery. In silico chemogenomic methods are recently recognized as a promising approach for genomewide scale prediction of drug-target interactions, but the prediction performance depends heavily on the descriptors and similarity measures of drugs and proteins. In this paper, we investigated the performance of various descriptors and similarity measures of drugs and proteins for the drug-target interaction prediction using a chemogenomic approach. We compared the prediction accuracy of 18 chemical descriptors of drugs (e.g., ECFP, FCFP,E-state, CDK, Klekota-Roth, MACCS, PubChem, Dragon, KCF-S, and graph kernels) and 4 descriptors of proteins (e.g., amino acid composition, domain profile, local sequence similarity, and string kernel) on about one hundred thousand drug-target interactions. We examined the combinatorial effects of drug descriptors and protein descriptors using the same benchmark data under several experimental conditions. Large-scale experiments showed that our proposed KCF-S descriptor worked the best in terms of prediction accuracy. The comparative results are expected to be useful for selecting chemical descriptors in various pharmaceutical applications.
AB - (Figure Presented) The identification of drug-target interactions, or interactions between drug candidate compounds and target candidate proteins, is a crucial process in genomic drug discovery. In silico chemogenomic methods are recently recognized as a promising approach for genomewide scale prediction of drug-target interactions, but the prediction performance depends heavily on the descriptors and similarity measures of drugs and proteins. In this paper, we investigated the performance of various descriptors and similarity measures of drugs and proteins for the drug-target interaction prediction using a chemogenomic approach. We compared the prediction accuracy of 18 chemical descriptors of drugs (e.g., ECFP, FCFP,E-state, CDK, Klekota-Roth, MACCS, PubChem, Dragon, KCF-S, and graph kernels) and 4 descriptors of proteins (e.g., amino acid composition, domain profile, local sequence similarity, and string kernel) on about one hundred thousand drug-target interactions. We examined the combinatorial effects of drug descriptors and protein descriptors using the same benchmark data under several experimental conditions. Large-scale experiments showed that our proposed KCF-S descriptor worked the best in terms of prediction accuracy. The comparative results are expected to be useful for selecting chemical descriptors in various pharmaceutical applications.
UR - http://www.scopus.com/inward/record.url?scp=84915753460&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84915753460&partnerID=8YFLogxK
U2 - 10.1002/minf.201400066
DO - 10.1002/minf.201400066
M3 - Review article
AN - SCOPUS:84915753460
SN - 1868-1743
VL - 33
SP - 719
EP - 731
JO - Molecular Informatics
JF - Molecular Informatics
IS - 11-12
ER -