Benchmarking a wide range of chemical descriptors for drug-target interaction prediction using a chemogenomic approach

Ryusuke Sawada, Masaaki Kotera, Yoshihiro Yamanishi

Research output: Contribution to journalReview articlepeer-review

39 Citations (Scopus)

Abstract

(Figure Presented) The identification of drug-target interactions, or interactions between drug candidate compounds and target candidate proteins, is a crucial process in genomic drug discovery. In silico chemogenomic methods are recently recognized as a promising approach for genomewide scale prediction of drug-target interactions, but the prediction performance depends heavily on the descriptors and similarity measures of drugs and proteins. In this paper, we investigated the performance of various descriptors and similarity measures of drugs and proteins for the drug-target interaction prediction using a chemogenomic approach. We compared the prediction accuracy of 18 chemical descriptors of drugs (e.g., ECFP, FCFP,E-state, CDK, Klekota-Roth, MACCS, PubChem, Dragon, KCF-S, and graph kernels) and 4 descriptors of proteins (e.g., amino acid composition, domain profile, local sequence similarity, and string kernel) on about one hundred thousand drug-target interactions. We examined the combinatorial effects of drug descriptors and protein descriptors using the same benchmark data under several experimental conditions. Large-scale experiments showed that our proposed KCF-S descriptor worked the best in terms of prediction accuracy. The comparative results are expected to be useful for selecting chemical descriptors in various pharmaceutical applications.

Original languageEnglish
Pages (from-to)719-731
Number of pages13
JournalMolecular Informatics
Volume33
Issue number11-12
DOIs
Publication statusPublished - Nov 24 2014

All Science Journal Classification (ASJC) codes

  • Structural Biology
  • Molecular Medicine
  • Drug Discovery
  • Computer Science Applications
  • Organic Chemistry

Fingerprint

Dive into the research topics of 'Benchmarking a wide range of chemical descriptors for drug-target interaction prediction using a chemogenomic approach'. Together they form a unique fingerprint.

Cite this