Predicting target proteins for drug candidate compounds based on drug-induced gene expression data in a chemical structure-independent manner

Yoshiyuki Hizukuri, Ryusuke Sawada, Yoshihiro Yamanishi

Research output: Contribution to journalArticle

14 Citations (Scopus)

Abstract

Background: Phenotype-based high-throughput screening is a useful technique for identifying drug candidate compounds that have a desired phenotype. However, the molecular mechanisms of the hit compounds remain unknown, and substantial effort is required to identify the target proteins associated with the phenotype. Methods: In this study, we propose a new method to predict target proteins of drug candidate compounds based on drug-induced gene expression data in Connectivity Map and a machine learning classification technique, which we call the "transcriptomic approach." Results: Unlike existing methods such as the chemogenomic approach, the transcriptomic approach enabled the prediction of target proteins without dependence on prior knowledge of compound chemical structures. The prediction accuracy of the chemogenomic approach was highly depended on compounds structure similarities in data sets. In contrast, the prediction accuracy of the transcriptomic approach was maintained at a sufficient level, even for benchmark data consisting of structurally diverse compounds. Conclusions: The transcriptomic approach reported here is expected to be a useful tool for structure-independent prediction of target proteins for drug candidate compounds.

Original languageEnglish
Article number82
JournalBMC Medical Genomics
Volume8
Issue number1
DOIs
Publication statusPublished - Dec 18 2015

Fingerprint

Gene Expression
Phenotype
Pharmaceutical Preparations
Proteins
Benchmarking
Datasets
Machine Learning

All Science Journal Classification (ASJC) codes

  • Genetics
  • Genetics(clinical)

Cite this

Predicting target proteins for drug candidate compounds based on drug-induced gene expression data in a chemical structure-independent manner. / Hizukuri, Yoshiyuki; Sawada, Ryusuke; Yamanishi, Yoshihiro.

In: BMC Medical Genomics, Vol. 8, No. 1, 82, 18.12.2015.

Research output: Contribution to journalArticle

@article{c8e8b0edd9604d5b908c8d6e1c62f94b,
title = "Predicting target proteins for drug candidate compounds based on drug-induced gene expression data in a chemical structure-independent manner",
abstract = "Background: Phenotype-based high-throughput screening is a useful technique for identifying drug candidate compounds that have a desired phenotype. However, the molecular mechanisms of the hit compounds remain unknown, and substantial effort is required to identify the target proteins associated with the phenotype. Methods: In this study, we propose a new method to predict target proteins of drug candidate compounds based on drug-induced gene expression data in Connectivity Map and a machine learning classification technique, which we call the {"}transcriptomic approach.{"} Results: Unlike existing methods such as the chemogenomic approach, the transcriptomic approach enabled the prediction of target proteins without dependence on prior knowledge of compound chemical structures. The prediction accuracy of the chemogenomic approach was highly depended on compounds structure similarities in data sets. In contrast, the prediction accuracy of the transcriptomic approach was maintained at a sufficient level, even for benchmark data consisting of structurally diverse compounds. Conclusions: The transcriptomic approach reported here is expected to be a useful tool for structure-independent prediction of target proteins for drug candidate compounds.",
author = "Yoshiyuki Hizukuri and Ryusuke Sawada and Yoshihiro Yamanishi",
year = "2015",
month = "12",
day = "18",
doi = "10.1186/s12920-015-0158-1",
language = "English",
volume = "8",
journal = "BMC Medical Genomics",
issn = "1755-8794",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - Predicting target proteins for drug candidate compounds based on drug-induced gene expression data in a chemical structure-independent manner

AU - Hizukuri, Yoshiyuki

AU - Sawada, Ryusuke

AU - Yamanishi, Yoshihiro

PY - 2015/12/18

Y1 - 2015/12/18

N2 - Background: Phenotype-based high-throughput screening is a useful technique for identifying drug candidate compounds that have a desired phenotype. However, the molecular mechanisms of the hit compounds remain unknown, and substantial effort is required to identify the target proteins associated with the phenotype. Methods: In this study, we propose a new method to predict target proteins of drug candidate compounds based on drug-induced gene expression data in Connectivity Map and a machine learning classification technique, which we call the "transcriptomic approach." Results: Unlike existing methods such as the chemogenomic approach, the transcriptomic approach enabled the prediction of target proteins without dependence on prior knowledge of compound chemical structures. The prediction accuracy of the chemogenomic approach was highly depended on compounds structure similarities in data sets. In contrast, the prediction accuracy of the transcriptomic approach was maintained at a sufficient level, even for benchmark data consisting of structurally diverse compounds. Conclusions: The transcriptomic approach reported here is expected to be a useful tool for structure-independent prediction of target proteins for drug candidate compounds.

AB - Background: Phenotype-based high-throughput screening is a useful technique for identifying drug candidate compounds that have a desired phenotype. However, the molecular mechanisms of the hit compounds remain unknown, and substantial effort is required to identify the target proteins associated with the phenotype. Methods: In this study, we propose a new method to predict target proteins of drug candidate compounds based on drug-induced gene expression data in Connectivity Map and a machine learning classification technique, which we call the "transcriptomic approach." Results: Unlike existing methods such as the chemogenomic approach, the transcriptomic approach enabled the prediction of target proteins without dependence on prior knowledge of compound chemical structures. The prediction accuracy of the chemogenomic approach was highly depended on compounds structure similarities in data sets. In contrast, the prediction accuracy of the transcriptomic approach was maintained at a sufficient level, even for benchmark data consisting of structurally diverse compounds. Conclusions: The transcriptomic approach reported here is expected to be a useful tool for structure-independent prediction of target proteins for drug candidate compounds.

UR - http://www.scopus.com/inward/record.url?scp=84951143211&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84951143211&partnerID=8YFLogxK

U2 - 10.1186/s12920-015-0158-1

DO - 10.1186/s12920-015-0158-1

M3 - Article

C2 - 26684652

AN - SCOPUS:84951143211

VL - 8

JO - BMC Medical Genomics

JF - BMC Medical Genomics

SN - 1755-8794

IS - 1

M1 - 82

ER -