TY - JOUR
T1 - Machine learning for classification of postoperative patient status using standardized medical data
AU - Yamashita, Takanori
AU - Wakata, Yoshifumi
AU - Nakaguma, Hideki
AU - Nohara, Yasunobu
AU - Hato, Shinj
AU - Kawamura, Susumu
AU - Muraoka, Shuko
AU - Sugita, Masatoshi
AU - Okada, Mihoko
AU - Nakashima, Naoki
AU - Soejima, Hidehisa
N1 - Funding Information:
This work was supported by the AMED (Japan Agency for Medical Research and Development) (20le0210001h0003).
Publisher Copyright:
© 2021
PY - 2022/2
Y1 - 2022/2
N2 - Background and objective: Real-world evidence is defined as clinical evidence regarding the use and potential benefits or risks of a medical product derived from real-world data analyses. Standardization and structuring of data are necessary to analyze medical real-world data collected from different medical institutions. An electronic message and repository have been developed to link electronic medical records in this research project, which has simplified the data integration. Therefore, this paper proposes an analysis method and learning health systems to determine the priority of clinical intervention by clustering and visualizing time-series and prioritizing patient outcomes and status during hospitalization. Methods: Common data items for reimbursement (Diagnosis Procedure Combination [DPC]) and clinical pathway data were examined in this project at each participating institution that runs the verification test. Long-term hospitalization data were analyzed using the data stored in the cloud platform of the institutions’ repositories using multiple machine learning methods for classification, visualization, and interpretation. Results: The ePath platform contributed to integrate the standardized data from multiple institutions. The distribution of DPC items or variances could be confirmed by clustering, temporal tendency through the directed graph, and extracting variables that contributed to the prediction and evaluation of SHapley Additive Explanation effects. Constipation was determined to be the risk factor most strongly related to long-term hospitalization. Drainage management was identified as a factor that can improve long-term hospitalization. These analyses effectively extracted patient status to provide feedback to the learning health system. Conclusions: We successfully generated evidence of medical processes by gathering patient status, medical purposes, and patient outcomes with high data quality from multiple institutions, which were difficult with conventional electronic medical records. Regarding the significant analysis results, the learning health system will be used on this project to provide feedback to each institution, operate it for a certain period, and analyze and re-evaluate it.
AB - Background and objective: Real-world evidence is defined as clinical evidence regarding the use and potential benefits or risks of a medical product derived from real-world data analyses. Standardization and structuring of data are necessary to analyze medical real-world data collected from different medical institutions. An electronic message and repository have been developed to link electronic medical records in this research project, which has simplified the data integration. Therefore, this paper proposes an analysis method and learning health systems to determine the priority of clinical intervention by clustering and visualizing time-series and prioritizing patient outcomes and status during hospitalization. Methods: Common data items for reimbursement (Diagnosis Procedure Combination [DPC]) and clinical pathway data were examined in this project at each participating institution that runs the verification test. Long-term hospitalization data were analyzed using the data stored in the cloud platform of the institutions’ repositories using multiple machine learning methods for classification, visualization, and interpretation. Results: The ePath platform contributed to integrate the standardized data from multiple institutions. The distribution of DPC items or variances could be confirmed by clustering, temporal tendency through the directed graph, and extracting variables that contributed to the prediction and evaluation of SHapley Additive Explanation effects. Constipation was determined to be the risk factor most strongly related to long-term hospitalization. Drainage management was identified as a factor that can improve long-term hospitalization. These analyses effectively extracted patient status to provide feedback to the learning health system. Conclusions: We successfully generated evidence of medical processes by gathering patient status, medical purposes, and patient outcomes with high data quality from multiple institutions, which were difficult with conventional electronic medical records. Regarding the significant analysis results, the learning health system will be used on this project to provide feedback to each institution, operate it for a certain period, and analyze and re-evaluate it.
UR - http://www.scopus.com/inward/record.url?scp=85121580926&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85121580926&partnerID=8YFLogxK
U2 - 10.1016/j.cmpb.2021.106583
DO - 10.1016/j.cmpb.2021.106583
M3 - Article
C2 - 34959156
AN - SCOPUS:85121580926
SN - 0169-2607
VL - 214
JO - Computer Methods and Programs in Biomedicine
JF - Computer Methods and Programs in Biomedicine
M1 - 106583
ER -