Missing Value Imputation for Remote Healthcare Data: A Case study of Portable Health Clinic System

Yosuke Imamura, Nuren Abedin, Luo Sixian, Shaira Tabassum, Ashir Ahmed

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This study aims to investigate the best method for imputing missing values in remote healthcare data set. Missing value means an empty field in a health record. It may occur for three major reasons- (i) the parameter was not measured (ii) measured but not recorded and (iii) lost during communications. Our case study, Portable Health Clinic (PHC) data has been collected from multiple regions, by different authorities in different time. PHC data contains manual errors too. Missing and erroneous data are problematic for data analysis and for making accurate predictions. Hence, it is necessary to detect and eliminate error data and also fill the empty fields. Missing value imputation methods are widely known for processing numerical data. PHC data has both numerical and categorical data which makes it difficult to impute. We came up with a new data processing mechanism to feed into existing machine learning algorithm. To test our idea, we used a complete PHC data set (numerical only) without any missing values. Then we generated missing values by randomly erasing a part of the data set. We used several existing imputation methods and our proposed method on the same target data set to compare their performances. It is found that the Mean Imputer, kNN and MissForest are not effective. Iterative Imputer predicted best in 7 features and ours in 4 cases. Therefore, it can be concluded that the effectiveness of imputation methods may vary depending on the specific data set and features. Our future work is to include the categorical data and monitor the performance.

Original languageEnglish
Title of host publicationProceedings of the 2021 International Japan-Africa Conference on Electronics, Communications, and Computations, JAC-ECC 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages85-88
Number of pages4
ISBN (Electronic)9781665482929
DOIs
Publication statusPublished - 2021
Event9th International Japan-Africa Conference on Electronics, Communications, and Computations, JAC-ECC 2021 - Virtual, Online, Egypt
Duration: Dec 13 2021Dec 14 2021

Publication series

NameProceedings of the 2021 International Japan-Africa Conference on Electronics, Communications, and Computations, JAC-ECC 2021

Conference

Conference9th International Japan-Africa Conference on Electronics, Communications, and Computations, JAC-ECC 2021
Country/TerritoryEgypt
CityVirtual, Online
Period12/13/2112/14/21

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Computer Networks and Communications
  • Hardware and Architecture
  • Electrical and Electronic Engineering
  • Instrumentation

Fingerprint

Dive into the research topics of 'Missing Value Imputation for Remote Healthcare Data: A Case study of Portable Health Clinic System'. Together they form a unique fingerprint.

Cite this