CBRL and CBRC: Novel algorithms for improving missing value imputation accuracy based on bayesian ridge regression

Samih M. Mostafa, Abdelrahman S. Eladimy, Safwat Hamad, Hirofumi Amano

研究成果: Contribution to journalArticle査読

抄録

In most scientific studies such as data analysis, the existence of missing data is a critical problem, and selecting the appropriate approach to deal with missing data is a challenge. In this paper, the authors perform a fair comparative study of some practical imputation methods used for handling missing values against two proposed imputation algorithms. The proposed algorithms depend on the Bayesian Ridge technique under two different feature selection conditions. The proposed algorithms differ from the existing approaches in that they cumulate the imputed features; those imputed features will be incorporated within the Bayesian Ridge equation for predicting the missing values in the next incomplete selected feature. The authors applied the proposed algorithms on eight datasets with different amount of missing values created from different missingness mechanisms. The performance was measured in terms of imputation time, root-mean-square error (RMSE), coefficient of determination (R2), and mean absolute error (MAE). The results showed that the performance varies depending on missing values percentage, size of the dataset, and the missingness mechanism. In addition, the performance of the proposed methods is slightly better.

本文言語英語
論文番号1594
ジャーナルSymmetry
12
10
DOI
出版ステータス出版済み - 10 2020

All Science Journal Classification (ASJC) codes

  • コンピュータ サイエンス(その他)
  • 化学(その他)
  • 数学 (全般)
  • 物理学および天文学(その他)

フィンガープリント

「CBRL and CBRC: Novel algorithms for improving missing value imputation accuracy based on bayesian ridge regression」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル