TY - GEN
T1 - Recognition of doctors' cursive handwritten medical words by using bidirectional LSTM and SRP data augmentation
AU - Tabassum, Shaira
AU - Takahashi, Ryo
AU - Rahman, Md Mahmudur
AU - Imamura, Yosuke
AU - Sixian, Luo
AU - Rahman, Md Moshiur
AU - Ahmed, Ashir
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/5/17
Y1 - 2021/5/17
N2 - Inability to read doctors' handwritten prescriptions causes 7, 000 deaths a year in a developed country like the US. The situation should be worse in developing countries where more doctors use handwriting prescriptions. In Bangladesh, the writings become more indecipherable as they contain both English and Bangla words with Latin abbreviations of medical terms. As a result, patients and pharmacists find them difficult to read and the pharmacists provide wrong medicines. In order to ease the difficulty of reading doctors' prescriptions, this paper proposes an online handwritten recognition system to predict the doctors' handwriting and develop a digital prescription. To build this system, the 'Handwritten Medical Term Corpus' dataset is introduced which contains 17, 431 data samples of 480 words (360 English and 120 Bangla) from 39 Bangladeshi doctors and medical professionals. A bigger sample size can improve the recognition efficiency. A new data augmentation technique SRP (Stroke Rotation and Parallel shift) method is proposed to widen the variety of handwriting styles and increase the sample size. A sequence of line data is extracted from the augmented image dataset of 1, 591, 100 samples which is fed to a Bidirectional LSTM model. The proposed method has achieved 89.5% accuracy which is 16.1% higher than the recognition accuracy with no data expansion. This technology can reduce medical errors and save medical cost and ensure healthy living.
AB - Inability to read doctors' handwritten prescriptions causes 7, 000 deaths a year in a developed country like the US. The situation should be worse in developing countries where more doctors use handwriting prescriptions. In Bangladesh, the writings become more indecipherable as they contain both English and Bangla words with Latin abbreviations of medical terms. As a result, patients and pharmacists find them difficult to read and the pharmacists provide wrong medicines. In order to ease the difficulty of reading doctors' prescriptions, this paper proposes an online handwritten recognition system to predict the doctors' handwriting and develop a digital prescription. To build this system, the 'Handwritten Medical Term Corpus' dataset is introduced which contains 17, 431 data samples of 480 words (360 English and 120 Bangla) from 39 Bangladeshi doctors and medical professionals. A bigger sample size can improve the recognition efficiency. A new data augmentation technique SRP (Stroke Rotation and Parallel shift) method is proposed to widen the variety of handwriting styles and increase the sample size. A sequence of line data is extracted from the augmented image dataset of 1, 591, 100 samples which is fed to a Bidirectional LSTM model. The proposed method has achieved 89.5% accuracy which is 16.1% higher than the recognition accuracy with no data expansion. This technology can reduce medical errors and save medical cost and ensure healthy living.
UR - http://www.scopus.com/inward/record.url?scp=85112223652&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85112223652&partnerID=8YFLogxK
U2 - 10.1109/TEMSCON-EUR52034.2021.9488622
DO - 10.1109/TEMSCON-EUR52034.2021.9488622
M3 - Conference contribution
AN - SCOPUS:85112223652
T3 - 2021 IEEE Technology and Engineering Management Conference - Europe, TEMSCON-EUR 2021
BT - 2021 IEEE Technology and Engineering Management Conference - Europe, TEMSCON-EUR 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE Technology and Engineering Management Conference - Europe, TEMSCON-EUR 2021
Y2 - 17 May 2021 through 20 May 2021
ER -