TY - JOUR
T1 - Combining noise-to-image and image-to-image GANs
T2 - Brain MR image augmentation for tumor detection
AU - Han, Changhee
AU - Rundo, Leonardo
AU - Araki, Ryosuke
AU - Nagano, Yudai
AU - Furukawa, Yujiro
AU - Mauri, Giancarlo
AU - Nakayama, Hideki
AU - Hayashi, Hideaki
N1 - Funding Information:
This work was supported in part by the Qdai-jump Research Program, in part by the JSPS KAKENHI under Grant JP17K12752, and in part by the AMED under Grant JP18lk1010028.
Publisher Copyright:
© 2013 IEEE.
PY - 2019
Y1 - 2019
N2 - Convolutional Neural Networks (CNNs) achieve excellent computer-assisted diagnosis with sufficient annotated training data. However, most medical imaging datasets are small and fragmented. In this context, Generative Adversarial Networks (GANs) can synthesize realistic/diverse additional training images to fill the data lack in the real image distribution; researchers have improved classification by augmenting data with noise-to-image (e.g., random noise samples to diverse pathological images) or image-to-image GANs (e.g., a benign image to a malignant one). Yet, no research has reported results combining noise-to-image and image-to-image GANs for further performance boost. Therefore, to maximize the DA effect with the GAN combinations, we propose a two-step GAN-based DA that generates and refines brain Magnetic Resonance (MR) images with/without tumors separately: (i) Progressive Growing of GANs (PGGANs), multi-stage noise-to-image GAN for high-resolution MR image generation, first generates realistic/diverse 256 × 256 images; (ii) Multimodal UNsupervised Image-to-image Translation (MUNIT) that combines GANs/Variational AutoEncoders or SimGAN that uses a DA-focused GAN loss, further refines the texture/shape of the PGGAN-generated images similarly to the real ones. We thoroughly investigate CNN-based tumor classification results, also considering the influence of pre-training on ImageNet and discarding weird-looking GAN-generated images. The results show that, when combined with classic DA, our two-step GAN-based DA can significantly outperform the classic DA alone, in tumor detection (i.e., boosting sensitivity 93.67% to 97.48%) and also in other medical imaging tasks.
AB - Convolutional Neural Networks (CNNs) achieve excellent computer-assisted diagnosis with sufficient annotated training data. However, most medical imaging datasets are small and fragmented. In this context, Generative Adversarial Networks (GANs) can synthesize realistic/diverse additional training images to fill the data lack in the real image distribution; researchers have improved classification by augmenting data with noise-to-image (e.g., random noise samples to diverse pathological images) or image-to-image GANs (e.g., a benign image to a malignant one). Yet, no research has reported results combining noise-to-image and image-to-image GANs for further performance boost. Therefore, to maximize the DA effect with the GAN combinations, we propose a two-step GAN-based DA that generates and refines brain Magnetic Resonance (MR) images with/without tumors separately: (i) Progressive Growing of GANs (PGGANs), multi-stage noise-to-image GAN for high-resolution MR image generation, first generates realistic/diverse 256 × 256 images; (ii) Multimodal UNsupervised Image-to-image Translation (MUNIT) that combines GANs/Variational AutoEncoders or SimGAN that uses a DA-focused GAN loss, further refines the texture/shape of the PGGAN-generated images similarly to the real ones. We thoroughly investigate CNN-based tumor classification results, also considering the influence of pre-training on ImageNet and discarding weird-looking GAN-generated images. The results show that, when combined with classic DA, our two-step GAN-based DA can significantly outperform the classic DA alone, in tumor detection (i.e., boosting sensitivity 93.67% to 97.48%) and also in other medical imaging tasks.
UR - http://www.scopus.com/inward/record.url?scp=85074646719&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85074646719&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2019.2947606
DO - 10.1109/ACCESS.2019.2947606
M3 - Article
AN - SCOPUS:85074646719
SN - 2169-3536
VL - 7
SP - 156966
EP - 156977
JO - IEEE Access
JF - IEEE Access
M1 - 8869751
ER -