TY - JOUR
T1 - Sleeping Contextual/Non-Contextual Thompson Sampling MAB for mmWave D2D Two-Hop Relay Probing
AU - Mohamed, Ehab Mahmoud
AU - Hashima, Sherief
AU - Hatano, Kohei
AU - Fouda, Mostafa M.
AU - Fadlullah, Zubair Md
N1 - Publisher Copyright: © 2021 Institute of Electrical and Electronics Engineers Inc. All rights reserved.
PY - 2021/11/1
Y1 - 2021/11/1
N2 - The millimeter wave (mmWave) band, i.e., 30 to 300 GHz, is characterized by short-range transmissions and vulnerability to path blockage, necessitating the use of relaying. Probing more relays finds the best relay, i.e., the one with the highest spectral efficiency, but at the expense of increased probing overhead due to excessive beamforming training (BT), which decreases the overall throughput. In this paper, mmWave two-hop relaying will be formulated as a single-player multi-armed bandit (MAB) problem, enabling the probing of a single relay while maximizing the achievable spectral efficiency. Moreover, relays that cannot establish the mmWave link, due to blockage for instance, will be identified as sleeping relays and eliminated from the rest of the MAB game. Thus, a sleeping non-contextual MAB (S-MAB) algorithm, namely sleeping Thompson sampling (S-TS), will be proposed to handle the problem. Furthermore, by utilizing the multiband capability of standardized WiGig devices containing both 2.4/5 GHz WiFi and 60 GHz mmWave bands, WiFi information will be used as the contexts of the MAB game. Therefore, a sleeping contextual MAB (S-CMAB) algorithm, namely S-CTS, will be proposed as well. Numerical and regret analyses confirm the superior performance of the S-CMAB algorithm over its S-MAB counterpart and the existing mmWave relay probing solutions, along with high convergence rates.
AB - The millimeter wave (mmWave) band, i.e., 30 to 300 GHz, is characterized by short-range transmissions and vulnerability to path blockage, necessitating the use of relaying. Probing more relays finds the best relay, i.e., the one with the highest spectral efficiency, but at the expense of increased probing overhead due to excessive beamforming training (BT), which decreases the overall throughput. In this paper, mmWave two-hop relaying will be formulated as a single-player multi-armed bandit (MAB) problem, enabling the probing of a single relay while maximizing the achievable spectral efficiency. Moreover, relays that cannot establish the mmWave link, due to blockage for instance, will be identified as sleeping relays and eliminated from the rest of the MAB game. Thus, a sleeping non-contextual MAB (S-MAB) algorithm, namely sleeping Thompson sampling (S-TS), will be proposed to handle the problem. Furthermore, by utilizing the multiband capability of standardized WiGig devices containing both 2.4/5 GHz WiFi and 60 GHz mmWave bands, WiFi information will be used as the contexts of the MAB game. Therefore, a sleeping contextual MAB (S-CMAB) algorithm, namely S-CTS, will be proposed as well. Numerical and regret analyses confirm the superior performance of the S-CMAB algorithm over its S-MAB counterpart and the existing mmWave relay probing solutions, along with high convergence rates.
UR - http://www.scopus.com/inward/record.url?scp=85119692329&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85119692329&partnerID=8YFLogxK
U2 - 10.1109/TVT.2021.3116223
DO - 10.1109/TVT.2021.3116223
M3 - Article
AN - SCOPUS:85119692329
VL - 70
SP - 12101
EP - 12112
JO - IEEE Transactions on Vehicular Technology
JF - IEEE Transactions on Vehicular Technology
SN - 0018-9545
IS - 11
ER -