TY - JOUR
T1 - Intrinsic regularization effect in Bayesian nonlinear regression scaled by observed data
AU - Tokuda, Satoru
AU - Nagata, Kenji
AU - Okada, Masato
N1 - Funding Information:
The authors are grateful to Chihiro H. Nakajima, Koji Hukushima, Kouki Yonaga, Masayuki Ohzeki, Shotaro Akaho, Sumio Watanabe, Tomoyuki Obuchi, and Yoshiyuki Kabashima for valuable discussions. S.T. was supported by JSPS KAKENHI (No. JP20K19889). M.O. was supported by JST CREST (No. JPMJCR1761), JSPS KAKENHI (No. JP25120009), the “Materials Research by Information Integration” Initiative (MI2I) project of the Support Program for Starting Up Innovation Hub from the Japan Science and Technology Agency (JST), and the Council for Science, Technology and Innovation (CSTI), Cross-ministerial Strategic Innovation Promotion Program (SIP), “Structural Materials for Innovation” (Funding agency: JST).
Publisher Copyright:
© 2022 authors. Published by the American Physical Society. Published by the American Physical Society under the terms of the Creative Commons Attribution 4.0 International license. Further distribution of this work must maintain attribution to the author(s) and the published article's title, journal citation, and DOI.
PY - 2022/10
Y1 - 2022/10
N2 - Occam's razor is a guiding principle that models should be simple enough to describe observed data. While Bayesian model selection (BMS) embodies it by the intrinsic regularization effect (IRE), how observed data scale the IRE has not been fully understood. In the nonlinear regression with conditionally independent observations, we show that the IRE is scaled by observations' fineness, defined by the amount and quality of observed data. We introduce an observable that quantifies the IRE, referred to as the Bayes specific heat, inspired by the correspondence between statistical inference and statistical physics. We derive its scaling relation to observations' fineness. We demonstrate that the optimal model chosen by the BMS changes at critical values of observations' fineness, accompanying the IRE's variation. The changes are from choosing a coarse-grained model to a fine-grained one as observations' fineness increases. Our findings expand an understanding of BMS's typicality when observed data are insufficient.
AB - Occam's razor is a guiding principle that models should be simple enough to describe observed data. While Bayesian model selection (BMS) embodies it by the intrinsic regularization effect (IRE), how observed data scale the IRE has not been fully understood. In the nonlinear regression with conditionally independent observations, we show that the IRE is scaled by observations' fineness, defined by the amount and quality of observed data. We introduce an observable that quantifies the IRE, referred to as the Bayes specific heat, inspired by the correspondence between statistical inference and statistical physics. We derive its scaling relation to observations' fineness. We demonstrate that the optimal model chosen by the BMS changes at critical values of observations' fineness, accompanying the IRE's variation. The changes are from choosing a coarse-grained model to a fine-grained one as observations' fineness increases. Our findings expand an understanding of BMS's typicality when observed data are insufficient.
UR - http://www.scopus.com/inward/record.url?scp=85144625426&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85144625426&partnerID=8YFLogxK
U2 - 10.1103/PhysRevResearch.4.043165
DO - 10.1103/PhysRevResearch.4.043165
M3 - Article
AN - SCOPUS:85144625426
SN - 2643-1564
VL - 4
JO - Physical Review Research
JF - Physical Review Research
IS - 4
M1 - 043165
ER -