TY - JOUR
T1 - Categorical modeling of the flow pattern of liquid organic compounds between blade electrodes using semiempirical and ab initio quantum chemical descriptors
AU - Suzuki, Takahiro
AU - Yoshida, Kohei
AU - Onizuka, Hiroya
AU - Iwai, Yoshio
AU - Arai, Yasuhiko
AU - Aptula, Aynur
AU - Kühne, Ralph
AU - Ebert, Ralf Uwe
AU - Schüürmann, Gerrit
PY - 2004/5/1
Y1 - 2004/5/1
N2 - For a data set of 30 organic fluids, categorical modeling has been employed to predict the flow pattern under an external electric field. To this end, a previously generated data set was augmented by 10 compounds with new experimental results, and quantum chemical methods have been used to characterize the geometric and electronic structure of the molecules on both the semiempirical and ab initio levels of theory. Both linear discriminant analysis (LDA) and binary logistic regression (BLR) have been employed to model the flow rate (high vs. low) and flow direction (left vs. right). For the flow rate, good LDA and BLR calibration statistics using the dipole moment, hydrophobicity and some charged partial surface area (CPSA) descriptors is accompanied with moderate prediction statistics, as evaluated through simulated external validation, and activity scrambling shows that chance correlation is not relevant. Additional neural network analyses yielded no stable models due to constraints imposed by the data set size. For the flow direction, LDA and BLR calibration and prediction statistics show more variation among the different models generated, with an overall performance inferior to the one for the flow rate. Here, besides CPSA descriptors, two parameters characterizing the softness of the electronic structure are involved. In general, BLR is slightly superior to LDA for both properties. The results are discussed in terms of contingency table statistics and with respect to the mechanistic meaning of molecular descriptors.
AB - For a data set of 30 organic fluids, categorical modeling has been employed to predict the flow pattern under an external electric field. To this end, a previously generated data set was augmented by 10 compounds with new experimental results, and quantum chemical methods have been used to characterize the geometric and electronic structure of the molecules on both the semiempirical and ab initio levels of theory. Both linear discriminant analysis (LDA) and binary logistic regression (BLR) have been employed to model the flow rate (high vs. low) and flow direction (left vs. right). For the flow rate, good LDA and BLR calibration statistics using the dipole moment, hydrophobicity and some charged partial surface area (CPSA) descriptors is accompanied with moderate prediction statistics, as evaluated through simulated external validation, and activity scrambling shows that chance correlation is not relevant. Additional neural network analyses yielded no stable models due to constraints imposed by the data set size. For the flow direction, LDA and BLR calibration and prediction statistics show more variation among the different models generated, with an overall performance inferior to the one for the flow rate. Here, besides CPSA descriptors, two parameters characterizing the softness of the electronic structure are involved. In general, BLR is slightly superior to LDA for both properties. The results are discussed in terms of contingency table statistics and with respect to the mechanistic meaning of molecular descriptors.
UR - http://www.scopus.com/inward/record.url?scp=3042741416&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=3042741416&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:3042741416
SN - 0011-1643
VL - 77
SP - 377
EP - 389
JO - Croatica Chemica Acta
JF - Croatica Chemica Acta
IS - 1-2
ER -