### Abstract

For a data set of 30 organic fluids, categorical modeling has been employed to predict the flow pattern under an external electric field. To this end, a previously generated data set was augmented by 10 compounds with new experimental results, and quantum chemical methods have been used to characterize the geometric and electronic structure of the molecules on both the semiempirical and ab initio levels of theory. Both linear discriminant analysis (LDA) and binary logistic regression (BLR) have been employed to model the flow rate (high vs. low) and flow direction (left vs. right). For the flow rate, good LDA and BLR calibration statistics using the dipole moment, hydrophobicity and some charged partial surface area (CPSA) descriptors is accompanied with moderate prediction statistics, as evaluated through simulated external validation, and activity scrambling shows that chance correlation is not relevant. Additional neural network analyses yielded no stable models due to constraints imposed by the data set size. For the flow direction, LDA and BLR calibration and prediction statistics show more variation among the different models generated, with an overall performance inferior to the one for the flow rate. Here, besides CPSA descriptors, two parameters characterizing the softness of the electronic structure are involved. In general, BLR is slightly superior to LDA for both properties. The results are discussed in terms of contingency table statistics and with respect to the mechanistic meaning of molecular descriptors.

Original language | English |
---|---|

Pages (from-to) | 377-389 |

Number of pages | 13 |

Journal | Croatica Chemica Acta |

Volume | 77 |

Issue number | 1-2 |

Publication status | Published - May 1 2004 |

### Fingerprint

### All Science Journal Classification (ASJC) codes

- Chemistry(all)

### Cite this

*Croatica Chemica Acta*,

*77*(1-2), 377-389.

**Categorical modeling of the flow pattern of liquid organic compounds between blade electrodes using semiempirical and ab initio quantum chemical descriptors.** / Suzuki, Takahiro; Yoshida, Kohei; Onizuka, Hiroya; Iwai, Yoshio; Arai, Yasuhiko; Aptula, Aynur; Kühne, Ralph; Ebert, Ralf Uwe; Schüürmann, Gerrit.

Research output: Contribution to journal › Article

*Croatica Chemica Acta*, vol. 77, no. 1-2, pp. 377-389.

}

TY - JOUR

T1 - Categorical modeling of the flow pattern of liquid organic compounds between blade electrodes using semiempirical and ab initio quantum chemical descriptors

AU - Suzuki, Takahiro

AU - Yoshida, Kohei

AU - Onizuka, Hiroya

AU - Iwai, Yoshio

AU - Arai, Yasuhiko

AU - Aptula, Aynur

AU - Kühne, Ralph

AU - Ebert, Ralf Uwe

AU - Schüürmann, Gerrit

PY - 2004/5/1

Y1 - 2004/5/1

N2 - For a data set of 30 organic fluids, categorical modeling has been employed to predict the flow pattern under an external electric field. To this end, a previously generated data set was augmented by 10 compounds with new experimental results, and quantum chemical methods have been used to characterize the geometric and electronic structure of the molecules on both the semiempirical and ab initio levels of theory. Both linear discriminant analysis (LDA) and binary logistic regression (BLR) have been employed to model the flow rate (high vs. low) and flow direction (left vs. right). For the flow rate, good LDA and BLR calibration statistics using the dipole moment, hydrophobicity and some charged partial surface area (CPSA) descriptors is accompanied with moderate prediction statistics, as evaluated through simulated external validation, and activity scrambling shows that chance correlation is not relevant. Additional neural network analyses yielded no stable models due to constraints imposed by the data set size. For the flow direction, LDA and BLR calibration and prediction statistics show more variation among the different models generated, with an overall performance inferior to the one for the flow rate. Here, besides CPSA descriptors, two parameters characterizing the softness of the electronic structure are involved. In general, BLR is slightly superior to LDA for both properties. The results are discussed in terms of contingency table statistics and with respect to the mechanistic meaning of molecular descriptors.

AB - For a data set of 30 organic fluids, categorical modeling has been employed to predict the flow pattern under an external electric field. To this end, a previously generated data set was augmented by 10 compounds with new experimental results, and quantum chemical methods have been used to characterize the geometric and electronic structure of the molecules on both the semiempirical and ab initio levels of theory. Both linear discriminant analysis (LDA) and binary logistic regression (BLR) have been employed to model the flow rate (high vs. low) and flow direction (left vs. right). For the flow rate, good LDA and BLR calibration statistics using the dipole moment, hydrophobicity and some charged partial surface area (CPSA) descriptors is accompanied with moderate prediction statistics, as evaluated through simulated external validation, and activity scrambling shows that chance correlation is not relevant. Additional neural network analyses yielded no stable models due to constraints imposed by the data set size. For the flow direction, LDA and BLR calibration and prediction statistics show more variation among the different models generated, with an overall performance inferior to the one for the flow rate. Here, besides CPSA descriptors, two parameters characterizing the softness of the electronic structure are involved. In general, BLR is slightly superior to LDA for both properties. The results are discussed in terms of contingency table statistics and with respect to the mechanistic meaning of molecular descriptors.

UR - http://www.scopus.com/inward/record.url?scp=3042741416&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=3042741416&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:3042741416

VL - 77

SP - 377

EP - 389

JO - Croatica Chemica Acta

JF - Croatica Chemica Acta

SN - 0011-1643

IS - 1-2

ER -