Effect of model formulation on the optimization of a genetic Takagi-Sugeno fuzzy system for fish habitat suitability evaluation

Shinji Fukuda, Bernard De Baets, Ans M. Mouton, Willem Waegeman, Jun Nakajima, Takahiko Mukai, Kazuaki Hiramatsu, Norio Onikura

Research output: Contribution to journal › Article

38 Citations (Scopus)

Abstract

Species distribution models (SDMs), which evaluate species-environment relationships, are one of the key topics in ecology and biogeography. These models evaluate the current status of target ecosystems and potential impacts in both time and space. Although species distributions are often calculated based on the composite habitat suitability of several variables, there are no guidelines for calculating them. The present study assessed the effects of model formulation on habitat suitability evaluation and the accuracy of species distribution modelling. We employed a genetic algorithm (GA)-optimized fuzzy habitat preference model (FHPM) for evaluating habitat suitability of topmouth gudgeon (Pseudorasbora parva) in the northwestern part of Kyushu Island in Japan. Four operations were used to calculate the composite habitat suitability from multiple habitat variables: arithmetic mean, geometric mean, product and minimum. To transform model outputs to presence/absence, four threshold criteria were compared based on model accuracy: prevalence, conventional 0.5, minimization of the sensitivity-specificity difference threshold (MDT), and maximization of the sensitivity-specificity sum threshold (MST). The models were first calibrated and validated based on the mean squared error (MSE) between composite habitat suitability and the observed presence-absence of the fish, and then evaluated using confusion matrix-derived measures such as the area under the receiver operating characteristic (ROC) curve (AUC), correctly classified instances (CCI), kappa and true skill statistic (TSS). The results clearly illustrated the effects of model formulation and threshold criteria on habitat suitability curves (HSCs) and accuracy in modelling species distributions. The use of the product model formulation led to the best accuracy in terms of MSE and AUC, and consistency in the shape of HSCs. The two threshold criteria of MST and MDT are also recommended for their consistently higher performance in terms of CCI, kappa and TSS. This case study of topmouth gudgeon illustrates the need for further studies on the model behaviour with regard to data characteristics (i.e., sample size and prevalence) and model structure (i.e., fuzzy sets and parameter settings of the GA).
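
The four aggregation operations, the four threshold criteria, and the confusion matrix-derived measures named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' FHPM or GA: the function names, the sample suitability values and observations, the "score >= threshold means presence" convention, the use of observed scores as candidate thresholds, and the reading of kappa as Cohen's kappa are all assumptions made for illustration. AUC is left out to keep the sketch short.

```python
import math


def composite_suitability(si, method):
    """Combine per-variable suitability indices (each in [0, 1]) into one score."""
    if method == "arithmetic":
        return sum(si) / len(si)
    if method == "geometric":
        return math.prod(si) ** (1.0 / len(si))
    if method == "product":
        return math.prod(si)
    if method == "minimum":
        return min(si)
    raise ValueError(f"unknown method: {method}")


def mse(scores, observed):
    """Mean squared error between composite suitability and 0/1 observations."""
    return sum((s - y) ** 2 for s, y in zip(scores, observed)) / len(scores)


def confusion(scores, observed, threshold):
    """Return (tp, fp, fn, tn), predicting presence when score >= threshold."""
    tp = fp = fn = tn = 0
    for s, y in zip(scores, observed):
        predicted = s >= threshold
        if predicted and y:
            tp += 1
        elif predicted:
            fp += 1
        elif y:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def measures(tp, fp, fn, tn):
    """CCI, Cohen's kappa and TSS; assumes both classes occur in the data."""
    n = tp + fp + fn + tn
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    cci = (tp + tn) / n
    # Agreement expected by chance, from the row and column totals.
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    kappa = (cci - expected) / (1 - expected) if expected < 1 else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "CCI": cci,
        "kappa": kappa,
        "TSS": sensitivity + specificity - 1,
    }


def pick_threshold(scores, observed, criterion):
    """Choose a presence/absence threshold by one of the four criteria."""
    if criterion == "fixed":
        return 0.5
    if criterion == "prevalence":
        return sum(observed) / len(observed)
    if criterion not in ("MDT", "MST"):
        raise ValueError(f"unknown criterion: {criterion}")

    def cost(t):
        m = measures(*confusion(scores, observed, t))
        if criterion == "MDT":  # minimize |sensitivity - specificity|
            return abs(m["sensitivity"] - m["specificity"])
        return -(m["sensitivity"] + m["specificity"])  # MST: maximize the sum

    # Assumption: only the observed scores are tried as candidate thresholds.
    return min(sorted(set(scores)), key=cost)


# Hypothetical data: three suitability indices at one site, then six sites.
site = [0.8, 0.5, 0.2]
for method in ("arithmetic", "geometric", "product", "minimum"):
    print(method, round(composite_suitability(site, method), 3))

scores = [0.9, 0.7, 0.6, 0.4, 0.3, 0.1]  # composite suitability per site
observed = [1, 1, 0, 1, 0, 0]            # observed presence (1) / absence (0)
print("MSE", round(mse(scores, observed), 3))
for criterion in ("prevalence", "fixed", "MDT", "MST"):
    t = pick_threshold(scores, observed, criterion)
    print(criterion, t, measures(*confusion(scores, observed, t)))
```

For the same inputs the product never exceeds the minimum, which never exceeds the geometric mean, which never exceeds the arithmetic mean, so the choice of operation changes how strongly a single poor habitat variable pulls down the composite score.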

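Original language: English
Pages (from-to): 1401-1413
Number of pages: 13
Journal: Ecological Modelling
Volume: 222
Issue number: 8
DOI: 10.1016/j.ecolmodel.2011.01.023
Publication status: Published - Apr 24 2011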

Fingerprint

habitat
fish
genetic algorithm
effect
evaluation
biogeography
habitat selection
modeling
transform
ecology
matrix
distribution
ecosystem

All Science Journal Classification (ASJC) codes

  • Ecological Modelling

Cite this

Effect of model formulation on the optimization of a genetic Takagi-Sugeno fuzzy system for fish habitat suitability evaluation. / Fukuda, Shinji; De Baets, Bernard; Mouton, Ans M.; Waegeman, Willem; Nakajima, Jun; Mukai, Takahiko; Hiramatsu, Kazuaki; Onikura, Norio.

In: Ecological Modelling, Vol. 222, No. 8, 24.04.2011, p. 1401-1413.

Fukuda, Shinji ; De Baets, Bernard ; Mouton, Ans M. ; Waegeman, Willem ; Nakajima, Jun ; Mukai, Takahiko ; Hiramatsu, Kazuaki ; Onikura, Norio. / Effect of model formulation on the optimization of a genetic Takagi-Sugeno fuzzy system for fish habitat suitability evaluation. In: Ecological Modelling. 2011 ; Vol. 222, No. 8. pp. 1401-1413.
@article{d2aa7bd8debb47f583747fd63c96889c,
title = "Effect of model formulation on the optimization of a genetic Takagi-Sugeno fuzzy system for fish habitat suitability evaluation",
author = "Shinji Fukuda and {De Baets}, Bernard and Mouton, {Ans M.} and Willem Waegeman and Jun Nakajima and Takahiko Mukai and Kazuaki Hiramatsu and Norio Onikura",
year = "2011",
month = "4",
day = "24",
doi = "10.1016/j.ecolmodel.2011.01.023",
language = "English",
volume = "222",
pages = "1401--1413",
journal = "Ecological Modelling",
issn = "0304-3800",
publisher = "Elsevier",
number = "8",

}

TY - JOUR

T1 - Effect of model formulation on the optimization of a genetic Takagi-Sugeno fuzzy system for fish habitat suitability evaluation

AU - Fukuda, Shinji

AU - De Baets, Bernard

AU - Mouton, Ans M.

AU - Waegeman, Willem

AU - Nakajima, Jun

AU - Mukai, Takahiko

AU - Hiramatsu, Kazuaki

AU - Onikura, Norio

PY - 2011/4/24

Y1 - 2011/4/24

UR - http://www.scopus.com/inward/record.url?scp=79953024365&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79953024365&partnerID=8YFLogxK

U2 - 10.1016/j.ecolmodel.2011.01.023

DO - 10.1016/j.ecolmodel.2011.01.023

M3 - Article

VL - 222

SP - 1401

EP - 1413

JO - Ecological Modelling

JF - Ecological Modelling

SN - 0304-3800

IS - 8

ER -