Bayesian Networks for Preprocessing Water Management Data
Identifiers
ISSN: 2227-7390
DOI: 10.3390/math10101777
Date
2022-05-23
Abstract
Environmental data often present inconveniences that make modeling tasks difficult. During the phase of data collection, two problems were found: (i) a block of five months of data was unavailable, and (ii) no information was collected from the coastal area, which made flood-risk estimation difficult. Thus, our aim is to explore and provide possible solutions to both issues. To avoid removing a variable (or those missing months), the proposed solution is a BN-based regression model using fixed probabilistic graphical structures to impute the missing variable as accurately as possible. For the second problem, the lack of information, an unsupervised classification method based on BN was developed to predict flood risk in the coastal area. Results showed that the proposed regression solution could predict the behavior of the continuous missing variable, avoiding the initial drawback of rejecting it. Moreover, the unsupervised classifier could classify all observations into a set of group...
Keywords
Bayesian networks
missing values
lack of information
regression
unsupervised classification